************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- /work/00131/fuentes/exec/dddas_barcelona-cxx on a barcelona named i149-205.ranger.tacc.utexas.edu with 48 processors, by fuentes Sun Nov 28 09:17:53 2010 Using Petsc Release Version 3.0.0, Patch 11, Mon Feb 1 11:01:51 CST 2010 Max Max/Min Avg Total Time (sec): 4.901e+02 1.00001 4.901e+02 Objects: 2.723e+04 1.00000 2.723e+04 Flops: 2.591e+09 2.23681 1.935e+09 9.290e+10 Flops/sec: 5.288e+06 2.23678 3.949e+06 1.896e+08 MPI Messages: 3.654e+05 3.08536 2.296e+05 1.102e+07 MPI Message Lengths: 9.931e+08 3.57477 1.777e+03 1.959e+10 MPI Reductions: 6.293e+04 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 4.0935e+01 8.4% 8.0866e+09 8.7% 1.551e+06 14.1% 6.698e+02 37.7% 1.585e+03 2.5% 1: Initialization: 1.9145e+01 3.9% 0.0000e+00 0.0% 6.862e+03 0.1% 2.914e+00 0.2% 1.900e+01 0.0% 3: function evaluation: 1.9929e+02 40.7% 6.1252e+10 65.9% 6.864e+06 62.3% 5.512e+02 31.0% 3.906e+04 62.1% 4: gradient evaluation: 2.3068e+02 47.1% 2.3558e+10 25.4% 2.599e+06 23.6% 5.535e+02 31.1% 1.596e+04 25.4% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %F - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage --- Event Stage 1: Initialization --- Event Stage 2: Unknown --- Event Stage 3: function evaluation --- Event Stage 4: gradient evaluation --- Event Stage 5: Unknown --- Event Stage 6: Unknown --- Event Stage 7: Unknown ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. --- Event Stage 0: Main Stage TAO Solver 1 2 3864 0 Vec 26 28 36928 0 Vec Scatter 228 229 179444 0 Index Set 228 228 51341112 0 Matrix 3 0 0 0 TAO Application 1 2 25512 0 Krylov Solver 1 1 1048 0 Preconditioner 1 1 640 0 Viewer 1 0 0 0 --- Event Stage 1: Initialization Vec 4489 2236 30695808 0 Vec Scatter 2 1 788 0 Index Set 3 3 229204 0 Matrix 9 0 0 0 --- Event Stage 2: Unknown --- Event Stage 3: function evaluation Vec 13766 13765 177864152 0 Vec Scatter 1777 1776 1399488 0 Index Set 3110 3110 24829940 0 Matrix 444 444 196477104 0 Krylov Solver 888 888 8382720 0 Preconditioner 888 888 625152 0 SNES 444 444 458208 0 --- Event Stage 4: gradient evaluation Vec 20 1 1304 0 Vec Scatter 445 444 349872 0 Index Set 449 446 3970820 0 Matrix 1 0 0 0 Krylov Solver 2 0 0 0 Preconditioner 2 0 0 0 --- Event Stage 5: Unknown --- Event Stage 6: Unknown --- Event Stage 7: Unknown ======================================================================================================================== Average time to get PetscTime(): 5.96046e-07 Average time for MPI_Barrier(): 1.57833e-05 Average time for zero size MPI_Send(): 4.83294e-06 #PETSc Option Table entries: -info -info_exclude null,vec,mat,pc,ksp,snes -ksp_rtol 1.e-9 -pc_type bjacobi -snes_converged_reason -snes_ls basic -snes_monitor -snes_rtol 1.e-6 -tao_max_funcs 0 #End o PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 Configure run at: Tue Mar 23 14:13:09 2010 Configure options: --with-x=0 -with-pic --with-blas-lib="[/opt/apps/intel/mkl/10.0.1.014/lib/em64t/libmkl_em64t.a,libmkl.a,libguide.a,libpthread.a]" --with-lapack-lib="[/opt/apps/intel/mkl/10.0.1.014/lib/em64t/libmkl_em64t.a,libmkl.a,libguide.a,libpthread.a]" --with-external-packages-dir=/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/externalpackages --with-mpi-compilers=1 --with-mpi-dir=/opt/apps/intel10_1/mvapich/1.0.1 --with-clanguage=C++ --with-scalar-type=real --with-dynamic=0 --with-shared=0 --with-spai=1 --download-spai=1 --with-parmetis=1 --download-parmetis=yes --with-hdf5=1 --with-hdf5-dir=/opt/apps/intel10_1/mvapich1_1_0_1/phdf5/1.8.2 --with-hypre=1 --download-hypre=1 --with-plapack=1 --download-plapack=1 --with-ml=1 --download-ml=yes --with-mumps=1 --download-mumps=/share/home/0000/build/rpms/SOURCES/MUMPS_4.9.tar.gz --with-scalapack=1 --download-scalapack=yes --with-blacs=1 --download-blacs=yes --with-spooles=1 --download-spooles=1 --with-superlu=1 --download-superlu=yes --with-superlu_dist=1 --download-superlu_dist=yes --with-parmetis=1 --download-parmetis=yes --with-debugging=no --COPTFLAGS=-xW --CXXOPTFLAGS=-xW --FOPTFLAGS=-xW ----------------------------------------- Libraries compiled on Tue Mar 23 14:13:09 CDT 2010 on build.ranger.tacc.utexas.edu Machine characteristics: Linux build.ranger.tacc.utexas.edu 2.6.18.8.TACC.lustre.perfctr #9 SMP Mon Oct 19 22:06:10 CDT 2009 x86_64 x86_64 x86_64 GNU/Linux Using PETSc directory: /opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0 Using PETSc arch: barcelona-cxx ----------------------------------------- Using C compiler: /opt/apps/intel10_1/mvapich/1.0.1/bin/mpicxx -xW -fPIC -xW Using Fortran compiler: /opt/apps/intel10_1/mvapich/1.0.1/bin/mpif90 -fPIC -xW ----------------------------------------- Using include paths: -I/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/include -I/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/include -I/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/include -I/opt/apps/intel10_1/mvapich1_1_0_1/phdf5/1.8.2/include -I/opt/apps/intel10_1/mvapich/1.0.1/include ------------------------------------------ Using C linker: /opt/apps/intel10_1/mvapich/1.0.1/bin/mpicxx -xW Using Fortran linker: /opt/apps/intel10_1/mvapich/1.0.1/bin/mpif90 -fPIC -xW Using libraries: -Wl,-rpath,/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/lib -L/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/lib -lpetscts -lpetscsnes -lpetscksp -lpetscdm -lpetscmat -lpetscvec -lpetsc -Wl,-rpath,/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/lib -L/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/lib -lsuperlu_dist_2.3 -lcmumps -ldmumps -lsmumps -lzmumps -lmumps_common -lpord -lparmetis -lmetis -lscalapack -lblacs -lHYPRE -lspai -lspooles -lsuperlu_3.1 -lPLAPACK -lml -Wl,-rpath,/opt/apps/intel10_1/mvapich1_1_0_1/phdf5/1.8.2/lib -L/opt/apps/intel10_1/mvapich1_1_0_1/phdf5/1.8.2/lib -lhdf5 -lz -Wl,-rpath,/opt/apps/intel/mkl/10.0.1.014/lib/em64t -L/opt/apps/intel/mkl/10.0.1.014/lib/em64t -lmkl_em64t -lmkl -lguide -lpthread -lmkl_em64t -lmkl -lguide -lpthread -lPEPCF90 -Wl,-rpath,/opt/apps/intel10_1/mvapich/1.0.1/lib/shared -L/opt/apps/intel10_1/mvapich/1.0.1/lib/shared -Wl,-rpath,/opt/apps/intel10_1/mvapich/1.0.1/lib -L/opt/apps/intel10_1/mvapich/1.0.1/lib -ldl -lmpich -Wl,-rpath,/opt/ofed/lib64 -L/opt/ofed/lib64 -libverbs -libumad -lpthread -lrt -Wl,-rpath,/opt/apps/intel/10.1/cc/lib -L/opt/apps/intel/10.1/cc/lib -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/3.4.6 -L/usr/lib/gcc/x86_64-redhat-linux/3.4.6 -limf -lsvml -lipgo -lintlc -lgcc_s -lirc_s -lmpichf90nc -lmpichfarg -Wl,-rpath,/opt/apps/intel/10.1/fc/lib -L/opt/apps/intel/10.1/fc/lib -lifport -lifcore -lm -lm -lpmpich++ -lstdc++ -lpmpich++ -lstdc++ -ldl -lmpich -libverbs -libumad -lpthread -lrt -limf -lsvml -lipgo -lintlc -lgcc_s -lirc_s -ldl ------------------------------------------