************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- /work/00131/fuentes/exec/dddas_barcelona-cxx on a barcelona named i111-408.ranger.tacc.utexas.edu with 16 processors, by fuentes Sat Nov 27 22:58:09 2010 Using Petsc Release Version 3.0.0, Patch 11, Mon Feb 1 11:01:51 CST 2010 Max Max/Min Avg Total Time (sec): 1.130e+03 1.00000 1.130e+03 Objects: 4.480e+04 1.00000 4.480e+04 Flops: 6.079e+09 1.47208 5.009e+09 8.015e+10 Flops/sec: 5.379e+06 1.47208 4.433e+06 7.093e+07 MPI Messages: 3.900e+05 2.06561 2.836e+05 4.538e+06 MPI Message Lengths: 7.778e+08 2.01457 1.713e+03 7.773e+09 MPI Reductions: 1.050e+05 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 7.2309e+01 6.4% 2.3087e+09 2.9% 2.804e+05 6.2% 4.337e+02 25.3% 2.593e+03 2.5% 1: Initialization: 1.1579e+01 1.0% 0.0000e+00 0.0% 7.500e+02 0.0% 1.173e+00 0.1% 1.900e+01 0.0% 3: function evaluation: 4.8136e+02 42.6% 5.6233e+10 70.2% 3.079e+06 67.9% 5.883e+02 34.3% 6.545e+04 62.3% 4: gradient evaluation: 5.6475e+02 50.0% 2.1606e+10 27.0% 1.178e+06 26.0% 6.896e+02 40.3% 2.665e+04 25.4% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %F - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage --- Event Stage 1: Initialization --- Event Stage 2: Unknown --- Event Stage 3: function evaluation --- Event Stage 4: gradient evaluation --- Event Stage 5: Unknown --- Event Stage 6: Unknown --- Event Stage 7: Unknown ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. --- Event Stage 0: Main Stage TAO Solver 1 2 3864 0 Vec 26 28 36928 0 Vec Scatter 372 373 292916 0 Index Set 372 372 43832808 0 Matrix 3 0 0 0 TAO Application 1 2 25512 0 Krylov Solver 1 1 1048 0 Preconditioner 1 1 640 0 Viewer 1 0 0 0 --- Event Stage 1: Initialization Vec 7369 3676 69167616 0 Vec Scatter 2 1 788 0 Index Set 3 3 119812 0 Matrix 9 0 0 0 --- Event Stage 2: Unknown --- Event Stage 3: function evaluation Vec 22694 22693 401242712 0 Vec Scatter 2929 2928 2307264 0 Index Set 5126 5126 60998804 0 Matrix 732 732 467663088 0 Krylov Solver 1464 1464 13820160 0 Preconditioner 1464 1464 1030656 0 SNES 732 732 755424 0 --- Event Stage 4: gradient evaluation Vec 20 1 1304 0 Vec Scatter 733 732 576816 0 Index Set 737 734 10165796 0 Matrix 1 0 0 0 Krylov Solver 2 0 0 0 Preconditioner 2 0 0 0 --- Event Stage 5: Unknown --- Event Stage 6: Unknown --- Event Stage 7: Unknown ======================================================================================================================== Average time to get PetscTime(): 6.19888e-07 Average time for MPI_Barrier(): 6.00815e-06 Average time for zero size MPI_Send(): 5.49853e-06 #PETSc Option Table entries: -info -info_exclude null,vec,mat,pc,ksp,snes -ksp_rtol 1.e-9 -pc_type bjacobi -snes_converged_reason -snes_ls basic -snes_monitor -snes_rtol 1.e-6 -tao_max_funcs 0 #End o PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 Configure run at: Tue Mar 23 14:13:09 2010 Configure options: --with-x=0 -with-pic --with-blas-lib="[/opt/apps/intel/mkl/10.0.1.014/lib/em64t/libmkl_em64t.a,libmkl.a,libguide.a,libpthread.a]" --with-lapack-lib="[/opt/apps/intel/mkl/10.0.1.014/lib/em64t/libmkl_em64t.a,libmkl.a,libguide.a,libpthread.a]" --with-external-packages-dir=/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/externalpackages --with-mpi-compilers=1 --with-mpi-dir=/opt/apps/intel10_1/mvapich/1.0.1 --with-clanguage=C++ --with-scalar-type=real --with-dynamic=0 --with-shared=0 --with-spai=1 --download-spai=1 --with-parmetis=1 --download-parmetis=yes --with-hdf5=1 --with-hdf5-dir=/opt/apps/intel10_1/mvapich1_1_0_1/phdf5/1.8.2 --with-hypre=1 --download-hypre=1 --with-plapack=1 --download-plapack=1 --with-ml=1 --download-ml=yes --with-mumps=1 --download-mumps=/share/home/0000/build/rpms/SOURCES/MUMPS_4.9.tar.gz --with-scalapack=1 --download-scalapack=yes --with-blacs=1 --download-blacs=yes --with-spooles=1 --download-spooles=1 --with-superlu=1 --download-superlu=yes --with-superlu_dist=1 --download-superlu_dist=yes --with-parmetis=1 --download-parmetis=yes --with-debugging=no --COPTFLAGS=-xW --CXXOPTFLAGS=-xW --FOPTFLAGS=-xW ----------------------------------------- Libraries compiled on Tue Mar 23 14:13:09 CDT 2010 on build.ranger.tacc.utexas.edu Machine characteristics: Linux build.ranger.tacc.utexas.edu 2.6.18.8.TACC.lustre.perfctr #9 SMP Mon Oct 19 22:06:10 CDT 2009 x86_64 x86_64 x86_64 GNU/Linux Using PETSc directory: /opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0 Using PETSc arch: barcelona-cxx ----------------------------------------- Using C compiler: /opt/apps/intel10_1/mvapich/1.0.1/bin/mpicxx -xW -fPIC -xW Using Fortran compiler: /opt/apps/intel10_1/mvapich/1.0.1/bin/mpif90 -fPIC -xW ----------------------------------------- Using include paths: -I/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/include -I/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/include -I/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/include -I/opt/apps/intel10_1/mvapich1_1_0_1/phdf5/1.8.2/include -I/opt/apps/intel10_1/mvapich/1.0.1/include ------------------------------------------ Using C linker: /opt/apps/intel10_1/mvapich/1.0.1/bin/mpicxx -xW Using Fortran linker: /opt/apps/intel10_1/mvapich/1.0.1/bin/mpif90 -fPIC -xW Using libraries: -Wl,-rpath,/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/lib -L/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/lib -lpetscts -lpetscsnes -lpetscksp -lpetscdm -lpetscmat -lpetscvec -lpetsc -Wl,-rpath,/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/lib -L/opt/apps/intel10_1/mvapich1_1_0_1/petsc/3.0.0/barcelona-cxx/lib -lsuperlu_dist_2.3 -lcmumps -ldmumps -lsmumps -lzmumps -lmumps_common -lpord -lparmetis -lmetis -lscalapack -lblacs -lHYPRE -lspai -lspooles -lsuperlu_3.1 -lPLAPACK -lml -Wl,-rpath,/opt/apps/intel10_1/mvapich1_1_0_1/phdf5/1.8.2/lib -L/opt/apps/intel10_1/mvapich1_1_0_1/phdf5/1.8.2/lib -lhdf5 -lz -Wl,-rpath,/opt/apps/intel/mkl/10.0.1.014/lib/em64t -L/opt/apps/intel/mkl/10.0.1.014/lib/em64t -lmkl_em64t -lmkl -lguide -lpthread -lmkl_em64t -lmkl -lguide -lpthread -lPEPCF90 -Wl,-rpath,/opt/apps/intel10_1/mvapich/1.0.1/lib/shared -L/opt/apps/intel10_1/mvapich/1.0.1/lib/shared -Wl,-rpath,/opt/apps/intel10_1/mvapich/1.0.1/lib -L/opt/apps/intel10_1/mvapich/1.0.1/lib -ldl -lmpich -Wl,-rpath,/opt/ofed/lib64 -L/opt/ofed/lib64 -libverbs -libumad -lpthread -lrt -Wl,-rpath,/opt/apps/intel/10.1/cc/lib -L/opt/apps/intel/10.1/cc/lib -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/3.4.6 -L/usr/lib/gcc/x86_64-redhat-linux/3.4.6 -limf -lsvml -lipgo -lintlc -lgcc_s -lirc_s -lmpichf90nc -lmpichfarg -Wl,-rpath,/opt/apps/intel/10.1/fc/lib -L/opt/apps/intel/10.1/fc/lib -lifport -lifcore -lm -lm -lpmpich++ -lstdc++ -lpmpich++ -lstdc++ -ldl -lmpich -libverbs -libumad -lpthread -lrt -limf -lsvml -lipgo -lintlc -lgcc_s -lirc_s -ldl ------------------------------------------