|
|
|
|
HPM Counter Statistics | ||||
---|---|---|---|---|
Event | Ntasks | Avg | Min(rank) | Max(rank) |
PAPI_FP_OPS | * | 248620817.65 | 224206539 (495) | 250691642 (999) |
PAPI_L1_DCA | * | 414604045.28 | 356684934 (0) | 572932558 (998) |
PAPI_L1_DCM | * | 1928781.52 | 1799101 (591) | 3243505 (999) |
PAPI_TOT_INS | * | 997444527.49 | 860642466 (0) | 1581176562 (998) |
Communication Event Statistics (100.00% detail, -7.1667e-06 error) | |||||||
---|---|---|---|---|---|---|---|
Buffer Size | Ncalls | Total Time | Min Time | Max Time | %MPI | %Wall | |
MPI_Send | 32768 | 72000 | 21.096 | 1.502e-05 | 5.714e-02 | 40.07 | 1.49 |
MPI_Waitsome | 0 | 246512 | 12.641 | 0.000e+00 | 1.860e-02 | 24.01 | 0.89 |
MPI_Allreduce | 16 | 3000 | 5.946 | 2.098e-05 | 1.563e-02 | 11.29 | 0.42 |
MPI_Send | 8192 | 54000 | 5.495 | 5.007e-06 | 5.254e-03 | 10.44 | 0.39 |
MPI_Allreduce | 8 | 9000 | 1.336 | 2.313e-05 | 1.672e-03 | 2.54 | 0.09 |
MPI_Send | 8 | 112736 | 1.292 | 0.000e+00 | 6.600e-03 | 2.45 | 0.09 |
MPI_Send | 512 | 81360 | 1.070 | 0.000e+00 | 7.114e-03 | 2.03 | 0.08 |
MPI_Allgather | 8 | 1000 | 0.951 | 8.140e-04 | 1.390e-03 | 1.81 | 0.07 |
MPI_Barrier | 0 | 1000 | 0.707 | 8.111e-04 | 1.053e-03 | 1.34 | 0.05 |
MPI_Reduce | 8 | 2000 | 0.585 | 1.597e-05 | 2.073e-02 | 1.11 | 0.04 |
MPI_Irecv | 8 | 112736 | 0.295 | 0.000e+00 | 4.478e-03 | 0.56 | 0.02 |
MPI_Send | 2048 | 54000 | 0.247 | 0.000e+00 | 3.521e-04 | 0.47 | 0.02 |
MPI_Send | 128 | 74520 | 0.197 | 0.000e+00 | 1.969e-04 | 0.37 | 0.01 |
Load balance by task: HPM counters |
---|
by MPI rank, by MPI time |
Load balance by task: memory, flops, timings |
by MPI rank, by MPI time |
Communication balance by task (sorted by MPI time) |
by MPI rank , time detail by MPI time , time detail by rank , call list |
Message Buffer Size Distributions: time |
|
Message Buffer Size Distributions: Ncalls |
|
Communication Topology : point to point data flow |
|
Switch Traffic (volume by node) |
|
Memory usage by node |
|