|
|
Regions | |||||
---|---|---|---|---|---|
Label | Ntasks | <MPI sec> | <Wall sec> | %Wall | [gflop/sec] |
ipm_noregion | 8 | 0.0118 | 0.2282 | 54.91 | 0.0000e+00 |
hypre_BoomerAMGCycle | 8 | 0.0042 | 0.1205 | 29.00 | 0.0000e+00 |
|
|
HPM Counter Statistics | ||||
---|---|---|---|---|
Event | Ntasks | Avg | Min(rank) | Max(rank) |
Communication Event Statistics (0.00% detail, 3.3677e-02 error) | |||||||
---|---|---|---|---|---|---|---|
Buffer Size | Ncalls | Total Time | Min Time | Max Time | %MPI | %Wall | |
MPI_Waitall | 32768 | 368 | 0.016 | 8.822e-06 | 4.549e-04 | 46.84 | 1.63 |
MPI_Allreduce | 8 | 712 | 0.011 | 2.861e-06 | 2.859e-04 | 32.11 | 1.12 |
MPI_Iprobe | 0 | 64603 | 0.008 | 0.000e+00 | 4.580e-04 | 22.44 | 0.78 |
MPI_Testall | 0 | 46494 | 0.007 | 0.000e+00 | 2.098e-05 | 20.78 | 0.72 |
MPI_Waitall | 5120 | 208 | 0.002 | 1.192e-06 | 2.229e-04 | 6.66 | 0.23 |
MPI_Recv | 8 | 1000 | 0.004 | 0.000e+00 | 5.090e-04 | 13.08 | 0.46 |
MPI_Waitall | 10240 | 25 | 0.001 | 2.861e-06 | 8.321e-05 | 1.72 | 0.06 |
MPI_Allreduce | 4 | 408 | 0.004 | 2.146e-06 | 7.606e-05 | 11.55 | 0.40 |
MPI_Waitall | 16384 | 353 | 0.003 | 2.861e-06 | 1.650e-04 | 10.17 | 0.35 |
MPI_Isend | 8 | 1176 | 0.002 | 0.000e+00 | 2.871e-04 | 5.96 | 0.21 |
MPI_Waitall | 131072 | 13 | 0.003 | 1.159e-04 | 3.169e-04 | 8.08 | 0.28 |
MPI_Waitall | 2048 | 32 | 0.001 | 1.907e-06 | 2.160e-04 | 3.29 | 0.11 |
MPI_Waitall | 262144 | 8 | 0.002 | 1.569e-04 | 4.199e-04 | 7.39 | 0.26 |
MPI_Isend | 12288 | 1111 | 0.001 | 0.000e+00 | 1.407e-05 | 2.97 | 0.10 |
MPI_Waitall | 640 | 18 | 0.001 | 9.537e-07 | 2.620e-04 | 2.93 | 0.10 |
MPI_Isend | 1536 | 582 | 0.000 | 0.000e+00 | 6.914e-06 | 1.06 | 0.04 |
MPI_Isend | 6144 | 1114 | 0.002 | 0.000e+00 | 2.098e-05 | 5.03 | 0.18 |
MPI_Waitall | 229376 | 8 | 0.002 | 1.600e-04 | 2.580e-04 | 5.02 | 0.17 |
MPI_Waitall | 40960 | 21 | 0.002 | 2.003e-05 | 1.891e-04 | 4.58 | 0.16 |
MPI_Test | 0 | 16419 | 0.001 | 0.000e+00 | 1.192e-06 | 3.84 | 0.13 |
MPI_Waitall | 81920 | 11 | 0.001 | 7.701e-05 | 1.900e-04 | 3.78 | 0.13 |
MPI_Waitall | 49152 | 16 | 0.001 | 3.791e-05 | 1.721e-04 | 3.77 | 0.13 |
MPI_Waitall | 2560 | 179 | 0.001 | 1.907e-06 | 1.161e-04 | 3.42 | 0.12 |
MPI_Waitall | 114688 | 8 | 0.001 | 1.211e-04 | 1.531e-04 | 3.34 | 0.12 |
MPI_Irecv | 12288 | 1111 | 0.001 | 0.000e+00 | 2.289e-05 | 1.78 | 0.06 |
MPI_Waitall | 57344 | 10 | 0.001 | 4.888e-05 | 1.130e-04 | 2.29 | 0.08 |
MPI_Waitall | 768 | 5 | 0.000 | 3.099e-06 | 2.551e-04 | 0.80 | 0.03 |
MPI_Scan | 4 | 56 | 0.001 | 5.007e-06 | 6.104e-05 | 1.98 | 0.07 |
MPI_Isend | 112 | 361 | 0.000 | 0.000e+00 | 1.907e-06 | 0.43 | 0.02 |
MPI_Waitall | 1024 | 144 | 0.001 | 9.537e-07 | 3.290e-05 | 1.71 | 0.06 |
MPI_Irecv | 0 | 476 | 0.001 | 0.000e+00 | 5.240e-04 | 1.78 | 0.06 |
MPI_Waitall | 65536 | 6 | 0.001 | 8.011e-05 | 1.030e-04 | 1.62 | 0.06 |
MPI_Waitall | 320 | 102 | 0.000 | 9.537e-07 | 1.502e-05 | 0.72 | 0.02 |
MPI_Comm_size | 0 | 4900 | 0.000 | 0.000e+00 | 1.192e-06 | 0.84 | 0.03 |
MPI_Send | 28 | 865 | 0.001 | 0.000e+00 | 1.001e-05 | 1.55 | 0.05 |
MPI_Waitall | 192 | 10 | 0.000 | 9.537e-07 | 1.631e-04 | 1.19 | 0.04 |
MPI_Irecv | 28 | 889 | 0.000 | 0.000e+00 | 3.078e-04 | 1.44 | 0.05 |
MPI_Waitall | 163840 | 3 | 0.000 | 1.059e-04 | 2.201e-04 | 1.39 | 0.05 |
MPI_Waitall | 384 | 57 | 0.000 | 9.537e-07 | 1.192e-05 | 0.39 | 0.01 |
MPI_Irecv | 81920 | 26 | 0.000 | 0.000e+00 | 1.102e-04 | 1.30 | 0.05 |
MPI_Isend | 96 | 298 | 0.000 | 0.000e+00 | 2.146e-06 | 0.41 | 0.01 |
MPI_Waitall | 96 | 1 | 0.000 | 2.139e-04 | 2.139e-04 | 0.64 | 0.02 |
MPI_Comm_rank | 0 | 4784 | 0.000 | 0.000e+00 | 1.192e-06 | 0.66 | 0.02 |
MPI_Send | 4 | 791 | 0.000 | 0.000e+00 | 4.053e-06 | 1.16 | 0.04 |
MPI_Isend | 64 | 667 | 0.000 | 0.000e+00 | 3.099e-06 | 0.74 | 0.03 |
MPI_Isend | 256 | 622 | 0.000 | 0.000e+00 | 2.146e-06 | 0.72 | 0.02 |
MPI_Waitall | 20480 | 28 | 0.000 | 3.815e-06 | 5.102e-05 | 0.98 | 0.03 |
MPI_Isend | 56 | 819 | 0.000 | 0.000e+00 | 1.192e-06 | 0.89 | 0.03 |
MPI_Waitall | 64 | 11 | 0.000 | 9.537e-07 | 1.280e-04 | 0.47 | 0.02 |
MPI_Waitall | 12288 | 8 | 0.000 | 4.053e-06 | 1.409e-04 | 0.91 | 0.03 |
MPI_Isend | 80 | 397 | 0.000 | 0.000e+00 | 3.099e-06 | 0.41 | 0.01 |
MPI_Isend | 768 | 613 | 0.000 | 0.000e+00 | 5.960e-06 | 0.86 | 0.03 |
MPI_Waitall | 6144 | 12 | 0.000 | 1.907e-06 | 1.562e-04 | 0.83 | 0.03 |
MPI_Bcast | 4 | 96 | 0.000 | 9.537e-07 | 5.960e-06 | 0.77 | 0.03 |
MPI_Isend | 48 | 473 | 0.000 | 0.000e+00 | 2.146e-06 | 0.54 | 0.02 |
MPI_Waitall | 160 | 9 | 0.000 | 9.537e-07 | 1.459e-04 | 0.69 | 0.02 |
MPI_Waitall | 48 | 2 | 0.000 | 9.513e-05 | 1.309e-04 | 0.67 | 0.02 |
MPI_Irecv | 32768 | 45 | 0.000 | 0.000e+00 | 6.080e-05 | 0.60 | 0.02 |
MPI_Irecv | 65536 | 22 | 0.000 | 0.000e+00 | 5.484e-05 | 0.60 | 0.02 |
MPI_Isend | 5120 | 84 | 0.000 | 0.000e+00 | 2.503e-05 | 0.55 | 0.02 |
MPI_Irecv | 6144 | 1042 | 0.000 | 0.000e+00 | 1.192e-06 | 0.54 | 0.02 |
MPI_Irecv | 24576 | 24 | 0.000 | 0.000e+00 | 2.909e-05 | 0.50 | 0.02 |
MPI_Isend | 7168 | 46 | 0.000 | 9.537e-07 | 1.121e-05 | 0.48 | 0.02 |
MPI_Waitall | 896 | 18 | 0.000 | 1.907e-06 | 5.293e-05 | 0.42 | 0.01 |
MPI_Isend | 0 | 238 | 0.000 | 0.000e+00 | 1.597e-05 | 0.40 | 0.01 |
MPI_Waitall | 0 | 376 | 0.000 | 0.000e+00 | 3.099e-06 | 0.39 | 0.01 |
MPI_Waitall | 512 | 3 | 0.000 | 9.537e-07 | 1.061e-04 | 0.34 | 0.01 |
MPI_Irecv | 14336 | 53 | 0.000 | 0.000e+00 | 2.790e-05 | 0.34 | 0.01 |
MPI_Isend | 4 | 222 | 0.000 | 0.000e+00 | 9.060e-06 | 0.30 | 0.01 |
MPI_Isend | 4096 | 54 | 0.000 | 0.000e+00 | 8.822e-06 | 0.29 | 0.01 |
Load balance by task: HPM counters |
---|
by MPI rank, by MPI time |
Load balance by task: memory, flops, timings |
by MPI rank, by MPI time |
Communication balance by task (sorted by MPI time) |
by MPI rank , time detail by MPI time , time detail by rank , call list |
Message Buffer Size Distributions: time |
|
Message Buffer Size Distributions: Ncalls |
|
Communication Topology : point to point data flow |
|
Switch Traffic (volume by node) |
|
Memory usage by node |
|