|
|
Regions | |||||
---|---|---|---|---|---|
Label | Ntasks | <MPI sec> | <Wall sec> | %Wall | [gflop/sec] |
ipm_noregion | 27 | 0.1581 | 0.5783 | 63.20 | 0.0000e+00 |
hypre_BoomerAMGCycle | 27 | 0.0343 | 0.2439 | 26.65 | 0.0000e+00 |
|
|
HPM Counter Statistics | ||||
---|---|---|---|---|
Event | Ntasks | Avg | Min(rank) | Max(rank) |
Communication Event Statistics (0.00% detail, 9.2580e-01 error) | |||||||
---|---|---|---|---|---|---|---|
Buffer Size | Ncalls | Total Time | Min Time | Max Time | %MPI | %Wall | |
MPI_Allreduce | 8 | 2592 | 1.371 | 6.914e-06 | 5.573e-02 | 148.11 | 19.94 |
MPI_Waitall | 32768 | 468 | 0.601 | 6.914e-06 | 1.254e-02 | 64.95 | 8.75 |
MPI_Waitall | 49152 | 612 | 0.489 | 2.003e-05 | 9.890e-03 | 52.82 | 7.11 |
MPI_Testall | 0 | 1531540 | 0.432 | 0.000e+00 | 1.216e-03 | 46.64 | 6.28 |
MPI_Iprobe | 0 | 1777372 | 0.331 | 0.000e+00 | 1.321e-03 | 35.74 | 4.81 |
MPI_Waitall | 57344 | 320 | 0.220 | 2.718e-05 | 1.026e-02 | 23.79 | 3.20 |
MPI_Recv | 8 | 7181 | 0.082 | 0.000e+00 | 4.439e-03 | 8.82 | 1.19 |
MPI_Waitall | 16384 | 407 | 0.062 | 3.099e-06 | 3.881e-03 | 6.71 | 0.90 |
MPI_Allreduce | 4 | 1539 | 0.070 | 4.053e-06 | 1.131e-03 | 7.59 | 1.02 |
MPI_Waitall | 24576 | 564 | 0.045 | 2.146e-06 | 3.300e-03 | 4.89 | 0.66 |
MPI_Waitall | 10240 | 93 | 0.004 | 3.099e-06 | 5.751e-04 | 0.43 | 0.06 |
MPI_Test | 0 | 234091 | 0.039 | 0.000e+00 | 1.502e-05 | 4.23 | 0.57 |
MPI_Waitall | 5120 | 302 | 0.014 | 1.907e-06 | 6.552e-04 | 1.51 | 0.20 |
MPI_Waitall | 7168 | 245 | 0.011 | 2.861e-06 | 5.131e-04 | 1.16 | 0.16 |
MPI_Waitall | 2048 | 86 | 0.018 | 2.861e-06 | 2.513e-03 | 1.99 | 0.27 |
MPI_Isend | 12288 | 5285 | 0.018 | 0.000e+00 | 4.270e-04 | 1.97 | 0.27 |
MPI_Waitall | 3072 | 43 | 0.022 | 5.007e-06 | 2.528e-03 | 2.38 | 0.32 |
MPI_Waitall | 12288 | 39 | 0.019 | 4.053e-06 | 5.168e-03 | 2.05 | 0.28 |
MPI_Waitall | 14336 | 32 | 0.002 | 5.007e-06 | 4.911e-04 | 0.18 | 0.02 |
MPI_Isend | 8 | 11788 | 0.013 | 0.000e+00 | 8.071e-04 | 1.43 | 0.19 |
MPI_Waitall | 3584 | 334 | 0.019 | 3.099e-06 | 1.049e-03 | 2.09 | 0.28 |
MPI_Waitall | 2560 | 247 | 0.013 | 1.907e-06 | 1.028e-03 | 1.45 | 0.20 |
MPI_Waitall | 28672 | 302 | 0.020 | 3.099e-06 | 1.620e-03 | 2.13 | 0.29 |
MPI_Waitall | 640 | 130 | 0.010 | 5.007e-06 | 1.348e-03 | 1.07 | 0.14 |
MPI_Waitall | 65536 | 108 | 0.014 | 3.004e-05 | 6.661e-04 | 1.53 | 0.21 |
MPI_Waitall | 262144 | 29 | 0.016 | 2.010e-04 | 1.202e-03 | 1.69 | 0.23 |
MPI_Waitall | 1792 | 150 | 0.003 | 3.815e-06 | 1.650e-04 | 0.32 | 0.04 |
MPI_Waitall | 40960 | 41 | 0.015 | 1.288e-05 | 1.070e-03 | 1.66 | 0.22 |
MPI_Waitall | 8192 | 216 | 0.008 | 5.007e-06 | 4.120e-04 | 0.92 | 0.12 |
MPI_Waitall | 896 | 97 | 0.012 | 1.907e-06 | 1.229e-03 | 1.26 | 0.17 |
MPI_Waitall | 393216 | 19 | 0.014 | 1.750e-04 | 1.512e-03 | 1.51 | 0.20 |
MPI_Waitall | 1024 | 164 | 0.012 | 1.907e-06 | 1.313e-03 | 1.26 | 0.17 |
MPI_Waitall | 1280 | 197 | 0.004 | 2.861e-06 | 2.220e-04 | 0.45 | 0.06 |
MPI_Isend | 6144 | 5234 | 0.013 | 0.000e+00 | 7.105e-05 | 1.44 | 0.19 |
MPI_Scan | 4 | 189 | 0.012 | 1.288e-05 | 1.224e-03 | 1.34 | 0.18 |
MPI_Irecv | 12288 | 5285 | 0.007 | 0.000e+00 | 3.128e-04 | 0.80 | 0.11 |
MPI_Waitall | 131072 | 35 | 0.012 | 5.698e-05 | 1.017e-03 | 1.29 | 0.17 |
MPI_Waitall | 256 | 34 | 0.005 | 3.099e-06 | 9.432e-04 | 0.59 | 0.08 |
MPI_Waitall | 6144 | 121 | 0.006 | 3.815e-06 | 4.640e-04 | 0.65 | 0.09 |
MPI_Send | 28 | 6334 | 0.010 | 0.000e+00 | 8.440e-04 | 1.11 | 0.15 |
MPI_Isend | 16 | 2270 | 0.001 | 0.000e+00 | 1.502e-05 | 0.13 | 0.02 |
MPI_Waitall | 4096 | 176 | 0.007 | 5.007e-06 | 9.520e-04 | 0.72 | 0.10 |
MPI_Waitall | 196608 | 24 | 0.009 | 1.540e-04 | 8.590e-04 | 1.02 | 0.14 |
MPI_Waitall | 320 | 129 | 0.005 | 2.146e-06 | 1.033e-03 | 0.59 | 0.08 |
MPI_Waitall | 224 | 20 | 0.006 | 1.907e-06 | 1.209e-03 | 0.59 | 0.08 |
MPI_Waitall | 20480 | 40 | 0.008 | 6.914e-06 | 2.522e-03 | 0.89 | 0.12 |
MPI_Isend | 1536 | 2698 | 0.002 | 0.000e+00 | 1.907e-05 | 0.23 | 0.03 |
MPI_Waitall | 768 | 98 | 0.004 | 5.960e-06 | 1.243e-03 | 0.40 | 0.05 |
MPI_Waitall | 81920 | 40 | 0.007 | 2.289e-05 | 4.530e-04 | 0.80 | 0.11 |
MPI_Waitall | 512 | 91 | 0.002 | 4.053e-06 | 1.190e-04 | 0.17 | 0.02 |
MPI_Send | 4 | 5367 | 0.007 | 0.000e+00 | 3.281e-04 | 0.74 | 0.10 |
MPI_Waitall | 64 | 37 | 0.001 | 9.537e-07 | 2.890e-04 | 0.13 | 0.02 |
MPI_Waitall | 98304 | 34 | 0.007 | 5.388e-05 | 3.271e-04 | 0.73 | 0.10 |
MPI_Waitall | 229376 | 16 | 0.006 | 2.670e-04 | 5.360e-04 | 0.69 | 0.09 |
MPI_Waitall | 192 | 24 | 0.004 | 1.907e-06 | 1.145e-03 | 0.46 | 0.06 |
MPI_Waitall | 1536 | 142 | 0.004 | 3.099e-06 | 2.229e-04 | 0.45 | 0.06 |
MPI_Waitall | 448 | 75 | 0.001 | 1.907e-06 | 1.428e-04 | 0.12 | 0.02 |
MPI_Waitall | 160 | 98 | 0.002 | 5.960e-06 | 7.610e-04 | 0.23 | 0.03 |
MPI_Waitall | 128 | 111 | 0.002 | 1.192e-06 | 5.250e-04 | 0.24 | 0.03 |
MPI_Waitall | 384 | 95 | 0.002 | 3.099e-06 | 7.410e-04 | 0.25 | 0.03 |
MPI_Waitall | 114688 | 18 | 0.005 | 7.296e-05 | 5.212e-04 | 0.56 | 0.08 |
MPI_Isend | 32 | 1932 | 0.001 | 0.000e+00 | 1.502e-05 | 0.10 | 0.01 |
MPI_Isend | 128 | 2148 | 0.001 | 0.000e+00 | 1.478e-05 | 0.13 | 0.02 |
MPI_Isend | 96 | 2437 | 0.001 | 0.000e+00 | 2.194e-05 | 0.14 | 0.02 |
MPI_Waitall | 524288 | 6 | 0.004 | 4.091e-04 | 9.060e-04 | 0.45 | 0.06 |
MPI_Isend | 112 | 1876 | 0.001 | 0.000e+00 | 8.106e-06 | 0.10 | 0.01 |
MPI_Waitall | 80 | 28 | 0.002 | 9.060e-06 | 3.669e-04 | 0.20 | 0.03 |
MPI_Isend | 64 | 3702 | 0.002 | 0.000e+00 | 1.383e-05 | 0.21 | 0.03 |
MPI_Waitall | 112 | 61 | 0.001 | 9.537e-07 | 3.071e-04 | 0.15 | 0.02 |
MPI_Isend | 48 | 3741 | 0.002 | 0.000e+00 | 4.911e-05 | 0.21 | 0.03 |
MPI_Isend | 4 | 5913 | 0.003 | 0.000e+00 | 2.694e-05 | 0.32 | 0.04 |
MPI_Comm_size | 0 | 24915 | 0.002 | 0.000e+00 | 3.099e-06 | 0.18 | 0.02 |
MPI_Isend | 56 | 3891 | 0.002 | 0.000e+00 | 1.502e-05 | 0.21 | 0.03 |
MPI_Isend | 80 | 1675 | 0.001 | 0.000e+00 | 7.868e-06 | 0.10 | 0.01 |
MPI_Isend | 256 | 2836 | 0.001 | 0.000e+00 | 1.502e-05 | 0.15 | 0.02 |
MPI_Irecv | 28 | 7299 | 0.003 | 0.000e+00 | 4.501e-04 | 0.30 | 0.04 |
MPI_Irecv | 81920 | 111 | 0.002 | 0.000e+00 | 3.059e-04 | 0.27 | 0.04 |
MPI_Comm_rank | 0 | 23617 | 0.001 | 0.000e+00 | 1.001e-05 | 0.14 | 0.02 |
MPI_Isend | 40 | 2517 | 0.001 | 0.000e+00 | 8.106e-06 | 0.13 | 0.02 |
MPI_Isend | 224 | 1640 | 0.001 | 0.000e+00 | 5.960e-06 | 0.08 | 0.01 |
MPI_Irecv | 0 | 2592 | 0.002 | 0.000e+00 | 8.042e-04 | 0.20 | 0.03 |
MPI_Waitall | 163840 | 7 | 0.002 | 1.521e-04 | 3.929e-04 | 0.20 | 0.03 |
MPI_Isend | 768 | 2946 | 0.002 | 0.000e+00 | 2.503e-05 | 0.20 | 0.03 |
MPI_Irecv | 65536 | 105 | 0.002 | 0.000e+00 | 1.040e-04 | 0.20 | 0.03 |
MPI_Bcast | 4 | 351 | 0.002 | 0.000e+00 | 2.098e-05 | 0.18 | 0.02 |
MPI_Irecv | 6144 | 4913 | 0.002 | 0.000e+00 | 1.097e-05 | 0.17 | 0.02 |
MPI_Irecv | 16384 | 444 | 0.001 | 0.000e+00 | 3.409e-05 | 0.15 | 0.02 |
MPI_Waitall | 327680 | 4 | 0.001 | 2.420e-04 | 4.141e-04 | 0.14 | 0.02 |
MPI_Isend | 7168 | 289 | 0.001 | 0.000e+00 | 2.503e-05 | 0.14 | 0.02 |
MPI_Isend | 12 | 2792 | 0.001 | 0.000e+00 | 1.097e-05 | 0.14 | 0.02 |
MPI_Irecv | 4 | 10507 | 0.001 | 0.000e+00 | 4.053e-06 | 0.13 | 0.02 |
MPI_Irecv | 40960 | 126 | 0.001 | 0.000e+00 | 3.040e-04 | 0.13 | 0.02 |
MPI_Waitall | 40 | 19 | 0.001 | 1.907e-06 | 2.370e-04 | 0.09 | 0.01 |
MPI_Waitall | 655360 | 1 | 0.001 | 1.055e-03 | 1.055e-03 | 0.11 | 0.02 |
MPI_Irecv | 14336 | 256 | 0.001 | 0.000e+00 | 3.982e-05 | 0.11 | 0.01 |
MPI_Isend | 16384 | 444 | 0.001 | 0.000e+00 | 2.499e-04 | 0.11 | 0.01 |
MPI_Isend | 5120 | 453 | 0.001 | 0.000e+00 | 1.597e-05 | 0.11 | 0.01 |
MPI_Isend | 14336 | 256 | 0.001 | 0.000e+00 | 2.809e-04 | 0.10 | 0.01 |
MPI_Isend | 40960 | 126 | 0.001 | 0.000e+00 | 2.420e-04 | 0.09 | 0.01 |
MPI_Waitall | 0 | 1432 | 0.001 | 0.000e+00 | 4.053e-06 | 0.08 | 0.01 |
MPI_Irecv | 32768 | 197 | 0.001 | 0.000e+00 | 6.008e-05 | 0.09 | 0.01 |
MPI_Isend | 0 | 1552 | 0.001 | 0.000e+00 | 2.861e-06 | 0.08 | 0.01 |
MPI_Isend | 4096 | 313 | 0.001 | 0.000e+00 | 1.621e-05 | 0.08 | 0.01 |
MPI_Irecv | 24576 | 106 | 0.001 | 0.000e+00 | 4.482e-05 | 0.08 | 0.01 |
Load balance by task: HPM counters |
---|
by MPI rank, by MPI time |
Load balance by task: memory, flops, timings |
by MPI rank, by MPI time |
Communication balance by task (sorted by MPI time) |
by MPI rank , time detail by MPI time , time detail by rank , call list |
Message Buffer Size Distributions: time |
|
Message Buffer Size Distributions: Ncalls |
|
Communication Topology : point to point data flow |
|
Switch Traffic (volume by node) |
|
Memory usage by node |
|