Created
December 19, 2022 09:00
-
-
Save keichi/bc5f036a105b2aef92df4989632dde22 to your computer and use it in GitHub Desktop.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
[keichi@muffin2 ~]$ singularity run --nv --env LD_LIBRARY_PATH=/usr/local/cuda/lib64 -B $(pwd):/data hpc-benchmarks_21.4-hpcg.sif mpirun -n 1 --bind-to socket hpcg.sh --cpu-affinity 0-7 --cpu-cores-per-rank 8 --gpu-affinity 0 --dat /data/hpcg_a100_40gb.dat | |
WARNING: group: unknown groupid 200000 | |
NOTE: MOFED driver for multi-node communication was not detected. | |
Multi-node communication performance may be reduced. | |
INFO: host=muffin2 rank=0 lrank=0 cores=8 gpu=0 cpu=0-7 mem= ucx= bin=/workspace/hpcg-linux-x86_64/xhpcg | |
HPCG-CUDA-Benchmark v1.0.0 | |
HPCG-Benchmark v3.1 | |
start of application (8 OMP threads)... | |
2022-12-19 17:22:13.776 | |
Problem setup... | |
Setup time: 0.239429 sec | |
GPU: 'NVIDIA A100 80GB PCIe' | |
Memory use: 8699 MB / 81251 MB | |
1x1x1 process grid | |
256x256x256 local domain | |
Reference SpMV+MG... | |
Reference CG... | |
Initial Residual: 5.660695e+03 Max_err: 1.000000e+00 tot_err: 4.096000e+03 | |
REF Iter = 1 Scaled Residual: 1.857836e-01 Max error: 1.000000e+00 tot_error: 9.698619e-01 | |
REF Iter = 2 Scaled Residual: 1.018949e-01 Max error: 1.000000e+00 tot_error: 9.419039e-01 | |
REF Iter = 3 Scaled Residual: 7.024168e-02 Max error: 1.000000e+00 tot_error: 9.145661e-01 | |
REF Iter = 4 Scaled Residual: 5.357793e-02 Max error: 1.000000e+00 tot_error: 8.875508e-01 | |
REF Iter = 5 Scaled Residual: 4.324831e-02 Max error: 1.000000e+00 tot_error: 8.607934e-01 | |
REF Iter = 6 Scaled Residual: 3.620988e-02 Max error: 1.000000e+00 tot_error: 8.342848e-01 | |
REF Iter = 7 Scaled Residual: 3.111134e-02 Max error: 1.000000e+00 tot_error: 8.080245e-01 | |
REF Iter = 8 Scaled Residual: 2.726007e-02 Max error: 1.000000e+00 tot_error: 7.820212e-01 | |
REF Iter = 9 Scaled Residual: 2.425797e-02 Max error: 1.000000e+00 tot_error: 7.562831e-01 | |
REF Iter = 10 Scaled Residual: 2.185265e-02 Max error: 1.000000e+00 tot_error: 7.308183e-01 | |
REF Iter = 11 Scaled Residual: 1.987491e-02 Max error: 1.000000e+00 tot_error: 7.056301e-01 | |
REF Iter = 12 Scaled Residual: 1.820861e-02 Max error: 9.999999e-01 tot_error: 6.807200e-01 | |
REF Iter = 13 Scaled Residual: 1.677371e-02 Max error: 9.999999e-01 tot_error: 6.560872e-01 | |
REF Iter = 14 Scaled Residual: 1.551939e-02 Max error: 9.999998e-01 tot_error: 6.317310e-01 | |
REF Iter = 15 Scaled Residual: 1.441463e-02 Max error: 9.999998e-01 tot_error: 6.076542e-01 | |
REF Iter = 16 Scaled Residual: 1.343903e-02 Max error: 9.999996e-01 tot_error: 5.838617e-01 | |
REF Iter = 17 Scaled Residual: 1.257818e-02 Max error: 9.999994e-01 tot_error: 5.603609e-01 | |
REF Iter = 18 Scaled Residual: 1.181835e-02 Max error: 9.999990e-01 tot_error: 5.371598e-01 | |
REF Iter = 19 Scaled Residual: 1.114377e-02 Max error: 9.999984e-01 tot_error: 5.142648e-01 | |
REF Iter = 20 Scaled Residual: 1.053831e-02 Max error: 9.999974e-01 tot_error: 4.916784e-01 | |
REF Iter = 21 Scaled Residual: 9.986375e-03 Max error: 9.999956e-01 tot_error: 4.694006e-01 | |
REF Iter = 22 Scaled Residual: 9.474857e-03 Max error: 9.999926e-01 tot_error: 4.474276e-01 | |
REF Iter = 23 Scaled Residual: 8.996370e-03 Max error: 9.999872e-01 tot_error: 4.257548e-01 | |
REF Iter = 24 Scaled Residual: 8.548468e-03 Max error: 9.999773e-01 tot_error: 4.043789e-01 | |
REF Iter = 25 Scaled Residual: 8.131537e-03 Max error: 9.999588e-01 tot_error: 3.832983e-01 | |
REF Iter = 26 Scaled Residual: 7.747382e-03 Max error: 9.999236e-01 tot_error: 3.625121e-01 | |
REF Iter = 27 Scaled Residual: 7.396224e-03 Max error: 9.998548e-01 tot_error: 3.420207e-01 | |
REF Iter = 28 Scaled Residual: 7.075252e-03 Max error: 9.997179e-01 tot_error: 3.218202e-01 | |
REF Iter = 29 Scaled Residual: 6.779564e-03 Max error: 9.994407e-01 tot_error: 3.019029e-01 | |
REF Iter = 30 Scaled Residual: 6.502792e-03 Max error: 9.988717e-01 tot_error: 2.822529e-01 | |
REF Iter = 31 Scaled Residual: 6.239884e-03 Max error: 9.976942e-01 tot_error: 2.628437e-01 | |
REF Iter = 32 Scaled Residual: 5.992256e-03 Max error: 9.952573e-01 tot_error: 2.436245e-01 | |
REF Iter = 33 Scaled Residual: 5.775328e-03 Max error: 9.902587e-01 tot_error: 2.244817e-01 | |
REF Iter = 34 Scaled Residual: 5.636700e-03 Max error: 9.801624e-01 tot_error: 2.051084e-01 | |
REF Iter = 35 Scaled Residual: 5.687363e-03 Max error: 9.600071e-01 tot_error: 1.846774e-01 | |
REF Iter = 36 Scaled Residual: 6.107710e-03 Max error: 9.195170e-01 tot_error: 1.611846e-01 | |
REF Iter = 37 Scaled Residual: 6.942002e-03 Max error: 8.377234e-01 tot_error: 1.310250e-01 | |
REF Iter = 38 Scaled Residual: 7.315811e-03 Max error: 6.925335e-01 tot_error: 9.347398e-02 | |
REF Iter = 39 Scaled Residual: 5.704675e-03 Max error: 5.288183e-01 tot_error: 6.268556e-02 | |
REF Iter = 40 Scaled Residual: 3.478732e-03 Max error: 4.273827e-01 tot_error: 4.913752e-02 | |
REF Iter = 41 Scaled Residual: 2.707532e-03 Max error: 3.692166e-01 tot_error: 4.305959e-02 | |
REF Iter = 42 Scaled Residual: 3.039918e-03 Max error: 3.114679e-01 tot_error: 3.655679e-02 | |
REF Iter = 43 Scaled Residual: 2.527545e-03 Max error: 2.541628e-01 tot_error: 2.960905e-02 | |
REF Iter = 44 Scaled Residual: 2.063980e-03 Max error: 2.158009e-01 tot_error: 2.507440e-02 | |
REF Iter = 45 Scaled Residual: 1.928522e-03 Max error: 1.786816e-01 tot_error: 2.087140e-02 | |
REF Iter = 46 Scaled Residual: 1.621703e-03 Max error: 1.431311e-01 tot_error: 1.742793e-02 | |
REF Iter = 47 Scaled Residual: 1.507093e-03 Max error: 1.091921e-01 tot_error: 1.421963e-02 | |
REF Iter = 48 Scaled Residual: 1.166336e-03 Max error: 8.411006e-02 tot_error: 1.185468e-02 | |
REF Iter = 49 Scaled Residual: 1.151750e-03 Max error: 6.081485e-02 tot_error: 9.639058e-03 | |
REF Iter = 50 Scaled Residual: 8.219993e-04 Max error: 4.124194e-02 tot_error: 8.015353e-03 | |
Optimization... | |
Optimization time: 8.779421e-02 sec | |
Validation... | |
Optimized CG Setup... | |
Initial Residual: 5.660695e+03 Max_err: 1.000000e+00 tot_err: 4.096000e+03 | |
Iteration = 1 Scaled Residual: 2.202347e-01 Max error: 1.000000e+00 tot_error: 9.696712e-01 | |
Iteration = 2 Scaled Residual: 1.191485e-01 Max error: 1.000000e+00 tot_error: 9.413010e-01 | |
Iteration = 3 Scaled Residual: 8.097051e-02 Max error: 1.000000e+00 tot_error: 9.136654e-01 | |
Iteration = 4 Scaled Residual: 6.124309e-02 Max error: 1.000000e+00 tot_error: 8.864398e-01 | |
Iteration = 5 Scaled Residual: 4.925623e-02 Max error: 1.000000e+00 tot_error: 8.594742e-01 | |
Iteration = 6 Scaled Residual: 4.116105e-02 Max error: 1.000000e+00 tot_error: 8.327582e-01 | |
Iteration = 7 Scaled Residual: 3.533424e-02 Max error: 1.000000e+00 tot_error: 8.062738e-01 | |
Iteration = 8 Scaled Residual: 3.093881e-02 Max error: 1.000000e+00 tot_error: 7.800319e-01 | |
Iteration = 9 Scaled Residual: 2.749230e-02 Max error: 1.000000e+00 tot_error: 7.540381e-01 | |
Iteration = 10 Scaled Residual: 2.471932e-02 Max error: 1.000000e+00 tot_error: 7.283065e-01 | |
Iteration = 11 Scaled Residual: 2.243848e-02 Max error: 1.000000e+00 tot_error: 7.028385e-01 | |
Iteration = 12 Scaled Residual: 2.052399e-02 Max error: 1.000000e+00 tot_error: 6.776463e-01 | |
Iteration = 13 Scaled Residual: 1.889633e-02 Max error: 1.000000e+00 tot_error: 6.527332e-01 | |
Iteration = 14 Scaled Residual: 1.749318e-02 Max error: 9.999999e-01 tot_error: 6.281060e-01 | |
Iteration = 15 Scaled Residual: 1.626950e-02 Max error: 9.999998e-01 tot_error: 6.037698e-01 | |
Iteration = 16 Scaled Residual: 1.519347e-02 Max error: 9.999995e-01 tot_error: 5.797322e-01 | |
Iteration = 17 Scaled Residual: 1.423881e-02 Max error: 9.999988e-01 tot_error: 5.559936e-01 | |
Iteration = 18 Scaled Residual: 1.338609e-02 Max error: 9.999974e-01 tot_error: 5.325614e-01 | |
Iteration = 19 Scaled Residual: 1.261968e-02 Max error: 9.999943e-01 tot_error: 5.094370e-01 | |
Iteration = 20 Scaled Residual: 1.192666e-02 Max error: 9.999880e-01 tot_error: 4.866250e-01 | |
Iteration = 21 Scaled Residual: 1.129720e-02 Max error: 9.999752e-01 tot_error: 4.641264e-01 | |
Iteration = 22 Scaled Residual: 1.072253e-02 Max error: 9.999498e-01 tot_error: 4.419457e-01 | |
Iteration = 23 Scaled Residual: 1.019576e-02 Max error: 9.999005e-01 tot_error: 4.200796e-01 | |
Iteration = 24 Scaled Residual: 9.710714e-03 Max error: 9.998067e-01 tot_error: 3.985317e-01 | |
Iteration = 25 Scaled Residual: 9.261364e-03 Max error: 9.996318e-01 tot_error: 3.772973e-01 | |
Iteration = 26 Scaled Residual: 8.843235e-03 Max error: 9.993118e-01 tot_error: 3.563748e-01 | |
Iteration = 27 Scaled Residual: 8.451849e-03 Max error: 9.987384e-01 tot_error: 3.357558e-01 | |
Iteration = 28 Scaled Residual: 8.083237e-03 Max error: 9.977310e-01 tot_error: 3.154322e-01 | |
Iteration = 29 Scaled Residual: 7.735438e-03 Max error: 9.959961e-01 tot_error: 2.953821e-01 | |
Iteration = 30 Scaled Residual: 7.408143e-03 Max error: 9.930677e-01 tot_error: 2.755781e-01 | |
Iteration = 31 Scaled Residual: 7.105613e-03 Max error: 9.882205e-01 tot_error: 2.559564e-01 | |
Iteration = 32 Scaled Residual: 6.840312e-03 Max error: 9.803454e-01 tot_error: 2.364040e-01 | |
Iteration = 33 Scaled Residual: 6.635506e-03 Max error: 9.677530e-01 tot_error: 2.166942e-01 | |
Iteration = 34 Scaled Residual: 6.528774e-03 Max error: 9.478351e-01 tot_error: 1.964153e-01 | |
Iteration = 35 Scaled Residual: 6.566661e-03 Max error: 9.164654e-01 tot_error: 1.748605e-01 | |
Iteration = 36 Scaled Residual: 6.773683e-03 Max error: 8.671280e-01 tot_error: 1.510058e-01 | |
Iteration = 37 Scaled Residual: 7.070073e-03 Max error: 7.907894e-01 tot_error: 1.238698e-01 | |
Iteration = 38 Scaled Residual: 7.137298e-03 Max error: 6.808658e-01 tot_error: 9.407025e-02 | |
Iteration = 39 Scaled Residual: 6.466326e-03 Max error: 5.483560e-01 tot_error: 6.628375e-02 | |
Iteration = 40 Scaled Residual: 4.970917e-03 Max error: 4.282996e-01 tot_error: 4.725462e-02 | |
Iteration = 41 Scaled Residual: 3.367823e-03 Max error: 3.464041e-01 tot_error: 3.772342e-02 | |
Iteration = 42 Scaled Residual: 2.395907e-03 Max error: 2.950169e-01 tot_error: 3.302722e-02 | |
Iteration = 43 Scaled Residual: 2.263901e-03 Max error: 2.523657e-01 tot_error: 2.933024e-02 | |
Iteration = 44 Scaled Residual: 2.591148e-03 Max error: 2.044977e-01 tot_error: 2.432630e-02 | |
Iteration = 45 Scaled Residual: 2.237716e-03 Max error: 1.512566e-01 tot_error: 1.879228e-02 | |
Iteration = 46 Scaled Residual: 1.489969e-03 Max error: 1.197750e-01 tot_error: 1.565498e-02 | |
Iteration = 47 Scaled Residual: 1.395674e-03 Max error: 1.031406e-01 tot_error: 1.352714e-02 | |
Iteration = 48 Scaled Residual: 1.490014e-03 Max error: 7.738042e-02 tot_error: 1.072478e-02 | |
Iteration = 49 Scaled Residual: 9.967295e-04 Max error: 5.267676e-02 tot_error: 8.618764e-03 | |
Iteration = 50 Scaled Residual: 8.738094e-04 Max error: 3.708438e-02 tot_error: 7.390304e-03 | |
Iteration = 51 Scaled Residual: 8.648751e-04 Max error: 2.235215e-02 tot_error: 5.904643e-03 | |
Iteration = 52 Scaled Residual: 5.595820e-04 Max error: 1.652918e-02 tot_error: 4.947250e-03 | |
Starting Benchmarking Phase... | |
Performing 152 CG sets expected time: 180.0 seconds expected Perf: 256.3 GF (256.3 GF_per) | |
2022-12-19 17:24:28.323 | |
progress = 1.3% 2.4 / 180.0 sec elapsed 177.6 sec remain 256.345 GF 256.345 GF_per | |
progress = 2.6% 4.7 / 180.0 sec elapsed 175.3 sec remain 256.332 GF 256.332 GF_per | |
progress = 3.9% 7.1 / 180.0 sec elapsed 172.9 sec remain 256.328 GF 256.328 GF_per | |
progress = 5.3% 9.5 / 180.0 sec elapsed 170.5 sec remain 256.324 GF 256.324 GF_per | |
progress = 6.6% 11.9 / 180.0 sec elapsed 168.1 sec remain 256.322 GF 256.322 GF_per | |
progress = 7.9% 14.2 / 180.0 sec elapsed 165.8 sec remain 256.328 GF 256.328 GF_per | |
progress = 9.2% 16.6 / 180.0 sec elapsed 163.4 sec remain 256.330 GF 256.330 GF_per | |
progress = 10.5% 19.0 / 180.0 sec elapsed 161.0 sec remain 256.329 GF 256.329 GF_per | |
progress = 11.8% 21.3 / 180.0 sec elapsed 158.7 sec remain 256.331 GF 256.331 GF_per | |
progress = 13.2% 23.7 / 180.0 sec elapsed 156.3 sec remain 256.333 GF 256.333 GF_per | |
progress = 14.5% 26.1 / 180.0 sec elapsed 153.9 sec remain 256.332 GF 256.332 GF_per | |
progress = 15.8% 28.4 / 180.0 sec elapsed 151.6 sec remain 256.335 GF 256.335 GF_per | |
progress = 17.1% 30.8 / 180.0 sec elapsed 149.2 sec remain 256.334 GF 256.334 GF_per | |
progress = 18.4% 33.2 / 180.0 sec elapsed 146.8 sec remain 256.332 GF 256.332 GF_per | |
progress = 19.7% 35.5 / 180.0 sec elapsed 144.5 sec remain 256.333 GF 256.333 GF_per | |
progress = 21.1% 37.9 / 180.0 sec elapsed 142.1 sec remain 256.333 GF 256.333 GF_per | |
progress = 22.4% 40.3 / 180.0 sec elapsed 139.7 sec remain 256.334 GF 256.334 GF_per | |
progress = 23.7% 42.7 / 180.0 sec elapsed 137.3 sec remain 256.335 GF 256.335 GF_per | |
progress = 25.0% 45.0 / 180.0 sec elapsed 135.0 sec remain 256.335 GF 256.335 GF_per | |
progress = 26.3% 47.4 / 180.0 sec elapsed 132.6 sec remain 256.335 GF 256.335 GF_per | |
progress = 27.6% 49.8 / 180.0 sec elapsed 130.2 sec remain 256.335 GF 256.335 GF_per | |
progress = 29.0% 52.1 / 180.0 sec elapsed 127.9 sec remain 256.334 GF 256.334 GF_per | |
progress = 30.3% 54.5 / 180.0 sec elapsed 125.5 sec remain 256.332 GF 256.332 GF_per | |
progress = 31.6% 56.9 / 180.0 sec elapsed 123.1 sec remain 256.332 GF 256.332 GF_per | |
progress = 32.9% 59.2 / 180.0 sec elapsed 120.8 sec remain 256.331 GF 256.331 GF_per | |
progress = 34.2% 61.6 / 180.0 sec elapsed 118.4 sec remain 256.330 GF 256.330 GF_per | |
progress = 35.5% 64.0 / 180.0 sec elapsed 116.0 sec remain 256.330 GF 256.330 GF_per | |
progress = 36.9% 66.4 / 180.0 sec elapsed 113.6 sec remain 256.330 GF 256.330 GF_per | |
progress = 38.2% 68.7 / 180.0 sec elapsed 111.3 sec remain 256.330 GF 256.330 GF_per | |
progress = 39.5% 71.1 / 180.0 sec elapsed 108.9 sec remain 256.330 GF 256.330 GF_per | |
progress = 40.8% 73.5 / 180.0 sec elapsed 106.5 sec remain 256.329 GF 256.329 GF_per | |
progress = 42.1% 75.8 / 180.0 sec elapsed 104.2 sec remain 256.330 GF 256.330 GF_per | |
progress = 43.4% 78.2 / 180.0 sec elapsed 101.8 sec remain 256.330 GF 256.330 GF_per | |
progress = 44.8% 80.6 / 180.0 sec elapsed 99.4 sec remain 256.330 GF 256.330 GF_per | |
progress = 46.1% 82.9 / 180.0 sec elapsed 97.1 sec remain 256.330 GF 256.330 GF_per | |
progress = 47.4% 85.3 / 180.0 sec elapsed 94.7 sec remain 256.330 GF 256.330 GF_per | |
progress = 48.7% 87.7 / 180.0 sec elapsed 92.3 sec remain 256.329 GF 256.329 GF_per | |
progress = 50.0% 90.1 / 180.0 sec elapsed 89.9 sec remain 256.330 GF 256.330 GF_per | |
progress = 51.3% 92.4 / 180.0 sec elapsed 87.6 sec remain 256.329 GF 256.329 GF_per | |
progress = 52.7% 94.8 / 180.0 sec elapsed 85.2 sec remain 256.329 GF 256.329 GF_per | |
progress = 54.0% 97.2 / 180.0 sec elapsed 82.8 sec remain 256.328 GF 256.328 GF_per | |
progress = 55.3% 99.5 / 180.0 sec elapsed 80.5 sec remain 256.328 GF 256.328 GF_per | |
progress = 56.6% 101.9 / 180.0 sec elapsed 78.1 sec remain 256.328 GF 256.328 GF_per | |
progress = 57.9% 104.3 / 180.0 sec elapsed 75.7 sec remain 256.328 GF 256.328 GF_per | |
progress = 59.3% 106.7 / 180.0 sec elapsed 73.3 sec remain 256.327 GF 256.327 GF_per | |
progress = 60.6% 109.0 / 180.0 sec elapsed 71.0 sec remain 256.327 GF 256.327 GF_per | |
progress = 61.9% 111.4 / 180.0 sec elapsed 68.6 sec remain 256.326 GF 256.326 GF_per | |
progress = 63.2% 113.8 / 180.0 sec elapsed 66.2 sec remain 256.327 GF 256.327 GF_per | |
progress = 64.5% 116.1 / 180.0 sec elapsed 63.9 sec remain 256.326 GF 256.326 GF_per | |
progress = 65.8% 118.5 / 180.0 sec elapsed 61.5 sec remain 256.326 GF 256.326 GF_per | |
progress = 67.2% 120.9 / 180.0 sec elapsed 59.1 sec remain 256.326 GF 256.326 GF_per | |
progress = 68.5% 123.2 / 180.0 sec elapsed 56.8 sec remain 256.326 GF 256.326 GF_per | |
progress = 69.8% 125.6 / 180.0 sec elapsed 54.4 sec remain 256.326 GF 256.326 GF_per | |
progress = 71.1% 128.0 / 180.0 sec elapsed 52.0 sec remain 256.326 GF 256.326 GF_per | |
progress = 72.4% 130.4 / 180.0 sec elapsed 49.6 sec remain 256.326 GF 256.326 GF_per | |
progress = 73.7% 132.7 / 180.0 sec elapsed 47.3 sec remain 256.327 GF 256.327 GF_per | |
progress = 75.1% 135.1 / 180.0 sec elapsed 44.9 sec remain 256.327 GF 256.327 GF_per | |
progress = 76.4% 137.5 / 180.0 sec elapsed 42.5 sec remain 256.327 GF 256.327 GF_per | |
progress = 77.7% 139.8 / 180.0 sec elapsed 40.2 sec remain 256.328 GF 256.328 GF_per | |
progress = 79.0% 142.2 / 180.0 sec elapsed 37.8 sec remain 256.328 GF 256.328 GF_per | |
progress = 80.3% 144.6 / 180.0 sec elapsed 35.4 sec remain 256.328 GF 256.328 GF_per | |
progress = 81.6% 146.9 / 180.0 sec elapsed 33.1 sec remain 256.328 GF 256.328 GF_per | |
progress = 82.9% 149.3 / 180.0 sec elapsed 30.7 sec remain 256.328 GF 256.328 GF_per | |
progress = 84.3% 151.7 / 180.0 sec elapsed 28.3 sec remain 256.328 GF 256.328 GF_per | |
progress = 85.6% 154.0 / 180.0 sec elapsed 26.0 sec remain 256.328 GF 256.328 GF_per | |
progress = 86.9% 156.4 / 180.0 sec elapsed 23.6 sec remain 256.328 GF 256.328 GF_per | |
progress = 88.2% 158.8 / 180.0 sec elapsed 21.2 sec remain 256.329 GF 256.329 GF_per | |
progress = 89.5% 161.2 / 180.0 sec elapsed 18.8 sec remain 256.329 GF 256.329 GF_per | |
progress = 90.8% 163.5 / 180.0 sec elapsed 16.5 sec remain 256.329 GF 256.329 GF_per | |
progress = 92.2% 165.9 / 180.0 sec elapsed 14.1 sec remain 256.329 GF 256.329 GF_per | |
progress = 93.5% 168.3 / 180.0 sec elapsed 11.7 sec remain 256.329 GF 256.329 GF_per | |
progress = 94.8% 170.6 / 180.0 sec elapsed 9.4 sec remain 256.329 GF 256.329 GF_per | |
progress = 96.1% 173.0 / 180.0 sec elapsed 7.0 sec remain 256.330 GF 256.330 GF_per | |
progress = 97.4% 175.4 / 180.0 sec elapsed 4.6 sec remain 256.330 GF 256.330 GF_per | |
progress = 98.7% 177.7 / 180.0 sec elapsed 2.3 sec remain 256.330 GF 256.330 GF_per | |
Completed Benchmarking Phase... elapsed time: 180.1 seconds | |
2022-12-19 17:27:28.456 | |
Number of CG sets: 152 | |
Iterations per set: 52 | |
scaled res mean: 5.595820e-04 | |
scaled res variance: 0.000000e+00 | |
Total Time: 1.801177e+02 sec | |
Setup Overhead: 1.98% | |
Optimization Overhead: 0.74% | |
Convergence Overhead: 3.85% | |
1x1x1 process grid | |
256x256x256 local domain | |
SpMV = 229.8 GF (1447.5 GB/s Effective) 229.8 GF_per (1447.5 GB/s Effective) | |
SymGS = 290.9 GF (2245.2 GB/s Effective) 290.9 GF_per (2245.2 GB/s Effective) | |
total = 273.9 GF (2077.7 GB/s Effective) 273.9 GF_per (2077.7 GB/s Effective) | |
final = 256.3 GF (1944.1 GB/s Effective) 256.3 GF_per (1944.1 GB/s Effective) | |
end of application... | |
2022-12-19 17:27:28.499 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment