Skip to content

Instantly share code, notes, and snippets.

@keichi
Created December 19, 2022 09:00
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save keichi/bc5f036a105b2aef92df4989632dde22 to your computer and use it in GitHub Desktop.
Save keichi/bc5f036a105b2aef92df4989632dde22 to your computer and use it in GitHub Desktop.
[keichi@muffin2 ~]$ singularity run --nv --env LD_LIBRARY_PATH=/usr/local/cuda/lib64 -B $(pwd):/data hpc-benchmarks_21.4-hpcg.sif mpirun -n 1 --bind-to socket hpcg.sh --cpu-affinity 0-7 --cpu-cores-per-rank 8 --gpu-affinity 0 --dat /data/hpcg_a100_40gb.dat
WARNING: group: unknown groupid 200000
NOTE: MOFED driver for multi-node communication was not detected.
Multi-node communication performance may be reduced.
INFO: host=muffin2 rank=0 lrank=0 cores=8 gpu=0 cpu=0-7 mem= ucx= bin=/workspace/hpcg-linux-x86_64/xhpcg
HPCG-CUDA-Benchmark v1.0.0
HPCG-Benchmark v3.1
start of application (8 OMP threads)...
2022-12-19 17:22:13.776
Problem setup...
Setup time: 0.239429 sec
GPU: 'NVIDIA A100 80GB PCIe'
Memory use: 8699 MB / 81251 MB
1x1x1 process grid
256x256x256 local domain
Reference SpMV+MG...
Reference CG...
Initial Residual: 5.660695e+03 Max_err: 1.000000e+00 tot_err: 4.096000e+03
REF Iter = 1 Scaled Residual: 1.857836e-01 Max error: 1.000000e+00 tot_error: 9.698619e-01
REF Iter = 2 Scaled Residual: 1.018949e-01 Max error: 1.000000e+00 tot_error: 9.419039e-01
REF Iter = 3 Scaled Residual: 7.024168e-02 Max error: 1.000000e+00 tot_error: 9.145661e-01
REF Iter = 4 Scaled Residual: 5.357793e-02 Max error: 1.000000e+00 tot_error: 8.875508e-01
REF Iter = 5 Scaled Residual: 4.324831e-02 Max error: 1.000000e+00 tot_error: 8.607934e-01
REF Iter = 6 Scaled Residual: 3.620988e-02 Max error: 1.000000e+00 tot_error: 8.342848e-01
REF Iter = 7 Scaled Residual: 3.111134e-02 Max error: 1.000000e+00 tot_error: 8.080245e-01
REF Iter = 8 Scaled Residual: 2.726007e-02 Max error: 1.000000e+00 tot_error: 7.820212e-01
REF Iter = 9 Scaled Residual: 2.425797e-02 Max error: 1.000000e+00 tot_error: 7.562831e-01
REF Iter = 10 Scaled Residual: 2.185265e-02 Max error: 1.000000e+00 tot_error: 7.308183e-01
REF Iter = 11 Scaled Residual: 1.987491e-02 Max error: 1.000000e+00 tot_error: 7.056301e-01
REF Iter = 12 Scaled Residual: 1.820861e-02 Max error: 9.999999e-01 tot_error: 6.807200e-01
REF Iter = 13 Scaled Residual: 1.677371e-02 Max error: 9.999999e-01 tot_error: 6.560872e-01
REF Iter = 14 Scaled Residual: 1.551939e-02 Max error: 9.999998e-01 tot_error: 6.317310e-01
REF Iter = 15 Scaled Residual: 1.441463e-02 Max error: 9.999998e-01 tot_error: 6.076542e-01
REF Iter = 16 Scaled Residual: 1.343903e-02 Max error: 9.999996e-01 tot_error: 5.838617e-01
REF Iter = 17 Scaled Residual: 1.257818e-02 Max error: 9.999994e-01 tot_error: 5.603609e-01
REF Iter = 18 Scaled Residual: 1.181835e-02 Max error: 9.999990e-01 tot_error: 5.371598e-01
REF Iter = 19 Scaled Residual: 1.114377e-02 Max error: 9.999984e-01 tot_error: 5.142648e-01
REF Iter = 20 Scaled Residual: 1.053831e-02 Max error: 9.999974e-01 tot_error: 4.916784e-01
REF Iter = 21 Scaled Residual: 9.986375e-03 Max error: 9.999956e-01 tot_error: 4.694006e-01
REF Iter = 22 Scaled Residual: 9.474857e-03 Max error: 9.999926e-01 tot_error: 4.474276e-01
REF Iter = 23 Scaled Residual: 8.996370e-03 Max error: 9.999872e-01 tot_error: 4.257548e-01
REF Iter = 24 Scaled Residual: 8.548468e-03 Max error: 9.999773e-01 tot_error: 4.043789e-01
REF Iter = 25 Scaled Residual: 8.131537e-03 Max error: 9.999588e-01 tot_error: 3.832983e-01
REF Iter = 26 Scaled Residual: 7.747382e-03 Max error: 9.999236e-01 tot_error: 3.625121e-01
REF Iter = 27 Scaled Residual: 7.396224e-03 Max error: 9.998548e-01 tot_error: 3.420207e-01
REF Iter = 28 Scaled Residual: 7.075252e-03 Max error: 9.997179e-01 tot_error: 3.218202e-01
REF Iter = 29 Scaled Residual: 6.779564e-03 Max error: 9.994407e-01 tot_error: 3.019029e-01
REF Iter = 30 Scaled Residual: 6.502792e-03 Max error: 9.988717e-01 tot_error: 2.822529e-01
REF Iter = 31 Scaled Residual: 6.239884e-03 Max error: 9.976942e-01 tot_error: 2.628437e-01
REF Iter = 32 Scaled Residual: 5.992256e-03 Max error: 9.952573e-01 tot_error: 2.436245e-01
REF Iter = 33 Scaled Residual: 5.775328e-03 Max error: 9.902587e-01 tot_error: 2.244817e-01
REF Iter = 34 Scaled Residual: 5.636700e-03 Max error: 9.801624e-01 tot_error: 2.051084e-01
REF Iter = 35 Scaled Residual: 5.687363e-03 Max error: 9.600071e-01 tot_error: 1.846774e-01
REF Iter = 36 Scaled Residual: 6.107710e-03 Max error: 9.195170e-01 tot_error: 1.611846e-01
REF Iter = 37 Scaled Residual: 6.942002e-03 Max error: 8.377234e-01 tot_error: 1.310250e-01
REF Iter = 38 Scaled Residual: 7.315811e-03 Max error: 6.925335e-01 tot_error: 9.347398e-02
REF Iter = 39 Scaled Residual: 5.704675e-03 Max error: 5.288183e-01 tot_error: 6.268556e-02
REF Iter = 40 Scaled Residual: 3.478732e-03 Max error: 4.273827e-01 tot_error: 4.913752e-02
REF Iter = 41 Scaled Residual: 2.707532e-03 Max error: 3.692166e-01 tot_error: 4.305959e-02
REF Iter = 42 Scaled Residual: 3.039918e-03 Max error: 3.114679e-01 tot_error: 3.655679e-02
REF Iter = 43 Scaled Residual: 2.527545e-03 Max error: 2.541628e-01 tot_error: 2.960905e-02
REF Iter = 44 Scaled Residual: 2.063980e-03 Max error: 2.158009e-01 tot_error: 2.507440e-02
REF Iter = 45 Scaled Residual: 1.928522e-03 Max error: 1.786816e-01 tot_error: 2.087140e-02
REF Iter = 46 Scaled Residual: 1.621703e-03 Max error: 1.431311e-01 tot_error: 1.742793e-02
REF Iter = 47 Scaled Residual: 1.507093e-03 Max error: 1.091921e-01 tot_error: 1.421963e-02
REF Iter = 48 Scaled Residual: 1.166336e-03 Max error: 8.411006e-02 tot_error: 1.185468e-02
REF Iter = 49 Scaled Residual: 1.151750e-03 Max error: 6.081485e-02 tot_error: 9.639058e-03
REF Iter = 50 Scaled Residual: 8.219993e-04 Max error: 4.124194e-02 tot_error: 8.015353e-03
Optimization...
Optimization time: 8.779421e-02 sec
Validation...
Optimized CG Setup...
Initial Residual: 5.660695e+03 Max_err: 1.000000e+00 tot_err: 4.096000e+03
Iteration = 1 Scaled Residual: 2.202347e-01 Max error: 1.000000e+00 tot_error: 9.696712e-01
Iteration = 2 Scaled Residual: 1.191485e-01 Max error: 1.000000e+00 tot_error: 9.413010e-01
Iteration = 3 Scaled Residual: 8.097051e-02 Max error: 1.000000e+00 tot_error: 9.136654e-01
Iteration = 4 Scaled Residual: 6.124309e-02 Max error: 1.000000e+00 tot_error: 8.864398e-01
Iteration = 5 Scaled Residual: 4.925623e-02 Max error: 1.000000e+00 tot_error: 8.594742e-01
Iteration = 6 Scaled Residual: 4.116105e-02 Max error: 1.000000e+00 tot_error: 8.327582e-01
Iteration = 7 Scaled Residual: 3.533424e-02 Max error: 1.000000e+00 tot_error: 8.062738e-01
Iteration = 8 Scaled Residual: 3.093881e-02 Max error: 1.000000e+00 tot_error: 7.800319e-01
Iteration = 9 Scaled Residual: 2.749230e-02 Max error: 1.000000e+00 tot_error: 7.540381e-01
Iteration = 10 Scaled Residual: 2.471932e-02 Max error: 1.000000e+00 tot_error: 7.283065e-01
Iteration = 11 Scaled Residual: 2.243848e-02 Max error: 1.000000e+00 tot_error: 7.028385e-01
Iteration = 12 Scaled Residual: 2.052399e-02 Max error: 1.000000e+00 tot_error: 6.776463e-01
Iteration = 13 Scaled Residual: 1.889633e-02 Max error: 1.000000e+00 tot_error: 6.527332e-01
Iteration = 14 Scaled Residual: 1.749318e-02 Max error: 9.999999e-01 tot_error: 6.281060e-01
Iteration = 15 Scaled Residual: 1.626950e-02 Max error: 9.999998e-01 tot_error: 6.037698e-01
Iteration = 16 Scaled Residual: 1.519347e-02 Max error: 9.999995e-01 tot_error: 5.797322e-01
Iteration = 17 Scaled Residual: 1.423881e-02 Max error: 9.999988e-01 tot_error: 5.559936e-01
Iteration = 18 Scaled Residual: 1.338609e-02 Max error: 9.999974e-01 tot_error: 5.325614e-01
Iteration = 19 Scaled Residual: 1.261968e-02 Max error: 9.999943e-01 tot_error: 5.094370e-01
Iteration = 20 Scaled Residual: 1.192666e-02 Max error: 9.999880e-01 tot_error: 4.866250e-01
Iteration = 21 Scaled Residual: 1.129720e-02 Max error: 9.999752e-01 tot_error: 4.641264e-01
Iteration = 22 Scaled Residual: 1.072253e-02 Max error: 9.999498e-01 tot_error: 4.419457e-01
Iteration = 23 Scaled Residual: 1.019576e-02 Max error: 9.999005e-01 tot_error: 4.200796e-01
Iteration = 24 Scaled Residual: 9.710714e-03 Max error: 9.998067e-01 tot_error: 3.985317e-01
Iteration = 25 Scaled Residual: 9.261364e-03 Max error: 9.996318e-01 tot_error: 3.772973e-01
Iteration = 26 Scaled Residual: 8.843235e-03 Max error: 9.993118e-01 tot_error: 3.563748e-01
Iteration = 27 Scaled Residual: 8.451849e-03 Max error: 9.987384e-01 tot_error: 3.357558e-01
Iteration = 28 Scaled Residual: 8.083237e-03 Max error: 9.977310e-01 tot_error: 3.154322e-01
Iteration = 29 Scaled Residual: 7.735438e-03 Max error: 9.959961e-01 tot_error: 2.953821e-01
Iteration = 30 Scaled Residual: 7.408143e-03 Max error: 9.930677e-01 tot_error: 2.755781e-01
Iteration = 31 Scaled Residual: 7.105613e-03 Max error: 9.882205e-01 tot_error: 2.559564e-01
Iteration = 32 Scaled Residual: 6.840312e-03 Max error: 9.803454e-01 tot_error: 2.364040e-01
Iteration = 33 Scaled Residual: 6.635506e-03 Max error: 9.677530e-01 tot_error: 2.166942e-01
Iteration = 34 Scaled Residual: 6.528774e-03 Max error: 9.478351e-01 tot_error: 1.964153e-01
Iteration = 35 Scaled Residual: 6.566661e-03 Max error: 9.164654e-01 tot_error: 1.748605e-01
Iteration = 36 Scaled Residual: 6.773683e-03 Max error: 8.671280e-01 tot_error: 1.510058e-01
Iteration = 37 Scaled Residual: 7.070073e-03 Max error: 7.907894e-01 tot_error: 1.238698e-01
Iteration = 38 Scaled Residual: 7.137298e-03 Max error: 6.808658e-01 tot_error: 9.407025e-02
Iteration = 39 Scaled Residual: 6.466326e-03 Max error: 5.483560e-01 tot_error: 6.628375e-02
Iteration = 40 Scaled Residual: 4.970917e-03 Max error: 4.282996e-01 tot_error: 4.725462e-02
Iteration = 41 Scaled Residual: 3.367823e-03 Max error: 3.464041e-01 tot_error: 3.772342e-02
Iteration = 42 Scaled Residual: 2.395907e-03 Max error: 2.950169e-01 tot_error: 3.302722e-02
Iteration = 43 Scaled Residual: 2.263901e-03 Max error: 2.523657e-01 tot_error: 2.933024e-02
Iteration = 44 Scaled Residual: 2.591148e-03 Max error: 2.044977e-01 tot_error: 2.432630e-02
Iteration = 45 Scaled Residual: 2.237716e-03 Max error: 1.512566e-01 tot_error: 1.879228e-02
Iteration = 46 Scaled Residual: 1.489969e-03 Max error: 1.197750e-01 tot_error: 1.565498e-02
Iteration = 47 Scaled Residual: 1.395674e-03 Max error: 1.031406e-01 tot_error: 1.352714e-02
Iteration = 48 Scaled Residual: 1.490014e-03 Max error: 7.738042e-02 tot_error: 1.072478e-02
Iteration = 49 Scaled Residual: 9.967295e-04 Max error: 5.267676e-02 tot_error: 8.618764e-03
Iteration = 50 Scaled Residual: 8.738094e-04 Max error: 3.708438e-02 tot_error: 7.390304e-03
Iteration = 51 Scaled Residual: 8.648751e-04 Max error: 2.235215e-02 tot_error: 5.904643e-03
Iteration = 52 Scaled Residual: 5.595820e-04 Max error: 1.652918e-02 tot_error: 4.947250e-03
Starting Benchmarking Phase...
Performing 152 CG sets expected time: 180.0 seconds expected Perf: 256.3 GF (256.3 GF_per)
2022-12-19 17:24:28.323
progress = 1.3% 2.4 / 180.0 sec elapsed 177.6 sec remain 256.345 GF 256.345 GF_per
progress = 2.6% 4.7 / 180.0 sec elapsed 175.3 sec remain 256.332 GF 256.332 GF_per
progress = 3.9% 7.1 / 180.0 sec elapsed 172.9 sec remain 256.328 GF 256.328 GF_per
progress = 5.3% 9.5 / 180.0 sec elapsed 170.5 sec remain 256.324 GF 256.324 GF_per
progress = 6.6% 11.9 / 180.0 sec elapsed 168.1 sec remain 256.322 GF 256.322 GF_per
progress = 7.9% 14.2 / 180.0 sec elapsed 165.8 sec remain 256.328 GF 256.328 GF_per
progress = 9.2% 16.6 / 180.0 sec elapsed 163.4 sec remain 256.330 GF 256.330 GF_per
progress = 10.5% 19.0 / 180.0 sec elapsed 161.0 sec remain 256.329 GF 256.329 GF_per
progress = 11.8% 21.3 / 180.0 sec elapsed 158.7 sec remain 256.331 GF 256.331 GF_per
progress = 13.2% 23.7 / 180.0 sec elapsed 156.3 sec remain 256.333 GF 256.333 GF_per
progress = 14.5% 26.1 / 180.0 sec elapsed 153.9 sec remain 256.332 GF 256.332 GF_per
progress = 15.8% 28.4 / 180.0 sec elapsed 151.6 sec remain 256.335 GF 256.335 GF_per
progress = 17.1% 30.8 / 180.0 sec elapsed 149.2 sec remain 256.334 GF 256.334 GF_per
progress = 18.4% 33.2 / 180.0 sec elapsed 146.8 sec remain 256.332 GF 256.332 GF_per
progress = 19.7% 35.5 / 180.0 sec elapsed 144.5 sec remain 256.333 GF 256.333 GF_per
progress = 21.1% 37.9 / 180.0 sec elapsed 142.1 sec remain 256.333 GF 256.333 GF_per
progress = 22.4% 40.3 / 180.0 sec elapsed 139.7 sec remain 256.334 GF 256.334 GF_per
progress = 23.7% 42.7 / 180.0 sec elapsed 137.3 sec remain 256.335 GF 256.335 GF_per
progress = 25.0% 45.0 / 180.0 sec elapsed 135.0 sec remain 256.335 GF 256.335 GF_per
progress = 26.3% 47.4 / 180.0 sec elapsed 132.6 sec remain 256.335 GF 256.335 GF_per
progress = 27.6% 49.8 / 180.0 sec elapsed 130.2 sec remain 256.335 GF 256.335 GF_per
progress = 29.0% 52.1 / 180.0 sec elapsed 127.9 sec remain 256.334 GF 256.334 GF_per
progress = 30.3% 54.5 / 180.0 sec elapsed 125.5 sec remain 256.332 GF 256.332 GF_per
progress = 31.6% 56.9 / 180.0 sec elapsed 123.1 sec remain 256.332 GF 256.332 GF_per
progress = 32.9% 59.2 / 180.0 sec elapsed 120.8 sec remain 256.331 GF 256.331 GF_per
progress = 34.2% 61.6 / 180.0 sec elapsed 118.4 sec remain 256.330 GF 256.330 GF_per
progress = 35.5% 64.0 / 180.0 sec elapsed 116.0 sec remain 256.330 GF 256.330 GF_per
progress = 36.9% 66.4 / 180.0 sec elapsed 113.6 sec remain 256.330 GF 256.330 GF_per
progress = 38.2% 68.7 / 180.0 sec elapsed 111.3 sec remain 256.330 GF 256.330 GF_per
progress = 39.5% 71.1 / 180.0 sec elapsed 108.9 sec remain 256.330 GF 256.330 GF_per
progress = 40.8% 73.5 / 180.0 sec elapsed 106.5 sec remain 256.329 GF 256.329 GF_per
progress = 42.1% 75.8 / 180.0 sec elapsed 104.2 sec remain 256.330 GF 256.330 GF_per
progress = 43.4% 78.2 / 180.0 sec elapsed 101.8 sec remain 256.330 GF 256.330 GF_per
progress = 44.8% 80.6 / 180.0 sec elapsed 99.4 sec remain 256.330 GF 256.330 GF_per
progress = 46.1% 82.9 / 180.0 sec elapsed 97.1 sec remain 256.330 GF 256.330 GF_per
progress = 47.4% 85.3 / 180.0 sec elapsed 94.7 sec remain 256.330 GF 256.330 GF_per
progress = 48.7% 87.7 / 180.0 sec elapsed 92.3 sec remain 256.329 GF 256.329 GF_per
progress = 50.0% 90.1 / 180.0 sec elapsed 89.9 sec remain 256.330 GF 256.330 GF_per
progress = 51.3% 92.4 / 180.0 sec elapsed 87.6 sec remain 256.329 GF 256.329 GF_per
progress = 52.7% 94.8 / 180.0 sec elapsed 85.2 sec remain 256.329 GF 256.329 GF_per
progress = 54.0% 97.2 / 180.0 sec elapsed 82.8 sec remain 256.328 GF 256.328 GF_per
progress = 55.3% 99.5 / 180.0 sec elapsed 80.5 sec remain 256.328 GF 256.328 GF_per
progress = 56.6% 101.9 / 180.0 sec elapsed 78.1 sec remain 256.328 GF 256.328 GF_per
progress = 57.9% 104.3 / 180.0 sec elapsed 75.7 sec remain 256.328 GF 256.328 GF_per
progress = 59.3% 106.7 / 180.0 sec elapsed 73.3 sec remain 256.327 GF 256.327 GF_per
progress = 60.6% 109.0 / 180.0 sec elapsed 71.0 sec remain 256.327 GF 256.327 GF_per
progress = 61.9% 111.4 / 180.0 sec elapsed 68.6 sec remain 256.326 GF 256.326 GF_per
progress = 63.2% 113.8 / 180.0 sec elapsed 66.2 sec remain 256.327 GF 256.327 GF_per
progress = 64.5% 116.1 / 180.0 sec elapsed 63.9 sec remain 256.326 GF 256.326 GF_per
progress = 65.8% 118.5 / 180.0 sec elapsed 61.5 sec remain 256.326 GF 256.326 GF_per
progress = 67.2% 120.9 / 180.0 sec elapsed 59.1 sec remain 256.326 GF 256.326 GF_per
progress = 68.5% 123.2 / 180.0 sec elapsed 56.8 sec remain 256.326 GF 256.326 GF_per
progress = 69.8% 125.6 / 180.0 sec elapsed 54.4 sec remain 256.326 GF 256.326 GF_per
progress = 71.1% 128.0 / 180.0 sec elapsed 52.0 sec remain 256.326 GF 256.326 GF_per
progress = 72.4% 130.4 / 180.0 sec elapsed 49.6 sec remain 256.326 GF 256.326 GF_per
progress = 73.7% 132.7 / 180.0 sec elapsed 47.3 sec remain 256.327 GF 256.327 GF_per
progress = 75.1% 135.1 / 180.0 sec elapsed 44.9 sec remain 256.327 GF 256.327 GF_per
progress = 76.4% 137.5 / 180.0 sec elapsed 42.5 sec remain 256.327 GF 256.327 GF_per
progress = 77.7% 139.8 / 180.0 sec elapsed 40.2 sec remain 256.328 GF 256.328 GF_per
progress = 79.0% 142.2 / 180.0 sec elapsed 37.8 sec remain 256.328 GF 256.328 GF_per
progress = 80.3% 144.6 / 180.0 sec elapsed 35.4 sec remain 256.328 GF 256.328 GF_per
progress = 81.6% 146.9 / 180.0 sec elapsed 33.1 sec remain 256.328 GF 256.328 GF_per
progress = 82.9% 149.3 / 180.0 sec elapsed 30.7 sec remain 256.328 GF 256.328 GF_per
progress = 84.3% 151.7 / 180.0 sec elapsed 28.3 sec remain 256.328 GF 256.328 GF_per
progress = 85.6% 154.0 / 180.0 sec elapsed 26.0 sec remain 256.328 GF 256.328 GF_per
progress = 86.9% 156.4 / 180.0 sec elapsed 23.6 sec remain 256.328 GF 256.328 GF_per
progress = 88.2% 158.8 / 180.0 sec elapsed 21.2 sec remain 256.329 GF 256.329 GF_per
progress = 89.5% 161.2 / 180.0 sec elapsed 18.8 sec remain 256.329 GF 256.329 GF_per
progress = 90.8% 163.5 / 180.0 sec elapsed 16.5 sec remain 256.329 GF 256.329 GF_per
progress = 92.2% 165.9 / 180.0 sec elapsed 14.1 sec remain 256.329 GF 256.329 GF_per
progress = 93.5% 168.3 / 180.0 sec elapsed 11.7 sec remain 256.329 GF 256.329 GF_per
progress = 94.8% 170.6 / 180.0 sec elapsed 9.4 sec remain 256.329 GF 256.329 GF_per
progress = 96.1% 173.0 / 180.0 sec elapsed 7.0 sec remain 256.330 GF 256.330 GF_per
progress = 97.4% 175.4 / 180.0 sec elapsed 4.6 sec remain 256.330 GF 256.330 GF_per
progress = 98.7% 177.7 / 180.0 sec elapsed 2.3 sec remain 256.330 GF 256.330 GF_per
Completed Benchmarking Phase... elapsed time: 180.1 seconds
2022-12-19 17:27:28.456
Number of CG sets: 152
Iterations per set: 52
scaled res mean: 5.595820e-04
scaled res variance: 0.000000e+00
Total Time: 1.801177e+02 sec
Setup Overhead: 1.98%
Optimization Overhead: 0.74%
Convergence Overhead: 3.85%
1x1x1 process grid
256x256x256 local domain
SpMV = 229.8 GF (1447.5 GB/s Effective) 229.8 GF_per (1447.5 GB/s Effective)
SymGS = 290.9 GF (2245.2 GB/s Effective) 290.9 GF_per (2245.2 GB/s Effective)
total = 273.9 GF (2077.7 GB/s Effective) 273.9 GF_per (2077.7 GB/s Effective)
final = 256.3 GF (1944.1 GB/s Effective) 256.3 GF_per (1944.1 GB/s Effective)
end of application...
2022-12-19 17:27:28.499
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment