@EntityFX
Created January 27, 2022 15:25
Elbrus HPL
/opt/mpich-3.1.4/bin/mpiexec -n 1 --bind-to core ./xhpl
================================================================================
HPLinpack 2.2 -- High-Performance Linpack benchmark -- February 24, 2016
Written by A. Petitet and R. Clint Whaley, Innovative Computing Laboratory, UTK
Modified by Piotr Luszczek, Innovative Computing Laboratory, UTK
Modified by Julien Langou, University of Colorado Denver
================================================================================
An explanation of the input/output parameters follows:
T/V : Wall time / encoded variant.
N : The order of the coefficient matrix A.
NB : The partitioning blocking factor.
P : The number of process rows.
Q : The number of process columns.
Time : Time in seconds to solve the linear system.
Gflops : Rate of execution for solving the linear system.
The following parameter values will be used:
N : 18000 24000 36000 40800
NB : 240
PMAP : Row-major process mapping
P : 1
Q : 1
PFACT : Left
NBMIN : 4
NDIV : 4
RFACT : Left
BCAST : 1ring
DEPTH : 1
SWAP : Mix (threshold = 64)
L1 : transposed form
U : transposed form
EQUIL : yes
ALIGN : 16 double precision words
--------------------------------------------------------------------------------
- The matrix A is randomly generated for each test.
- The following scaled residual check will be computed:
||Ax-b||_oo / ( eps * ( || x ||_oo * || A ||_oo + || b ||_oo ) * N )
- The relative machine precision (eps) is taken to be 1.110223e-16
- Computational tests pass if scaled residuals are less than 16.0
================================================================================
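Note: the scaled residual check described above can be reproduced on a toy problem. The following is a minimal sketch, not HPL's own code. It assumes a small dense system solved in pure Python by Gaussian elimination with partial pivoting; the helper names `solve` and `scaled_residual` are hypothetical, and the matrix size (50) and random seed are arbitrary choices. The eps value and the 16.0 threshold are the ones printed in the log.

```python
import random
import sys

# HPL reports eps = 1.110223e-16, which is 2**-53 (half of the float64 epsilon).
EPS = sys.float_info.epsilon / 2


def solve(a, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented working copy
    for k in range(n):
        # Pick the row with the largest pivot candidate in column k.
        p = max(range(k, n), key=lambda i: abs(m[i][k]))
        m[k], m[p] = m[p], m[k]
        for i in range(k + 1, n):
            f = m[i][k] / m[k][k]
            for j in range(k, n + 1):
                m[i][j] -= f * m[k][j]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def scaled_residual(a, x, b):
    """||Ax-b||_oo / (eps * (||x||_oo * ||A||_oo + ||b||_oo) * N)."""
    n = len(a)
    r = [sum(a[i][j] * x[j] for j in range(n)) - b[i] for i in range(n)]
    norm_r = max(abs(v) for v in r)
    norm_a = max(sum(abs(v) for v in row) for row in a)  # max absolute row sum
    norm_x = max(abs(v) for v in x)
    norm_b = max(abs(v) for v in b)
    return norm_r / (EPS * (norm_x * norm_a + norm_b) * n)


random.seed(0)  # arbitrary seed, only for repeatability
n = 50
a = [[random.uniform(-0.5, 0.5) for _ in range(n)] for _ in range(n)]
b = [random.uniform(-0.5, 0.5) for _ in range(n)]
x = solve(a, b)
res = scaled_residual(a, x, b)
print(res, "PASSED" if res < 16.0 else "FAILED")
```

The check is a backward-error style test: it scales the raw residual by the problem size and the norms of A, x and b, so a correct solver should land far below the 16.0 threshold regardless of N.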
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 18000 240 1 1 110.18 3.529e+01
HPL_pdgesv() start time Tue Jan 25 16:50:22 2022
HPL_pdgesv() end time Tue Jan 25 16:52:12 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0041575 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 24000 240 1 1 250.19 3.684e+01
HPL_pdgesv() start time Tue Jan 25 16:53:31 2022
HPL_pdgesv() end time Tue Jan 25 16:57:41 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0028744 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 36000 240 1 1 803.06 3.873e+01
HPL_pdgesv() start time Tue Jan 25 17:00:21 2022
HPL_pdgesv() end time Tue Jan 25 17:13:45 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0034209 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 40800 240 1 1 1154.37 3.923e+01
HPL_pdgesv() start time Tue Jan 25 17:18:11 2022
HPL_pdgesv() end time Tue Jan 25 17:37:25 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0034057 ...... PASSED
================================================================================
Finished 4 tests with the following results:
4 tests completed and passed residual checks,
0 tests completed and failed residual checks,
0 tests skipped because of illegal input values.
--------------------------------------------------------------------------------
End of Tests.
================================================================================
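Note: the Gflops column in the results above can be cross-checked from N and Time. A minimal sketch, assuming the commonly cited HPL operation count of 2/3·N^3 + 3/2·N^2 floating-point operations; the function name `hpl_gflops` is hypothetical, and the (N, Time) pairs are copied from the single-process run above.

```python
def hpl_gflops(n, seconds):
    """Gflops implied by the assumed HPL operation count 2/3*N^3 + 3/2*N^2."""
    flops = (2.0 / 3.0) * n**3 + 1.5 * n**2
    return flops / seconds / 1e9


# (N, Time in seconds) taken from the single-process run above.
runs = [(18000, 110.18), (24000, 250.19), (36000, 803.06), (40800, 1154.37)]
for n, seconds in runs:
    print(f"N={n:6d}  time={seconds:8.2f} s  {hpl_gflops(n, seconds):6.2f} Gflops")
```

For N = 18000 and 110.18 s this gives about 35.29 Gflops, in line with the 3.529e+01 printed by HPL.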
/opt/mpich-3.1.4/bin/mpiexec -n 8 --bind-to core ./xhpl
================================================================================
HPLinpack 2.2 -- High-Performance Linpack benchmark -- February 24, 2016
Written by A. Petitet and R. Clint Whaley, Innovative Computing Laboratory, UTK
Modified by Piotr Luszczek, Innovative Computing Laboratory, UTK
Modified by Julien Langou, University of Colorado Denver
================================================================================
An explanation of the input/output parameters follows:
T/V : Wall time / encoded variant.
N : The order of the coefficient matrix A.
NB : The partitioning blocking factor.
P : The number of process rows.
Q : The number of process columns.
Time : Time in seconds to solve the linear system.
Gflops : Rate of execution for solving the linear system.
The following parameter values will be used:
N : 18000 24000 36000 40800
NB : 240
PMAP : Row-major process mapping
P : 2
Q : 4
PFACT : Left
NBMIN : 4
NDIV : 4
RFACT : Left
BCAST : 1ring
DEPTH : 1
SWAP : Mix (threshold = 64)
L1 : transposed form
U : transposed form
EQUIL : yes
ALIGN : 16 double precision words
--------------------------------------------------------------------------------
- The matrix A is randomly generated for each test.
- The following scaled residual check will be computed:
||Ax-b||_oo / ( eps * ( || x ||_oo * || A ||_oo + || b ||_oo ) * N )
- The relative machine precision (eps) is taken to be 1.110223e-16
- Computational tests pass if scaled residuals are less than 16.0
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 18000 240 2 4 48.25 8.058e+01
HPL_pdgesv() start time Tue Jan 25 16:11:21 2022
HPL_pdgesv() end time Tue Jan 25 16:12:09 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0030712 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 24000 240 2 4 109.82 8.393e+01
HPL_pdgesv() start time Tue Jan 25 16:12:24 2022
HPL_pdgesv() end time Tue Jan 25 16:14:14 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0025091 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 36000 240 2 4 372.24 8.356e+01
HPL_pdgesv() start time Tue Jan 25 16:14:45 2022
HPL_pdgesv() end time Tue Jan 25 16:20:58 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0025393 ...... PASSED
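Note: the 8-process (P=2, Q=4) timings above can be compared with the first single-process run to estimate speedup and parallel efficiency. This is only a sketch. It assumes both runs were made on the same machine with the same binary, which the log does not state, and the N = 40800 result for the 8-process run is absent from this log, so it is left out. The dictionary names are hypothetical.

```python
# Time in seconds per N, copied from the log.
# Assumption: both runs come from the same machine and build.
time_1proc = {18000: 110.18, 24000: 250.19, 36000: 803.06}
time_8proc = {18000: 48.25, 24000: 109.82, 36000: 372.24}
processes = 8

speedup = {n: time_1proc[n] / time_8proc[n] for n in time_8proc}
efficiency = {n: s / processes for n, s in speedup.items()}

for n in sorted(speedup):
    print(f"N={n:6d}  speedup={speedup[n]:.2f}x  efficiency={efficiency[n]:.1%}")
```

Under that assumption the speedup is a little over 2x on 8 processes. One possible reason, not confirmed by the log, is that the single-process run already used a multithreaded BLAS.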
/opt/mpich-3.1.4/bin/mpiexec -n 1 --bind-to core ./xhpl
================================================================================
HPLinpack 2.2 -- High-Performance Linpack benchmark -- February 24, 2016
Written by A. Petitet and R. Clint Whaley, Innovative Computing Laboratory, UTK
Modified by Piotr Luszczek, Innovative Computing Laboratory, UTK
Modified by Julien Langou, University of Colorado Denver
================================================================================
An explanation of the input/output parameters follows:
T/V : Wall time / encoded variant.
N : The order of the coefficient matrix A.
NB : The partitioning blocking factor.
P : The number of process rows.
Q : The number of process columns.
Time : Time in seconds to solve the linear system.
Gflops : Rate of execution for solving the linear system.
The following parameter values will be used:
N : 18000 24000 36000 40800
NB : 240
PMAP : Row-major process mapping
P : 1
Q : 1
PFACT : Left
NBMIN : 4
NDIV : 4
RFACT : Left
BCAST : 1ring
DEPTH : 1
SWAP : Mix (threshold = 64)
L1 : transposed form
U : transposed form
EQUIL : yes
ALIGN : 16 double precision words
--------------------------------------------------------------------------------
- The matrix A is randomly generated for each test.
- The following scaled residual check will be computed:
||Ax-b||_oo / ( eps * ( || x ||_oo * || A ||_oo + || b ||_oo ) * N )
- The relative machine precision (eps) is taken to be 1.110223e-16
- Computational tests pass if scaled residuals are less than 16.0
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 18000 240 1 1 319.93 1.215e+01
HPL_pdgesv() start time Tue Jan 25 18:12:19 2022
HPL_pdgesv() end time Tue Jan 25 18:17:39 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0037525 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 24000 240 1 1 758.03 1.216e+01
HPL_pdgesv() start time Tue Jan 25 18:19:36 2022
HPL_pdgesv() end time Tue Jan 25 18:32:15 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0034042 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 36000 240 1 1 2529.87 1.230e+01
HPL_pdgesv() start time Tue Jan 25 18:36:17 2022
HPL_pdgesv() end time Tue Jan 25 19:18:27 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0032316 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 40800 240 1 1 3668.83 1.234e+01
HPL_pdgesv() start time Tue Jan 25 19:25:05 2022
HPL_pdgesv() end time Tue Jan 25 20:26:14 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0034766 ...... PASSED
================================================================================
Finished 4 tests with the following results:
4 tests completed and passed residual checks,
0 tests completed and failed residual checks,
0 tests skipped because of illegal input values.
--------------------------------------------------------------------------------
End of Tests.
================================================================================
/opt/mpich-3.1.4/bin/mpiexec -n 1 --bind-to core ./xhpl
================================================================================
HPLinpack 2.2 -- High-Performance Linpack benchmark -- February 24, 2016
Written by A. Petitet and R. Clint Whaley, Innovative Computing Laboratory, UTK
Modified by Piotr Luszczek, Innovative Computing Laboratory, UTK
Modified by Julien Langou, University of Colorado Denver
================================================================================
An explanation of the input/output parameters follows:
T/V : Wall time / encoded variant.
N : The order of the coefficient matrix A.
NB : The partitioning blocking factor.
P : The number of process rows.
Q : The number of process columns.
Time : Time in seconds to solve the linear system.
Gflops : Rate of execution for solving the linear system.
The following parameter values will be used:
N : 18000 24000 36000 40800
NB : 240
PMAP : Row-major process mapping
P : 1
Q : 1
PFACT : Left
NBMIN : 4
NDIV : 4
RFACT : Left
BCAST : 1ring
DEPTH : 1
SWAP : Mix (threshold = 64)
L1 : transposed form
U : transposed form
EQUIL : yes
ALIGN : 16 double precision words
--------------------------------------------------------------------------------
- The matrix A is randomly generated for each test.
- The following scaled residual check will be computed:
||Ax-b||_oo / ( eps * ( || x ||_oo * || A ||_oo + || b ||_oo ) * N )
- The relative machine precision (eps) is taken to be 1.110223e-16
- Computational tests pass if scaled residuals are less than 16.0
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 18000 240 1 1 125.07 3.109e+01
HPL_pdgesv() start time Tue Jan 25 18:31:01 2022
HPL_pdgesv() end time Tue Jan 25 18:33:07 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0039362 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 24000 240 1 1 289.98 3.178e+01
HPL_pdgesv() start time Tue Jan 25 18:34:38 2022
HPL_pdgesv() end time Tue Jan 25 18:39:28 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0030475 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 36000 240 1 1 962.21 3.233e+01
HPL_pdgesv() start time Tue Jan 25 18:42:35 2022
HPL_pdgesv() end time Tue Jan 25 18:58:37 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0033414 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR10L4L4 40800 240 1 1 1393.63 3.249e+01
HPL_pdgesv() start time Tue Jan 25 19:03:44 2022
HPL_pdgesv() end time Tue Jan 25 19:26:57 2022
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0033365 ...... PASSED
================================================================================
Finished 4 tests with the following results:
4 tests completed and passed residual checks,
0 tests completed and failed residual checks,
0 tests skipped because of illegal input values.
--------------------------------------------------------------------------------
End of Tests.
================================================================================
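Note: to compare the runs in this log side by side, the result rows can be pulled out with a small parser. A minimal sketch, assuming result lines always have the seven whitespace-separated fields shown under the "T/V N NB P Q Time Gflops" header; the names `RESULT_RE` and `parse_results` are hypothetical, and the sample text is copied from the first run.

```python
import re

# Matches rows such as: WR10L4L4 18000 240 1 1 110.18 3.529e+01
RESULT_RE = re.compile(
    r"^(?P<tv>W[RC]\w+)\s+(?P<n>\d+)\s+(?P<nb>\d+)\s+(?P<p>\d+)\s+(?P<q>\d+)"
    r"\s+(?P<time>\d+\.\d+)\s+(?P<gflops>\d+\.\d+e[+-]\d+)\s*$"
)


def parse_results(text):
    """Return one dict per HPL result row found in the given log text."""
    rows = []
    for line in text.splitlines():
        m = RESULT_RE.match(line.strip())
        if m:
            rows.append({
                "tv": m.group("tv"),
                "n": int(m.group("n")),
                "nb": int(m.group("nb")),
                "p": int(m.group("p")),
                "q": int(m.group("q")),
                "time": float(m.group("time")),
                "gflops": float(m.group("gflops")),
            })
    return rows


sample = """\
T/V N NB P Q Time Gflops
WR10L4L4 18000 240 1 1 110.18 3.529e+01
HPL_pdgesv() start time Tue Jan 25 16:50:22 2022
WR10L4L4 24000 240 1 1 250.19 3.684e+01
"""

rows = parse_results(sample)
for r in rows:
    print(r["n"], r["p"] * r["q"], r["time"], r["gflops"])
```

Header, timestamp and residual lines do not match the pattern and are skipped, so the whole log can be passed in unmodified.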