Skip to content

Instantly share code, notes, and snippets.

@gartnera
Last active December 17, 2022 06:36
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save gartnera/4b9dca45cb6be647dbb12e25c2b645db to your computer and use it in GitHub Desktop.
Save gartnera/4b9dca45cb6be647dbb12e25c2b645db to your computer and use it in GitHub Desktop.
7900XT Linux Benchmarks. AMD Ryzen 7 7700X CPU. 32GB RAM. Ubuntu 22.04 userspace.
root@rocm22:~/clpeak/build# ./clpeak -d 0
Platform: AMD Accelerated Parallel Processing
Device: gfx1100
Driver version : 3513.0 (HSA1.1,LC) (Linux x64)
Compute units : 42
Clock frequency : 3125 MHz
Global memory bandwidth (GBPS)
float : 615.87
float2 : 645.74
float4 : 674.50
float8 : 710.11
float16 : 717.83
Single-precision compute (GFLOPS)
float : 28076.05
float2 : 28097.33
float4 : 27598.11
float8 : 26509.29
float16 : 26367.47
Half-precision compute (GFLOPS)
half : 27488.21
half2 : 53690.29
half4 : 54410.40
half8 : 51235.12
half16 : 52198.80
Double-precision compute (GFLOPS)
double : 980.99
double2 : 986.50
double4 : 975.67
double8 : 974.88
double16 : 970.54
Integer compute (GIOPS)
int : 7106.09
int2 : 6773.35
int4 : 6677.53
int8 : 6722.32
int16 : 6739.17
Integer compute Fast 24bit (GIOPS)
int : 23607.37
int2 : 23635.00
int4 : 23451.05
int8 : 23487.69
int16 : 20124.57
Transfer bandwidth (GBPS)
enqueueWriteBuffer : 18.96
enqueueReadBuffer : 5.52
enqueueWriteBuffer non-blocking : 18.32
enqueueReadBuffer non-blocking : 5.52
enqueueMapBuffer(for read) : 1073741.88
memcpy from mapped ptr : 5.57
enqueueUnmap(after write) : 1160801.88
memcpy to mapped ptr : 17.79
Kernel launch latency : 1929408384.00 us
# release doesn't work. downloaded latest build from CI here: https://github.com/hashcat/hashcat/actions/runs/3689982478
root@rocm22:~/hashcat# ./hashcat -d 1 --benchmark
hashcat (v6.2.6) starting in benchmark mode
Benchmarking uses hand-optimized kernel code by default.
You can use it in your cracking session by setting the -O option.
Note: Using optimized kernel code limits the maximum supported password length.
To disable the optimized kernel code in benchmark mode, use the -w option.
HIP API (HIP 5.4.22802)
=======================
* Device #1: Radeon RX 7900 XT, 20464/20464 MB, 42MCU
* Device #2: , skipped
OpenCL API (OpenCL 2.1 AMD-APP (3513.0)) - Platform #1 [Advanced Micro Devices, Inc.]
=====================================================================================
* Device #3: Radeon RX 7900 XT, skipped
* Device #4: , skipped
Benchmark relevant options:
===========================
* --backend-devices=1
* --optimized-kernel-enable
-------------------
* Hash-Mode 0 (MD5)
-------------------
Speed.#1.........: 61748.3 MH/s (22.69ms) @ Accel:1024 Loops:1024 Thr:32 Vec:1
----------------------
* Hash-Mode 100 (SHA1)
----------------------
Speed.#1.........: 24809.5 MH/s (56.55ms) @ Accel:1024 Loops:1024 Thr:32 Vec:1
---------------------------
* Hash-Mode 1400 (SHA2-256)
---------------------------
Speed.#1.........: 10583.4 MH/s (66.47ms) @ Accel:512 Loops:1024 Thr:32 Vec:1
---------------------------
* Hash-Mode 1700 (SHA2-512)
---------------------------
Speed.#1.........: 2081.4 MH/s (84.41ms) @ Accel:256 Loops:512 Thr:32 Vec:1
-------------------------------------------------------------
* Hash-Mode 22000 (WPA-PBKDF2-PMKID+EAPOL) [Iterations: 4095]
-------------------------------------------------------------
Speed.#1.........: 1242.2 kH/s (68.91ms) @ Accel:1024 Loops:256 Thr:32 Vec:1
-----------------------
* Hash-Mode 1000 (NTLM)
-----------------------
Speed.#1.........: 97751.9 MH/s (14.26ms) @ Accel:1024 Loops:1024 Thr:32 Vec:1
---------------------
* Hash-Mode 3000 (LM)
---------------------
Speed.#1.........: 62701.4 MH/s (22.37ms) @ Accel:1024 Loops:1024 Thr:32 Vec:1
--------------------------------------------
* Hash-Mode 5500 (NetNTLMv1 / NetNTLMv1+ESS)
--------------------------------------------
Speed.#1.........: 63791.0 MH/s (21.98ms) @ Accel:1024 Loops:1024 Thr:32 Vec:1
----------------------------
* Hash-Mode 5600 (NetNTLMv2)
----------------------------
Speed.#1.........: 4243.8 MH/s (82.89ms) @ Accel:1024 Loops:256 Thr:32 Vec:1
--------------------------------------------------------
* Hash-Mode 1500 (descrypt, DES (Unix), Traditional DES)
--------------------------------------------------------
Speed.#1.........: 2248.8 MH/s (77.85ms) @ Accel:128 Loops:1024 Thr:32 Vec:1
------------------------------------------------------------------------------
* Hash-Mode 500 (md5crypt, MD5 (Unix), Cisco-IOS $1$ (MD5)) [Iterations: 1000]
------------------------------------------------------------------------------
Speed.#1.........: 22083.9 kH/s (58.82ms) @ Accel:1024 Loops:1000 Thr:32 Vec:1
----------------------------------------------------------------
* Hash-Mode 3200 (bcrypt $2*$, Blowfish (Unix)) [Iterations: 32]
----------------------------------------------------------------
Speed.#1.........: 83426 H/s (62.07ms) @ Accel:32 Loops:8 Thr:16 Vec:1
--------------------------------------------------------------------
* Hash-Mode 1800 (sha512crypt $6$, SHA512 (Unix)) [Iterations: 5000]
--------------------------------------------------------------------
Speed.#1.........: 327.1 kH/s (79.12ms) @ Accel:8192 Loops:512 Thr:32 Vec:1
--------------------------------------------------------
* Hash-Mode 7500 (Kerberos 5, etype 23, AS-REQ Pre-Auth)
--------------------------------------------------------
Speed.#1.........: 1387.7 MH/s (63.25ms) @ Accel:512 Loops:128 Thr:32 Vec:1
-------------------------------------------------
* Hash-Mode 13100 (Kerberos 5, etype 23, TGS-REP)
-------------------------------------------------
Speed.#1.........: 856.3 MH/s (51.18ms) @ Accel:256 Loops:128 Thr:32 Vec:1
---------------------------------------------------------------------------------
* Hash-Mode 15300 (DPAPI masterkey file v1 (context 1 and 2)) [Iterations: 23999]
---------------------------------------------------------------------------------
Speed.#1.........: 216.4 kH/s (67.70ms) @ Accel:1024 Loops:256 Thr:32 Vec:1
---------------------------------------------------------------------------------
* Hash-Mode 15900 (DPAPI masterkey file v2 (context 1 and 2)) [Iterations: 12899]
---------------------------------------------------------------------------------
Speed.#1.........: 74127 H/s (91.74ms) @ Accel:512 Loops:128 Thr:32 Vec:1
------------------------------------------------------------------
* Hash-Mode 7100 (macOS v10.8+ (PBKDF2-SHA512)) [Iterations: 1023]
------------------------------------------------------------------
Speed.#1.........: 945.9 kH/s (72.22ms) @ Accel:256 Loops:255 Thr:32 Vec:1
---------------------------------------------
* Hash-Mode 11600 (7-Zip) [Iterations: 16384]
---------------------------------------------
Speed.#1.........: 1198.7 kH/s (66.14ms) @ Accel:256 Loops:4096 Thr:32 Vec:1
------------------------------------------------
* Hash-Mode 12500 (RAR3-hp) [Iterations: 262144]
------------------------------------------------
Speed.#1.........: 173.6 kH/s (61.78ms) @ Accel:128 Loops:16384 Thr:32 Vec:1
--------------------------------------------
* Hash-Mode 13000 (RAR5) [Iterations: 32799]
--------------------------------------------
Speed.#1.........: 138.0 kH/s (77.65ms) @ Accel:1024 Loops:256 Thr:32 Vec:1
--------------------------------------------------------------------------------
* Hash-Mode 6211 (TrueCrypt RIPEMD160 + XTS 512 bit (legacy)) [Iterations: 1999]
--------------------------------------------------------------------------------
Speed.#1.........: 792.3 kH/s (53.17ms) @ Accel:64 Loops:1024 Thr:32 Vec:1
-----------------------------------------------------------------------------------
* Hash-Mode 13400 (KeePass 1 (AES/Twofish) and KeePass 2 (AES)) [Iterations: 24569]
-----------------------------------------------------------------------------------
Speed.#1.........: 209.2 kH/s (68.38ms) @ Accel:1024 Loops:256 Thr:32 Vec:1
-------------------------------------------------------------------
* Hash-Mode 6800 (LastPass + LastPass sniffed) [Iterations: 100099]
-------------------------------------------------------------------
Speed.#1.........: 44953 H/s (78.20ms) @ Accel:512 Loops:512 Thr:32 Vec:1
--------------------------------------------------------------------
* Hash-Mode 11300 (Bitcoin/Litecoin wallet.dat) [Iterations: 200459]
--------------------------------------------------------------------
Speed.#1.........: 9613 H/s (91.27ms) @ Accel:512 Loops:256 Thr:32 Vec:1
Started: Thu Dec 15 12:16:44 2022
Stopped: Thu Dec 15 12:19:37 2022
# AMD_VULKAN_ICD=AMDVLK was crashing, so just testing with RADV
➜ build git:(master) ✗ ./vkpeak 0
device = AMD Radeon Graphics (RADV GFX1100)
fp32-scalar = 22452.83 GFLOPS
fp32-vec4 = 20634.35 GFLOPS
fp16-scalar = 23400.66 GFLOPS
fp16-vec4 = 41876.88 GFLOPS
fp64-scalar = 986.09 GFLOPS
fp64-vec4 = 985.51 GFLOPS
int32-scalar = 6189.73 GIOPS
int32-vec4 = 5989.61 GIOPS
int16-scalar = 23371.58 GIOPS
int16-vec4 = 41198.30 GIOPS
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment