Skip to content

Instantly share code, notes, and snippets.

@bzm3r
Created February 22, 2020 01:23
Show Gist options
  • Save bzm3r/1e2c8de27548e23975239367c14b9b6a to your computer and use it in GitHub Desktop.
Save bzm3r/1e2c8de27548e23975239367c14b9b6a to your computer and use it in GitHub Desktop.
threadgroup results
transpose-threadgroup-WGS=(1,32) kernel already compiled...
GPU results verified!
task name:Vk-Threadgroup-TG=32
device: Radeon RX 570 Series
num BMs: 4096, TG size: 32
CPU loops: 1001, GPU loops: 5001
timestamp stats (N = 1001): 20.19 +/- 0.65 ms
instant stats (N = 1001): 20.83 +/- 0.66 ms
transpose-threadgroup-WGS=(2,32) kernel already compiled...
GPU results verified!
task name:Vk-Threadgroup-TG=64
device: Radeon RX 570 Series
num BMs: 4096, TG size: 64
CPU loops: 1001, GPU loops: 5001
timestamp stats (N = 1001): 11.22 +/- 0.00 ms
instant stats (N = 1001): 11.84 +/- 0.11 ms
transpose-threadgroup-WGS=(3,32) kernel already compiled...
GPU results verified!
task name:Vk-Threadgroup-TG=96
device: Radeon RX 570 Series
num BMs: 4096, TG size: 96
CPU loops: 1001, GPU loops: 5001
timestamp stats (N = 1001): 18.67 +/- 0.05 ms
instant stats (N = 1001): 19.32 +/- 0.13 ms
transpose-threadgroup-WGS=(4,32) kernel already compiled...
GPU results verified!
task name:Vk-Threadgroup-TG=128
device: Radeon RX 570 Series
num BMs: 4096, TG size: 128
CPU loops: 1001, GPU loops: 5001
timestamp stats (N = 1001): 21.57 +/- 0.02 ms
instant stats (N = 1001): 22.22 +/- 0.15 ms
transpose-threadgroup-WGS=(5,32) kernel already compiled...
GPU results verified!
task name:Vk-Threadgroup-TG=160
device: Radeon RX 570 Series
num BMs: 4096, TG size: 160
CPU loops: 1001, GPU loops: 5001
timestamp stats (N = 1001): 27.98 +/- 0.37 ms
instant stats (N = 1001): 28.65 +/- 0.40 ms
transpose-threadgroup-WGS=(6,32) kernel already compiled...
GPU results verified!
task name:Vk-Threadgroup-TG=192
device: Radeon RX 570 Series
num BMs: 4096, TG size: 192
CPU loops: 1001, GPU loops: 5001
timestamp stats (N = 1001): 34.42 +/- 0.34 ms
instant stats (N = 1001): 35.07 +/- 0.37 ms
transpose-threadgroup-WGS=(7,32) kernel already compiled...
GPU results verified!
task name:Vk-Threadgroup-TG=224
device: Radeon RX 570 Series
num BMs: 4096, TG size: 224
CPU loops: 1001, GPU loops: 5001
timestamp stats (N = 1001): 38.91 +/- 0.03 ms
instant stats (N = 1001): 39.56 +/- 0.15 ms
transpose-threadgroup-WGS=(8,32) kernel already compiled...
GPU results verified!
task name:Vk-Threadgroup-TG=256
device: Radeon RX 570 Series
num BMs: 4096, TG size: 256
CPU loops: 1001, GPU loops: 5001
timestamp stats (N = 1001): 42.67 +/- 0.03 ms
instant stats (N = 1001): 43.32 +/- 0.16 ms
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment