@bzm3r
Created February 22, 2020 22:02
transpose-hybrid-shuffle-WGS=(32,1) kernel already compiled...
num bms: 4096, num dispatch groups: 4096
GPU results verified!
task name:Vk-HybridShuffle-TG=32
device: Intel(R) HD Graphics 520
num BMs: 4096, TG size: 32
CPU loops: 101, GPU loops: 1001
timestamp stats (N = 101): 57.83 +/- 1.31 ms
instant stats (N = 101): 58.53 +/- 1.26 ms
transpose-hybrid-shuffle-WGS=(64,1) kernel already compiled...
num bms: 4096, num dispatch groups: 2048
GPU results verified!
task name:Vk-HybridShuffle-TG=64
device: Intel(R) HD Graphics 520
num BMs: 4096, TG size: 64
CPU loops: 101, GPU loops: 1001
timestamp stats (N = 101): 58.21 +/- 1.30 ms
instant stats (N = 101): 59.05 +/- 1.37 ms
transpose-hybrid-shuffle-WGS=(128,1) kernel already compiled...
num bms: 4096, num dispatch groups: 1024
GPU results verified!
task name:Vk-HybridShuffle-TG=128
device: Intel(R) HD Graphics 520
num BMs: 4096, TG size: 128
CPU loops: 101, GPU loops: 1001
timestamp stats (N = 101): 58.79 +/- 1.35 ms
instant stats (N = 101): 59.58 +/- 1.33 ms
transpose-hybrid-shuffle-WGS=(256,1) kernel already compiled...
num bms: 4096, num dispatch groups: 512
GPU results verified!
task name:Vk-HybridShuffle-TG=256
device: Intel(R) HD Graphics 520
num BMs: 4096, TG size: 256
CPU loops: 101, GPU loops: 1001
timestamp stats (N = 101): 59.96 +/- 1.55 ms
instant stats (N = 101): 60.80 +/- 1.53 ms
transpose-hybrid-shuffle-WGS=(512,1) kernel already compiled...
num bms: 4096, num dispatch groups: 256
GPU results verified!
task name:Vk-HybridShuffle-TG=512
device: Intel(R) HD Graphics 520
num BMs: 4096, TG size: 512
CPU loops: 101, GPU loops: 1001
timestamp stats (N = 101): 67.65 +/- 1.98 ms
instant stats (N = 101): 68.48 +/- 1.98 ms
transpose-hybrid-shuffle-WGS=(1024,1) kernel already compiled...
num bms: 4096, num dispatch groups: 128
GPU results verified!
task name:Vk-HybridShuffle-TG=1024
device: Intel(R) HD Graphics 520
num BMs: 4096, TG size: 1024
CPU loops: 101, GPU loops: 1001
timestamp stats (N = 101): 84.42 +/- 1.27 ms
instant stats (N = 101): 85.25 +/- 1.26 ms