@EoD
Last active January 8, 2016 09:40
$ ./bin/opencl-stream
---OpenCL: Platform Vendor 0: Mesa
---OpenCL: Device found 0. AMD TONGA (DRM 3.1.0, LLVM 3.8.0)
---OpenCL: CL_DEVICE_MAX_MEM_ALLOC_SIZE: 256 MB
---OpenCL: CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE: 0 B
---OpenCL: CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT: 4 B
---OpenCL: CL_DEVICE_NATIVE_VECTOR_WIDTH_FLOAT: 4 B
---OpenCL: CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE: 2 B
---OpenCL: CL_DEVICE_NATIVE_VECTOR_WIDTH_DOUBLE: 2 B
---OpenCL: CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT: 4 B
---OpenCL: CL_DEVICE_NATIVE_VECTOR_WIDTH_INT: 4 B
---OpenCL: CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG: 2 B
---OpenCL: CL_DEVICE_NATIVE_VECTOR_WIDTH_LONG: 2 B
Auto-selecting platform 0.
Auto-selecting device 0.
------------------------------------------------------------------------------------------------
Function       Best Rate GB/s   Avg time   Min time   Max time  Best Workgroup Size  Best GFLOPS
------------------------------------------------------------------------------------------------
copyKernel1           136.778   0.088240   0.182778   1.723276                   64        0.000
copyKernel2           136.027   0.057542   0.183787   0.904594                   32        0.000
copyKernel4           112.169   0.063991   0.222879   0.820489                    8        0.000
copyKernel8           119.263   0.063769   0.209621   0.813073                    8        0.000
copyKernel16          106.846   0.075604   0.233981   0.813352                    4        0.000
------------------------------------------------------------------------------------------------
scaleKernel1          136.355   0.097945   0.183346   1.950134                   64        9.151
scaleKernel2          135.931   0.072307   0.183916   1.302218                   32        9.122
scaleKernel4          135.681   0.063167   0.184256   1.089181                   16        9.105
scaleKernel8          133.663   0.067694   0.187038   1.004416                    8        8.970
scaleKernel16          90.921   0.088369   0.274963   1.007259                    4        6.102
------------------------------------------------------------------------------------------------
addKernel1             75.402   0.141642   0.331556   2.488551                  256        5.060
addKernel2             86.989   0.097308   0.287392   1.308611                  256        5.838
addKernel4             91.205   0.082391   0.274108   1.217896                   32        6.121
addKernel8             84.097   0.104618   0.297276   1.217038                   16        5.644
addKernel16            73.294   0.109907   0.341091   1.219109                    8        4.919
------------------------------------------------------------------------------------------------
triadKernel1          113.341   0.143745   0.330859   2.506930                  256       10.142
triadKernel2          132.021   0.111400   0.284046   1.536999                  256       11.813
triadKernel4          136.679   0.085245   0.274366   1.235817                   32       12.230
triadKernel8          123.406   0.107329   0.303874   1.263041                    8       11.042
triadKernel16         111.287   0.111690   0.336968   1.258950                    8        9.958
------------------------------------------------------------------------------------------------
$ ./bin/opencl-stream
---OpenCL: Platform Vendor 0: Mesa
---OpenCL: Device found 0. AMD TONGA (DRM 3.1.0, LLVM 3.8.0)
Auto-selecting platform 0.
Auto-selecting device 0.
OpenCL Error -61 (CL_INVALID_BUFFER_SIZE), line 98
------------------------------------------------------------------------------------------------
Function       Best Rate GB/s   Avg time   Min time   Max time  Best Workgroup Size   Best GLOPS
------------------------------------------------------------------------------------------------
copyKernel1            51.458   0.327213   0.971664   5.200968                   16        0.000
copyKernel2            51.305   0.320870   0.974558   2.957927                    8        0.000
copyKernel4            50.460   0.397019   0.990882   4.089595                    8        0.000
copyKernel8            47.185   0.500957   1.059664   6.938917                    8        0.000
copyKernel16           37.611   1.097749   1.329412  14.879067                    2        0.000
------------------------------------------------------------------------------------------------
scaleKernel1           51.050   0.349621   0.979433   5.504304                   16        3.426
scaleKernel2           47.575   0.342027   1.050975   3.134202                    8        3.193
scaleKernel4           45.312   0.349201   1.103455   2.965188                   16        3.041
scaleKernel8           42.159   0.528744   1.185989   6.928565                    4        2.829
scaleKernel16          44.376   1.306128   1.126743  16.487554                    4        2.978
------------------------------------------------------------------------------------------------
addKernel1             45.929   0.360080   1.088644   6.068718                   32        3.082
addKernel2             45.954   0.326798   1.088034   3.135593                   16        3.084
addKernel4             46.913   0.315717   1.065802   2.501885                   16        3.148
addKernel8             47.049   0.416834   1.062731   4.461564                    8        3.157
addKernel16            46.972   0.862984   1.064466  13.083562                    4        3.152
------------------------------------------------------------------------------------------------
triadKernel1           69.291   0.369788   1.082395   6.350230                   32        6.200
triadKernel2           69.071   0.336216   1.085844   3.356133                   16        6.180
triadKernel4           70.164   0.318887   1.068925   2.627155                   16        6.278
triadKernel8           69.727   0.418932   1.075629   4.313267                    8        6.239
triadKernel16          58.054   0.923546   1.291897  14.418079                    4        5.195
------------------------------------------------------------------------------------------------
Error in result!