Skip to content

Instantly share code, notes, and snippets.

@edhemphill
Last active January 13, 2018 18:18
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save edhemphill/1ac8e2fc4662493c60939befc3a256cd to your computer and use it in GitHub Desktop.
Save edhemphill/1ac8e2fc4662493c60939befc3a256cd to your computer and use it in GitHub Desktop.
Number of platforms 1
Platform Name AMD Accelerated Parallel Processing
Platform Vendor Advanced Micro Devices, Inc.
Platform Version OpenCL 2.0 AMD-APP (2482.3)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
Platform Extensions function suffix AMD
Platform Name AMD Accelerated Parallel Processing
Number of devices 1
Device Name Hawaii
Device Vendor Advanced Micro Devices, Inc.
Device Vendor ID 0x1002
Device Version OpenCL 1.2 AMD-APP (2482.3)
Driver Version 2482.3
Device OpenCL C Version OpenCL C 1.2
Device Type GPU
Device Profile FULL_PROFILE
Device Board Name (AMD) AMD Radeon (TM) R9 390 Series
Device Topology (AMD) PCI-E, 01:00.0
Max compute units 40
SIMD per compute unit (AMD) 4
SIMD width (AMD) 16
SIMD instruction width (AMD) 1
Max clock frequency 1040MHz
Graphics IP (AMD) 7.2
Device Partition (core)
Max number of sub-devices 40
Supported partition types none specified
Max work item dimensions 3
Max work item sizes 256x256x256
Max work group size 256
Preferred work group size multiple 64
Wavefront width (AMD) 64
Preferred / native vector sizes
char 4 / 4
short 2 / 2
int 1 / 1
long 1 / 1
half 1 / 1 (n/a)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Address bits 64, Little-Endian
Global memory size 8577007616 (7.988GiB)
Global free memory (AMD) 8356400 (7.969GiB)
Global memory channels (AMD) 16
Global memory banks per channel (AMD) 16
Global memory bank width (AMD) 256 bytes
Error Correction support No
Max memory allocation 4244635648 (3.953GiB)
Unified memory for Host and Device No
Minimum alignment for any data type 128 bytes
Alignment of base address 2048 bits (256 bytes)
Global Memory cache type Read/Write
Global Memory cache size 16384
Global Memory cache line 64 bytes
Image support Yes
Max number of samplers per kernel 16
Max size for 1D images from buffer 134217728 pixels
Max 1D or 2D image array size 2048 images
ed@miner1:~/work$ GPU_MAX_HEAP_SIZE=100 GPU_SINGLE_ALLOC_PERCENT=100 GPU_FORCE_64BIT_PTR=1 GPU_USE_SYNC_OBJECTS=1 GPU_MAX_ALLOC_PERCENT=100 ethminer/build/ethminer/ethminer -M -G -v 9 ℹ 12:17:35|ethminer Found suitable OpenCL device [ Hawaii ] with 8577007616 bytes of GPU memory
Benchmarking on platform: CL
Preparing DAG for block #0
cl 12:17:35|cl-0 No work. Pause for 3 s.
Warming up...
cl 12:17:38|cl-0 New work: header #50c856ae… target 0000000000000002000000000000000000000000000000000000000000000000
cl 12:17:38|cl-0 New seed #00000000…
cl 12:17:39|cl-0 Platform: AMD Accelerated Parallel Processing
cl 12:17:39|cl-0 Device: Hawaii / OpenCL 1.2 AMD-APP (2482.3)
cl 12:17:39|cl-0 OpenCL kernel: Stable kernel
cl 12:17:40|cl-0 Build info:
cl 12:17:40|cl-0 Creating light cache buffer, size 16776896
cl 12:17:40|cl-0 Creating DAG buffer, size 1073739904
cl 12:17:40|cl-0 Loading kernels
cl 12:17:40|cl-0 Writing light cache buffer
cl 12:17:40|cl-0 Creating buffer for header.
cl 12:17:40|cl-0 Creating mining buffer
cl 12:17:40|cl-0 Generating DAG
cl 12:17:40|cl-0 DAG 0 %
cl 12:17:40|cl-0 DAG 6 %
cl 12:17:41|cl-0 DAG 12 %
cl 12:17:41|cl-0 DAG 18 %
cl 12:17:41|cl-0 DAG 25 %
cl 12:17:41|cl-0 DAG 31 %
cl 12:17:42|cl-0 DAG 37 %
cl 12:17:42|cl-0 DAG 43 %
cl 12:17:42|cl-0 DAG 50 %
cl 12:17:42|cl-0 DAG 56 %
cl 12:17:43|cl-0 DAG 62 %
cl 12:17:43|cl-0 DAG 68 %
cl 12:17:43|cl-0 DAG 75 %
cl 12:17:43|cl-0 DAG 81 %
cl 12:17:44|cl-0 DAG 87 %
cl 12:17:44|cl-0 DAG 93 %
cl 12:17:44|cl-0 Switch time 8802 ms / 5802023 us
Trial 1... 26025505
Trial 2... 28124336
Trial 3... 28124336
Trial 4... 28121521
Trial 5... 28229277
✘ 12:18:05|cl-0 OpenCL Error: clSetKernelArg: CL_INVALID_ARG_INDEX (-49)
min/mean/max: 26025505/27724995/28229277 H/s
inner mean: 28158378 H/s
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment