Created
December 1, 2020 00:50
Gist: shkhln/5996d9d4f60ef506a13730f607531b90
% nv-sglrun clinfo
shim init
Number of platforms 1
Platform Name NVIDIA CUDA
Platform Vendor NVIDIA Corporation
Platform Version OpenCL 1.2 CUDA 11.1.110
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
Platform Extensions function suffix NV
Platform Name NVIDIA CUDA
Number of devices 1
Device Name GeForce GTX 1660
Device Vendor NVIDIA Corporation
Device Vendor ID 0x10de
Device Version OpenCL 1.2 CUDA
Driver Version 455.38
Device OpenCL C Version OpenCL C 1.2
Device Type GPU
Device Topology (NV) PCI-E, 01:00.0
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 22
Max clock frequency 1830MHz
Compute Capability (NV) 7.5
Device Partition (core)
Max number of sub-devices 1
Supported partition types None
Supported affinity domains (n/a)
Max work item dimensions 3
Max work item sizes 1024x1024x64
Max work group size 1024
Preferred work group size multiple 32
Warp size (NV) 32
Preferred / native vector sizes
char 1 / 1
short 1 / 1
int 1 / 1
long 1 / 1
half 0 / 0 (n/a)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 6232997888 (5.805GiB)
Error Correction support No
Max memory allocation 1558249472 (1.451GiB)
Unified memory for Host and Device No
Integrated memory (NV) No
Minimum alignment for any data type 128 bytes
Alignment of base address 4096 bits (512 bytes)
Global Memory cache type Read/Write
Global Memory cache size 720896 (704KiB)
Global Memory cache line size 128 bytes
Image support Yes
Max number of samplers per kernel 32
Max size for 1D images from buffer 268435456 pixels
Max 1D or 2D image array size 2048 images
Max 2D image size 32768x32768 pixels
Max 3D image size 16384x16384x16384 pixels
Max number of read image args 256
Max number of write image args 32
Local memory type Local
Local memory size 49152 (48KiB)
Registers per block (NV) 65536
Max number of constant args 9
Max constant buffer size 65536 (64KiB)
Max size of kernel argument 4352 (4.25KiB)
Queue properties
Out-of-order execution Yes
Profiling Yes
Prefer user sync for interop No
Profiling timer resolution 1000ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Kernel execution timeout (NV) Yes
Concurrent copy and kernel execution (NV) Yes
Number of async copy engines 3
printf() buffer size 1048576 (1024KiB)
Built-in kernels (n/a)
Device Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
NULL platform behavior
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) NVIDIA CUDA
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) Success [NV]
clCreateContext(NULL, ...) [default] <checkNullCtx:2758: create context with device from default platform : error -9999>
clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) Invalid device type for platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) No platform
ICD loader properties
ICD loader Name OpenCL ICD Loader
ICD loader Vendor OCL Icd free software
ICD loader Version 2.2.12
ICD loader Profile OpenCL 2.2
NOTE: your OpenCL library declares to support OpenCL 2.2,
but it seems to support up to OpenCL 1.0 only.
NOTE: your OpenCL library only supports OpenCL 1.0,
but some installed platforms support OpenCL 1.2.
Programs using 1.2 features may crash
or behave unexpectedly
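
The size figures in the listing above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original log: it copies the byte counts from the output by hand and recomputes the human-readable values printed next to them. The helper names (`to_gib`, `to_kib`) are hypothetical. It also shows that, in this particular listing, the reported maximum allocation is exactly one quarter of the global memory size; that is an observation about these numbers only, not a claim about how the driver derives the limit.

```python
# Byte counts copied by hand from the clinfo listing above.
GLOBAL_MEM_BYTES = 6232997888      # "Global memory size"
MAX_ALLOC_BYTES = 1558249472       # "Max memory allocation"
CACHE_BYTES = 720896               # "Global Memory cache size"
LOCAL_MEM_BYTES = 49152            # "Local memory size"
KERNEL_ARG_BYTES = 4352            # "Max size of kernel argument"
BASE_ALIGN_BITS = 4096             # "Alignment of base address"

GIB = 1024 ** 3
KIB = 1024


def to_gib(n_bytes: int) -> float:
    """Convert bytes to GiB, rounded to three decimals as clinfo prints it."""
    return round(n_bytes / GIB, 3)


def to_kib(n_bytes: int) -> float:
    """Convert bytes to KiB."""
    return n_bytes / KIB


if __name__ == "__main__":
    print("global memory (GiB):", to_gib(GLOBAL_MEM_BYTES))
    print("max allocation (GiB):", to_gib(MAX_ALLOC_BYTES))
    print("cache (KiB):", to_kib(CACHE_BYTES))
    print("local memory (KiB):", to_kib(LOCAL_MEM_BYTES))
    print("kernel argument (KiB):", to_kib(KERNEL_ARG_BYTES))
    print("base alignment (bytes):", BASE_ALIGN_BITS // 8)
    # Ratio of max allocation to global memory in this listing.
    print("max alloc / global mem:", MAX_ALLOC_BYTES / GLOBAL_MEM_BYTES)
```

A single buffer larger than the reported maximum allocation (about 1.451GiB here) should be expected to fail at creation time, so data sets approaching the full global memory size would need to be split across several buffers.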