- Using CUDA makes sense for massively parallelizable code, like matrix multiplication.
- But copying data from host memory (RAM) to GPU is slow.
- MATLAB has many CUDA-aware functions. For testing, you can use MATLAB on sunna (nodes 01-08 only, please).
- Generating random numbers on the GPU and copying them back to host RAM was observed to be about 3 times faster with CUDA.
- But "raw" CUDA C/C++ requires a lot of boilerplate code. Hecke tells me that this is changing/has changed a lot in newer versions of CUDA.
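To illustrate the boilerplate: below is a minimal sketch of a CUDA C vector addition using the runtime API of that era. The device allocations, the host-to-device and device-to-host copies, and the launch-geometry arithmetic are all required even for this trivial computation; the file name and values are made up for the example.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Kernel: each thread adds one pair of elements.
__global__ void vecAdd(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

int main() {
    const int n = 1 << 20;
    const size_t bytes = n * sizeof(float);

    // Host allocations and initialization.
    float *ha = (float *)malloc(bytes);
    float *hb = (float *)malloc(bytes);
    float *hc = (float *)malloc(bytes);
    for (int i = 0; i < n; ++i) { ha[i] = 1.0f; hb[i] = 2.0f; }

    // Boilerplate: allocate device memory and copy inputs host -> device.
    float *da, *db, *dc;
    cudaMalloc(&da, bytes);
    cudaMalloc(&db, bytes);
    cudaMalloc(&dc, bytes);
    cudaMemcpy(da, ha, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(db, hb, bytes, cudaMemcpyHostToDevice);

    // Launch with enough blocks to cover all n elements.
    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;
    vecAdd<<<blocks, threads>>>(da, db, dc, n);

    // Copy the result device -> host. As noted above, these host<->device
    // transfers are the slow part and can dominate small workloads.
    cudaMemcpy(hc, dc, bytes, cudaMemcpyDeviceToHost);
    printf("c[0] = %f\n", hc[0]);

    cudaFree(da); cudaFree(db); cudaFree(dc);
    free(ha); free(hb); free(hc);
    return 0;
}
```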
export PATH=/usr/local/cuda/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH
export PATH=/usr/nld/gcc-4.6.3/bin:$PATH
export LD_LIBRARY_PATH=/usr/nld/gcc-4.6.3/lib64:$LD_LIBRARY_PATH
export LD_LIBRARY_PATH=/usr/nld/gcc-4.6.3/lib:$LD_LIBRARY_PATH
export LD_LIBRARY_PATH=/usr/nld/mpc-0.9/lib:$LD_LIBRARY_PATH
export LD_LIBRARY_PATH=/usr/nld/mpfr-3.1.1/lib:$LD_LIBRARY_PATH
export LD_LIBRARY_PATH=/usr/nld/gmp-5.0.5/lib:$LD_LIBRARY_PATH
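With those exports in place, compiling and running a CUDA source file is just a matter of invoking nvcc, which picks up the gcc 4.6.3 host compiler from PATH. A sketch, assuming a source file named vecadd.cu (the file name and the sm_20 architecture flag are assumptions; the right -arch value depends on the GPU in the node):

```shell
# Compile with the CUDA toolchain configured by the exports above;
# nvcc uses the gcc found on PATH as its host compiler.
nvcc -arch=sm_20 vecadd.cu -o vecadd
./vecadd
```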
dmanik@sunna02:~> nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2012 NVIDIA Corporation
Built on Fri_Sep_21_17:28:58_PDT_2012
Cuda compilation tools, release 5.0, V0.2.1221