NVIDIA CUDA 及其函数库 (NVIDIA CUDA and Library) 简述
1、CUDA 常用 debug 工具:(Frequently-used CUDA Debug Tools)
--IDEs: nsight (Linux, Mac), Nsight VSE (Windows)
--Debuggers: cuda-memcheck, cuda-gdb (Linux), Nsight VSE (Windows)
--Profilers: Nsight Systems, Nsight Compute, nvprof, nvvp, Nsight VSE (Windows)
--Utilities: cuobjdump, nvdisasm, gpu-library-advisor
2、CUDA 包括的库:(CUDA Library)
--cublas (BLAS)
--cublas_device (BLAS Kernel Interface)
--cuda_occupancy (Kernel Occupancy Calculation [header file implementation]) cudadevrt (CUDA Device Runtime)
--cudart (CUDA Runtime)
--cufft (Fast Fourier Transform [FFT])
--cupti (CUDA Profiling Tools Interface)
--curand (Random Number Generation)
--cusolver (Dense and Sparse Direct Linear Solvers and Eigen Solvers) cusparse (Sparse Matrix)
--nvJPEG (JPEG encoding/decoding)
--npp (NVIDIA Performance Primitives [image and signal processing]) nvblas ("Drop-in" BLAS)
--nvcuvid (CUDA Video Decoder [Windows, Linux])
--nvgraph (CUDA nvGRAPH [accelerated graph analytics])
--nvml (NVIDIA Management Library)
--nvrtc (CUDA Runtime Compilation)
--nvtx (NVIDIA Tools Extension)
--thrust (Parallel Algorithm Library [header file implementation])
3、CUDA 版本和驱动版本对照表:(Parallel Table of CUDA v.s. Drivers)
--Table 1. CUDA Toolkit and Compatible Driver Versions
--CUDA ToolkitLinux x86_64 Driver Version Windows x86_64 Driver Version
--CUDA 10.1.105 >= 418.39 >= 418.96
--CUDA 10.0.130 >= 410.48 >= 411.31
--CUDA 9.2 (9.2.148 Update 1) >= 396.37 >= 398.26
--CUDA 9.2 (9.2.88) >= 396.26 >= 397.44
--CUDA 9.1 (9.1.85) >= 390.46 >= 391.29
--CUDA 9.0 (9.0.76) >= 384.81 >= 385.54
--CUDA 8.0 (8.0.61 GA2) >= 375.26 >= 376.51
--CUDA 8.0 (8.0.44) >= 367.48 >= 369.30
--CUDA 7.5 (7.5.16) >= 352.31 >= 353.66
--CUDA 7.0 (7.0.28) >= 346.46 >= 347.62
问题来源