CUDA SAMPLES
TRM-06704-001_v8.0 | September 2016
Reference Manual
www.nvidia.com
CUDA Samples TRM-06704-001_v8.0|ii
TABLE OF CONTENTS
Chapter 1. Release Notes
1.1. CUDA 8.0
1.2. CUDA 7.5
1.3. CUDA 7.0
1.4. CUDA 6.5
1.5. CUDA 6.0
1.6. CUDA 5.5
1.7. CUDA 5.0
1.8. CUDA 4.2
1.9. CUDA 4.1
Chapter 2. Getting Started
2.1. Getting CUDA Samples
Windows
Linux
Mac OSX
2.2. Building Samples
Windows
Linux
Mac
2.3. CUDA Cross-Platform Samples
TARGET_ARCH
TARGET_OS
TARGET_FS
Copying Libraries
2.4. Using CUDA Samples to Create Your Own CUDA Projects
2.4.1. Creating CUDA Projects for Windows
2.4.2. Creating CUDA Projects for Linux
2.4.3. Creating CUDA Projects for Mac OS X
Chapter 3. Samples Reference
3.1. Simple Reference
asyncAPI
cdpSimplePrint - Simple Print (CUDA Dynamic Parallelism)
cdpSimpleQuicksort - Simple Quicksort (CUDA Dynamic Parallelism)
clock - Clock
clock_nvrtc - Clock libNVRTC
cppIntegration - C++ Integration
cppOverload
cudaOpenMP
fp16ScalarProduct - FP16 Scalar Product
inlinePTX - Using Inline PTX
inlinePTX_nvrtc - Using Inline PTX with libNVRTC
matrixMul - Matrix Multiplication (CUDA Runtime API Version)
matrixMul_nvrtc - Matrix Multiplication with libNVRTC
matrixMulCUBLAS - Matrix Multiplication (CUBLAS)
matrixMulDrv - Matrix Multiplication (CUDA Driver API Version)
simpleAssert
simpleAssert_nvrtc - simpleAssert with libNVRTC
simpleAtomicIntrinsics - Simple Atomic Intrinsics
simpleAtomicIntrinsics_nvrtc - Simple Atomic Intrinsics with libNVRTC
simpleCallback - Simple CUDA Callbacks
simpleCubemapTexture - Simple Cubemap Texture
simpleIPC
simpleLayeredTexture - Simple Layered Texture
simpleMPI
simpleMultiCopy - Simple Multi Copy and Compute
simpleMultiGPU - Simple Multi-GPU
simpleOccupancy
simpleP2P - Simple Peer-to-Peer Transfers with Multi-GPU
simplePitchLinearTexture - Pitch Linear Texture
simplePrintf
simpleSeparateCompilation - Simple Static GPU Device Library
simpleStreams
simpleSurfaceWrite - Simple Surface Write
simpleTemplates - Simple Templates
simpleTemplates_nvrtc - Simple Templates with libNVRTC
simpleTexture - Simple Texture
simpleTextureDrv - Simple Texture (Driver Version)
simpleVoteIntrinsics - Simple Vote Intrinsics
simpleVoteIntrinsics_nvrtc - Simple Vote Intrinsics with libNVRTC
simpleZeroCopy
systemWideAtomics - System wide Atomics
template - Template
UnifiedMemoryStreams - Unified Memory Streams
vectorAdd - Vector Addition
vectorAdd_nvrtc - Vector Addition with libNVRTC
vectorAddDrv - Vector Addition Driver API
3.2. Utilities Reference
bandwidthTest - Bandwidth Test
deviceQuery - Device Query
deviceQueryDrv - Device Query Driver API
p2pBandwidthLatencyTest - Peer-to-Peer Bandwidth Latency Test with Multi-GPUs
topologyQuery - Topology Query
3.3. Graphics Reference
bindlessTexture - Bindless Texture
Mandelbrot
marchingCubes - Marching Cubes Isosurfaces
simpleD3D10 - Simple Direct3D10 (Vertex Array)
simpleD3D10RenderTarget - Simple Direct3D10 Render Target
simpleD3D10Texture - Simple D3D10 Texture
simpleD3D11Texture - Simple D3D11 Texture
simpleD3D9 - Simple Direct3D9 (Vertex Arrays)
simpleD3D9Texture - Simple D3D9 Texture
simpleGL - Simple OpenGL
simpleGLES - Simple OpenGLES
simpleGLES_EGLOutput - Simple OpenGLES EGLOutput
simpleGLES_screen - Simple OpenGLES on Screen
simpleTexture3D - Simple Texture 3D
SLID3D10Texture - SLI D3D10 Texture
volumeFiltering - Volumetric Filtering with 3D Textures and Surface Writes
volumeRender - Volume Rendering with 3D Textures
3.4. Imaging Reference
bicubicTexture - Bicubic B-spline Interpolation
bilateralFilter - Bilateral Filter
boxFilter - Box Filter
convolutionFFT2D - FFT-Based 2D Convolution
convolutionSeparable - CUDA Separable Convolution
convolutionTexture - Texture-based Separable Convolution
cudaDecodeD3D9 - CUDA Video Decoder D3D9 API
cudaDecodeGL - CUDA Video Decoder GL API
dct8x8 - DCT8x8
dwtHaar1D - 1D Discrete Haar Wavelet Decomposition
dxtc - DirectX Texture Compressor (DXTC)
CUDA_EGLStreams_Interop - EGLStreams CUDA Interop
histogram - CUDA Histogram
HSOpticalFlow - Optical Flow
imageDenoising - Image denoising
postProcessGL - Post-Process in OpenGL
recursiveGaussian - Recursive Gaussian Filter
simpleCUDA2GL - CUDA and OpenGL Interop of Images
SobelFilter - Sobel Filter
stereoDisparity - Stereo Disparity Computation (SAD SIMD Intrinsics)
3.5. Finance Reference
binomialOptions - Binomial Option Pricing
binomialOptions_nvrtc - Binomial Option Pricing with libNVRTC
BlackScholes - Black-Scholes Option Pricing
BlackScholes_nvrtc - Black-Scholes Option Pricing with libNVRTC
MonteCarloMultiGPU - Monte Carlo Option Pricing with Multi-GPU support
quasirandomGenerator - Niederreiter Quasirandom Sequence Generator
quasirandomGenerator_nvrtc - Niederreiter Quasirandom Sequence Generator with libNVRTC
SobolQRNG - Sobol Quasirandom Number Generator
3.6. Simulations Reference
fluidsD3D9 - Fluids (Direct3D Version)
fluidsGL - Fluids (OpenGL Version)
fluidsGLES - Fluids (OpenGLES Version)
nbody - CUDA N-Body Simulation
nbody_opengles - CUDA N-Body Simulation with GLES
nbody_screen - CUDA N-Body Simulation on Screen
oceanFFT - CUDA FFT Ocean Simulation
particles - Particles
smokeParticles - Smoke Particles
VFlockingD3D10
3.7. Advanced Reference
alignedTypes - Aligned Types
c++11_cuda - C++11 CUDA
cdpAdvancedQuicksort - Advanced Quicksort (CUDA Dynamic Parallelism)
cdpBezierTessellation - Bezier Line Tessellation (CUDA Dynamic Parallelism)
cdpLUDecomposition - LU Decomposition (CUDA Dynamic Parallelism)
cdpQuadtree - Quad Tree (CUDA Dynamic Parallelism)
concurrentKernels - Concurrent Kernels
eigenvalues - Eigenvalues
fastWalshTransform - Fast Walsh Transform
FDTD3d - CUDA C 3D FDTD
FunctionPointers - Function Pointers
interval - Interval Computing
lineOfSight - Line of Sight
matrixMulDynlinkJIT - Matrix Multiplication (CUDA Driver API version with Dynamic Linking Version)
mergeSort - Merge Sort
newdelete - NewDelete
ptxjit - PTX Just-in-Time compilation
radixSortThrust - CUDA Radix Sort (Thrust Library)
reduction - CUDA Parallel Reduction
scalarProd - Scalar Product
scan - CUDA Parallel Prefix Sum (Scan)
segmentationTreeThrust - CUDA Segmentation Tree Thrust Library
shfl_scan - CUDA Parallel Prefix Sum with Shuffle Intrinsics (SHFL_Scan)
simpleHyperQ
sortingNetworks - CUDA Sorting Networks