TensorRT Open Source Library: A C++ Solution for Accelerating Deep Learning Inference

Updated 2024-12-15 · 9.13MB ZIP

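Resource summary: TensorRT is a C++ library from NVIDIA for improving the inference performance of deep learning models on NVIDIA GPUs. As a high-performance inference platform, it focuses on optimizing and executing trained models so that they run more efficiently and with lower latency on edge devices, in data centers, and in automotive systems. Using its proprietary compiler, optimizer, and runtime engine, it converts a trained neural network into an optimized TensorRT engine, which accelerates inference.

The TensorRT open source software (OSS) repository mentioned in the description contains the open source components of TensorRT. These include the TensorRT plugins and parsers, which support deep learning model formats such as Caffe and ONNX. The repository also provides a set of sample applications that show how to optimize a model with TensorRT and run efficient inference in a real deployment, so developers can see what TensorRT does and apply it in their own projects.

Because TensorRT is optimized for NVIDIA GPUs, building the TensorRT OSS components requires that the system meet specific software dependencies: particular versions of CUDA and cuDNN, plus GNU Make. CUDA (Compute Unified Device Architecture) is NVIDIA's parallel computing platform and programming model. cuDNN (CUDA Deep Neural Network library) is an acceleration library designed specifically for deep neural networks. GNU Make is a build tool that drives the compiler and linker to produce executables and libraries.

The tags C/C++ and Machine Learning indicate that TensorRT mainly targets machine learning developers who need high-performance computing. C/C++ offers high performance and close-to-hardware control, which makes it well suited to low-level development and optimization of deep learning models. The Machine Learning tag points to TensorRT's main use case: machine learning inference.

The name 'TensorRT-master' in the file list is the top-level directory of the archive, which suggests a snapshot of the repository's master branch. Developers can use this directory to obtain the TensorRT open source code and resources for study, research, and development.

Overall, TensorRT gives developers who use NVIDIA GPUs a powerful tool for achieving high performance and fast deployment in AI inference applications.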
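The build dependencies named in the summary can be checked before attempting a build. The following is a minimal sketch in Python, not part of the TensorRT repository. It assumes that the CUDA toolkit exposes its compiler as `nvcc` and that GNU Make is installed as `make`; both executable names are assumptions, as are the helper names `find_build_tools` and `missing_tools`. It does not check cuDNN (a library, not a command-line tool) or any version numbers.

```python
import shutil

# Assumed executable names: `nvcc` for the CUDA toolkit compiler and
# `make` for GNU Make. Adjust them if your system uses different names.
REQUIRED_TOOLS = ("nvcc", "make")


def find_build_tools(tools=REQUIRED_TOOLS):
    """Map each tool name to its resolved path, or None if it is not on PATH."""
    return {tool: shutil.which(tool) for tool in tools}


def missing_tools(found):
    """Return the sorted names of tools that could not be located."""
    return sorted(name for name, path in found.items() if path is None)


if __name__ == "__main__":
    found = find_build_tools()
    for name, path in found.items():
        print(f"{name}: {path or 'not found'}")
    missing = missing_tools(found)
    if missing:
        print("Missing build tools:", ", ".join(missing))
```

A tool that is found on PATH may still be the wrong version, so this check only rules out the most basic setup problems.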


TensorRT Open Source Library: A C++ Solution for Accelerating Deep Learning Inference (1126 files)

sampleOptions.cpp 55KB

sampleMLP.cpp 19KB

fused_multihead_attention_v2_int8_256_64_kernel.sm80.cpp 950KB

.coveragerc 210B

fused_multihead_attention_fp16_96_64_kernel.sm75.cpp 175KB

sampleUffFasterRCNN.cpp 26KB

batchedNMSPlugin.cpp 23KB

Makefile.config 8KB

fused_multihead_attention_v2_int8_384_64_kernel.sm86.cpp 1.22MB

fused_multihead_attention_fp16_128_64_kernel.sm75.cpp 310KB

fused_multihead_attention_v2_fp16_64_64_kernel.sm80.cpp 120KB

sampleNMT.cpp 60KB

fused_multihead_attention_v2_int8_192_64_kernel.sm75.cpp 1.44MB

fused_multihead_attention_v2_fp16_128_64_kernel.sm80.cpp 442KB

fused_multihead_attention_v2_int8_192_64_kernel.sm86.cpp 1.06MB

fused_multihead_attention_v2_int8_128_64_kernel.sm86.cpp 1.05MB

sampleUffMaskRCNN.cpp 23KB

fused_multihead_attention_v2_int8_128_64_kernel.sm75.cpp 1.13MB

fused_multihead_attention_v2_int8_384_64_kernel.sm75.cpp 1.33MB

fused_multihead_attention_v2_fp16_64_64_kernel.sm86.cpp 124KB

sampleINT8API.cpp 31KB

.clang-format 2KB

sampleInference.cpp 17KB

fused_multihead_attention_v2_fp16_256_64_kernel.sm80.cpp 394KB

samplePlugin.cpp 15KB

sampleMNISTAPI.cpp 17KB

fused_multihead_attention_fp16_384_64_kernel.sm86.cpp 190KB

fused_multihead_attention_v2_fp16_256_64_kernel.sm75.cpp 421KB

sampleOnnxMnistCoordConvAC.cpp 12KB

fused_multihead_attention_v2_fp16_384_64_kernel.sm86.cpp 440KB

embLayerNormVarSeqlenPlugin.cpp 18KB

pyFoundationalTypes.cpp 17KB

sampleSSD.cpp 15KB

fused_multihead_attention_int8_128_64_kernel.sm75.cpp 412KB

sampleMovieLens.cpp 23KB

pyPlugin.cpp 17KB

fused_multihead_attention_v2_int8_256_64_kernel.sm75.cpp 1016KB

multilevelProposeROIPlugin.cpp 16KB

set_ifndef.cmake 748B

fused_multihead_attention_fp16_128_64_kernel.sm80.cpp 262KB

pyCore.cpp 39KB

sampleMNIST.cpp 14KB

pyGraph.cpp 60KB

sampleFasterRCNN.cpp 20KB

nvFasterRCNNPlugin.cpp 17KB

fused_multihead_attention_v2_fp16_384_64_kernel.sm80.cpp 436KB

fused_multihead_attention_v2_int8_128_64_kernel.sm80.cpp 1.05MB

embLayerNormPlugin.cpp 20KB

sampleINT8.cpp 19KB

fused_multihead_attention_v2_fp16_96_64_kernel.sm75.cpp 207KB

find_library_create_target.cmake 1KB

sampleEngines.cpp 24KB

cropAndResizePlugin.cpp 16KB

regionPlugin.cpp 14KB

fused_multihead_attention_v2_int8_256_64_kernel.sm86.cpp 947KB

fused_multihead_attention_fp16_64_64_kernel.sm80.cpp 102KB

fused_multihead_attention_v2_int8_256_64_kernel.sm72.cpp 1.42MB

fused_multihead_attention_v2_fp16_96_64_kernel.sm80.cpp 202KB

sampleReformatFreeIO.cpp 22KB

fcPlugin.cpp 27KB

fused_multihead_attention_v2_fp16_128_64_kernel.sm75.cpp 478KB

proposalPlugin.cpp 26KB

fused_multihead_attention_v2_fp16_256_64_kernel.sm86.cpp 394KB

fused_multihead_attention_fp16_384_64_kernel.sm75.cpp 208KB

protobuf.cmake 10KB

sampleCharRNN.cpp 42KB

sampleUffPluginV2Ext.cpp 25KB

fused_multihead_attention_v2_int8_128_64_kernel.sm72.cpp 1.43MB

getopt.c 17KB

zlib.cmake 914B

qkvToContextPlugin.cpp 34KB

sampleAlgorithmSelector.cpp 26KB

fused_multihead_attention_v2_int8_192_64_kernel.sm80.cpp 1.07MB

fused_multihead_attention_fp16_96_64_kernel.sm80.cpp 170KB

setup.cfg 28B

fused_multihead_attention_int8_128_64_kernel.sm80.cpp 375KB

caffeParser.cpp 28KB

sampleReporting.cpp 13KB

sampleUffSSD.cpp 16KB

fused_multihead_attention_v2_fp16_64_64_kernel.sm75.cpp 117KB

fused_multihead_attention_v2_fp16_384_64_kernel.sm75.cpp 341KB

setup.cfg 28B

fused_multihead_attention_fp16_384_64_kernel.sm80.cpp 190KB

skipLayerNormPlugin.cpp 29KB

priorBoxPlugin.cpp 16KB

fused_multihead_attention_v2_fp16_128_64_kernel.sm86.cpp 444KB

fused_multihead_attention_fp16_64_64_kernel.sm75.cpp 104KB

setup.cfg 83B

sampleMovieLensMPS.cpp 28KB

fused_multihead_attention_int8_384_64_kernel.sm80.cpp 312KB

setup.cfg 39B

sampleDynamicReshape.cpp 19KB

fused_multihead_attention_v2_fp16_96_64_kernel.sm86.cpp 204KB

fused_multihead_attention_v2_int8_192_64_kernel.sm72.cpp 1.14MB

nmsPlugin.cpp 28KB

sampleUffMNIST.cpp 12KB

gridAnchorPlugin.cpp 17KB

fused_multihead_attention_v2_int8_384_64_kernel.sm80.cpp 1.22MB

fused_multihead_attention_int8_384_64_kernel.sm75.cpp 308KB

fused_multihead_attention_v2_int8_384_64_kernel.sm72.cpp 1.4MB
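Many entries in the list above follow a regular naming pattern, and a small parser makes the listing easier to summarize. The sketch below is illustrative only and is not part of the repository. It assumes the pattern `fused_multihead_attention[_v2]_<precision>_<sequence length>_<head size>_kernel.sm<arch>.cpp`, where the `sm` suffix is presumably the CUDA compute capability (for example `sm80` for 8.0). The reading of the two numeric fields as sequence length and head size is an assumption, as is labeling unversioned kernels `v1`.

```python
import re

# Assumed layout of the kernel file names seen in the listing:
# fused_multihead_attention[_v2]_<precision>_<seq_len>_<head_size>_kernel.sm<arch>.cpp
KERNEL_NAME = re.compile(
    r"^fused_multihead_attention(?:_(?P<version>v\d+))?"
    r"_(?P<precision>int8|fp16)"
    r"_(?P<seq_len>\d+)_(?P<head_size>\d+)"
    r"_kernel\.sm(?P<sm>\d+)\.cpp$"
)


def parse_kernel_name(filename):
    """Split a kernel file name into its fields, or return None if it does not match."""
    match = KERNEL_NAME.match(filename)
    if match is None:
        return None
    return {
        # Names without a version tag are labeled "v1" here (an assumption).
        "version": match.group("version") or "v1",
        "precision": match.group("precision"),
        "seq_len": int(match.group("seq_len")),      # assumed meaning
        "head_size": int(match.group("head_size")),  # assumed meaning
        "sm": int(match.group("sm")),
    }


if __name__ == "__main__":
    for name in (
        "fused_multihead_attention_v2_int8_256_64_kernel.sm80.cpp",
        "fused_multihead_attention_fp16_96_64_kernel.sm75.cpp",
        "sampleMLP.cpp",
    ):
        print(name, "->", parse_kernel_name(name))
```

Files that do not match the pattern, such as the sample and plugin sources, return `None` and can be filtered out before grouping the kernels by precision or architecture.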


Uploader: 安幕
