
Workshop track - ICLR 2018
DLVM: A MODERN COMPILER INFRASTRUCTURE FOR
DEEP LEARNING SYSTEMS
Richard Wei
Departments of Computer Science & Linguistics
University of Illinois at Urbana-Champaign
Urbana, IL 61801
xwei12@illinois.edu
Lane Schwartz
Department of Linguistics
University of Illinois at Urbana-Champaign
Urbana, IL 61801
lanes@illinois.edu
Vikram Adve
Department of Computer Science
University of Illinois at Urbana-Champaign
Urbana, IL 61801
vadve@illinois.edu
ABSTRACT
Deep learning software demands reliability and performance. However, many existing
deep learning frameworks are software libraries that act as an unsafe DSL embedded
in Python paired with a computation graph interpreter. We present DLVM, a design
and implementation of a compiler infrastructure with a linear algebra intermediate
representation, algorithmic differentiation by adjoint code generation, domain-
specific optimizations, and a code generator targeting GPUs via LLVM. Designed
as a modern compiler infrastructure inspired by LLVM, DLVM is more modular
and more generic than existing deep learning compiler frameworks, and supports
tensor DSLs with high expressivity. With our prototypical staged DSL embedded
in Swift, we argue that the DLVM system enables modular, safe, and performant
frameworks for deep learning.
1 INTRODUCTION
Within the deep learning community, most current approaches to neural networks make use of
high-level frameworks with a tensor domain-specific language (DSL) such as Torch (Collobert et al.,
2011), TensorFlow (Abadi et al., 2016), PyTorch (PyTorch Development Team, 2016), and MXNet
(Chen et al., 2015). Traditionally, developers would build a computation graph (or dynamically
generate graph nodes) using a DSL and let the framework interpret the computation graph on parallel
architectures such as NVIDIA GPUs. While using hand-tuned GPU subroutines usually yields the
best performance for complex operators, advanced compiler techniques can be applied to simplify
computation, merge high-level operators based on shaping conditions, and fuse compatible element-
wise operators into a single kernel to minimize the latency between kernel launches. Recent projects
such as the TensorFlow XLA compiler (Leary & Wang, 2017) and the NNVM compiler (NNVM, 2017),
which includes TVM (Chen et al., 2017), have begun to apply compiler techniques to deep learning
systems, targeting LLVM (Lattner & Adve, 2004) and various back-ends to achieve good performance.
However, their design and implementation have not entirely followed the established best practices
of widely used industrial compiler frameworks.
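To make the payoff of fusion concrete, consider the following minimal Swift sketch (purely
illustrative; the function names and the CPU array-based setup are hypothetical, not DLVM's API
or IR). The unfused version runs three separate passes, analogous to three kernel launches with
intermediate buffers; the fused version computes the same result in a single pass:

    // Unfused: each operator is a separate pass ("kernel") with a temporary buffer.
    func unfused(_ x: [Float], _ y: [Float]) -> [Float] {
        let scaled = x.map { $0 * 2 }                // pass 1: scale
        let summed = zip(scaled, y).map { $0 + $1 }  // pass 2: add
        return summed.map { max(0, $0) }             // pass 3: ReLU
    }

    // Fused: one pass over the data, no intermediate buffers.
    func fused(_ x: [Float], _ y: [Float]) -> [Float] {
        var out = [Float](repeating: 0, count: x.count)
        for i in x.indices {
            out[i] = max(0, x[i] * 2 + y[i])         // scale + add + ReLU in one step
        }
        return out
    }

On a GPU, a fusing compiler performs the analogous rewrite across kernels, eliminating both the
intermediate tensors and the launch latency between them.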
Moreover, some frameworks use operator-overloading algorithmic differentiation (AD) to compute
gradients, which leaves the gradient computation opaque to optimization. The other approach to AD,
source code transformation, can produce more efficient code. While frameworks such as TensorFlow
already perform AD as a graph transformation and apply various optimizations, their AD transformation
is implemented as part of the DSL library rather than as a transformation pass in the pipeline of
their compiler framework. Making AD part of the compiler framework would greatly simplify the
development of DSLs, achieving separation of concerns.
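As an illustration of the difference, the following Swift sketch shows, for a scalar function, the
kind of adjoint code a source-code-transformation AD pass would generate (hand-written here; the
names are hypothetical, and DLVM's actual adjoint generation operates on its tensor IR rather than
on Swift source):

    // Primal: f(x, w) = (x * w)^2
    func f(_ x: Float, _ w: Float) -> Float {
        let t = x * w
        return t * t
    }

    // Adjoint: given the seed dy = dL/df, propagate gradients backwards
    // through each primal statement in reverse order.
    func fAdjoint(_ x: Float, _ w: Float, _ dy: Float) -> (dx: Float, dw: Float) {
        let t = x * w        // recompute (or reuse) the primal intermediate
        let dt = dy * 2 * t  // d(t * t)/dt = 2t
        let dx = dt * w      // d(x * w)/dx = w
        let dw = dt * x      // d(x * w)/dw = x
        return (dx, dw)
    }

Because the adjoint is ordinary code rather than a trace of overloaded operators, it can flow
through the same optimization passes as any other function.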