Supporting CUDA for an extended RISC-V GPU architecture
Ruobing Han
hanruobing@gatech.edu
Georgia Institute of Technology
USA
Blaise Tine
blaisetine@gatech.edu
Georgia Institute of Technology
USA
Jaewon Lee
jaewon.lee@gatech.edu
Georgia Institute of Technology
USA
Jaewoong Sim
jaewoong@snu.ac.kr
Seoul National University
Korea
Hyesoon Kim
hyesoon@cc.gatech.edu
Georgia Institute of Technology
USA
ABSTRACT
With the rapid development of scientific computation, more and
more researchers and developers are committed to implementing
various workloads/operations on different devices. Among all these
devices, the NVIDIA GPU is the most popular choice due to its
comprehensive documentation and excellent development tools. As
a result, there are abundant resources for writing high-performance
CUDA code by hand. However, CUDA is mainly supported only by
commercial products, and there has been no support for open-source
hardware platforms. RISC-V is the most popular choice for a hardware
ISA, thanks to its elegant design and open-source license. In this
project, we aim to utilize existing CUDA code on RISC-V
devices. More specifically, we design and implement a pipeline that
can execute CUDA source code on a RISC-V GPU architecture. We
have succeeded in executing CUDA kernels with several important
features, such as multi-threading and atomic instructions, on a RISC-V
GPU architecture.
KEYWORDS
CUDA, RISC-V, Code Migration
ACM Reference Format:
Ruobing Han, Blaise Tine, Jaewon Lee, Jaewoong Sim, and Hyesoon Kim.
2021. Supporting CUDA for an extended RISC-V GPU architecture. In
Proceedings of ACM Conference (Conference’17). ACM, New York, NY, USA,
7 pages. https://doi.org/10.1145/nnnnnnn.nnnnnnn
1 INTRODUCTION
RISC-V is a highly popular choice among researchers in the aca-
demic community and engineers in hardware companies, largely
because of its open-source spirit. Its open-source licenses encourage
many researchers to devote themselves to developing a mature
ecosystem for RISC-V; in turn, more and more people are willing
to join the community, since existing codebases, hardware designs,
and other resources are already available.
Permission to make digital or hard copies of all or part of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed
for profit or commercial advantage and that copies bear this notice and the full citation
on the first page. Copyrights for components of this work owned by others than ACM
must be honored. Abstracting with credit is permitted. To copy otherwise, or republish,
to post on servers or to redistribute to lists, requires prior specific permission and/or a
fee. Request permissions from permissions@acm.org.
Conference’17, July 2017, Washington, DC, USA
© 2021 Association for Computing Machinery.
ACM ISBN 978-x-xxxx-xxxx-x/YY/MM. . . $15.00
https://doi.org/10.1145/nnnnnnn.nnnnnnn
In the RISC-V ecosystem, software support is the bottleneck for
the blooming of the RISC-V community. Although OpenCL is an
open platform for heterogeneous computing, CUDA has been more
widely used due to its stability and software toolchain support.
Unfortunately, CUDA source code can only be compiled and then
executed on NVIDIA devices, which is a major obstacle to using
RISC-V for a wide range of applications, especially high-performance
computing and machine learning workloads.
One way to solve this dilemma is code migration [8, 10, 13].
Instead of compiling CUDA source code with NVIDIA’s compiler
in the default way, some researchers try to parse and translate
the source code into other high-level languages; more detail is
given in Sec. 2.1. However, because these methods rely heavily
on the high similarity between CUDA and the target high-level
languages, they are not general solutions. Another solution is to
build a compiler that directly compiles the high-level CUDA
language into a low-level RISC-V binary file. To the best of our
knowledge, although there are translators that support generating
RISC-V, none of them can handle CUDA source code.
Thus, in this project we propose and build a pipeline to support
end-to-end CUDA migration: the pipeline accepts CUDA source
code as input and executes it on an extended RISC-V GPU
architecture. Our pipeline consists of several steps: it translates
CUDA source code into NVVM IR [4], converts the NVVM IR into
SPIR-V IR [7], forwards the SPIR-V IR to POCL [5] to obtain a
RISC-V binary file, and finally executes the binary file on an
extended RISC-V GPU architecture. We choose to use an intermediate
representation (SPIR-V) for two reasons: 1) RISC-V is still in
development and has many extensions, so we should not convert
CUDA directly into RISC-V, as that would make it difficult for our
pipeline to support new features in the future; 2) we want to make
our pipeline more general, so that it can support CUDA as the
front-end and RISC-V as the back-end. Our pipeline is shown in Fig. 1.
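To make the input side of the pipeline concrete, below is a minimal CUDA kernel of the kind the abstract mentions (multi-threaded execution plus an atomic instruction). The kernel name, signature, and launch parameters are illustrative assumptions, not taken from the paper:

```cuda
// Illustrative kernel: each thread scales one element and
// accumulates a global sum with an atomic read-modify-write.
__global__ void scaleAndSum(const float *in, float *out,
                            float *sum, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // global thread index
    if (i < n) {
        out[i] = 2.0f * in[i];   // independent per-thread work
        atomicAdd(sum, in[i]);   // atomic update of shared global memory
    }
}

// Host-side launch (sketch): 256 threads per block,
// enough blocks to cover all n elements.
// scaleAndSum<<<(n + 255) / 256, 256>>>(d_in, d_out, d_sum, n);
```

A pipeline like the one described above must lower both the thread-index intrinsics (`blockIdx`, `threadIdx`, `blockDim`) and `atomicAdd` through NVVM IR and SPIR-V down to their RISC-V GPU equivalents.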
In conclusion, the main contributions of our paper include the
following:
• we propose and implement a pipeline for executing CUDA source
code on a RISC-V GPU;
• we build a translator that supports translating from NVVM IR to
SPIR-V¹;
• our pipeline is easy to maintain and to extend with other
front-end languages and back-end devices.
¹https://github.com/gthparch/NVPTX-SPIRV-Translator
arXiv:2109.00673v1 [cs.PL] 2 Sep 2021