RENO: A Highly Efficient Reconfigurable Neuromorphic
Computing Accelerator Design∗
Xiaoxiao Liu, Mengjie Mao, Beiye Liu, Hai Li, Yiran Chen
University of Pittsburgh
Pittsburgh, USA
{xil116, mem231, bel34, hal66, yic52}@pitt.edu
Boxun Li, Yu Wang
Tsinghua University
Beijing, P.R. China
{lbx13, yu-wang}@mails.tsinghua.edu.cn
Hao Jiang
San Francisco State University
San Francisco, USA
jianghao@sfsu.edu
Mark Barnell, Qing Wu
Air Force Research Laboratory
Rome, USA
{mark.barnell.1, qing.wu.2}@us.af.mil
Jianhua Yang
University of Massachusetts
Amherst, USA
jjyang@umass.edu
ABSTRACT
Neuromorphic computing is gaining significant attention as a promising candidate for overcoming the well-known von Neumann bottleneck. In this work, we propose RENO, an efficient reconfigurable neuromorphic computing accelerator. RENO leverages the extremely efficient mixed-signal computation capability of memristor-based crossbar (MBC) arrays to speed up the execution of artificial neural networks (ANNs). The hierarchically arranged MBC arrays can be configured into a variety of ANN topologies through a mixed-signal interconnection network (M-Net). Simulation results on seven ANN applications show that, compared to the baseline general-purpose processor, RENO achieves on average 178.4× (27.06×) performance speedup and 184.2× (25.23×) energy savings for the high-efficiency multilayer perceptron (high-accuracy auto-associative memory) implementation. Moreover, compared to a pure digital neural processing unit (D-NPU) and to a design whose MBC arrays cooperate through a digital interconnection network, RENO still achieves the fastest execution time and the lowest energy consumption at similar computation accuracy.
1. INTRODUCTION
Traditional von Neumann computers require frequent data exchange between processors and memory chips. This design severely limits system performance and efficiency, especially in computation-intensive cognitive applications. As a promising candidate to overcome the inefficiency of the von Neumann architecture, neuromorphic systems have recently become a major research focus for future tera-scale computing. Many studies have been conducted on the hardware implementation of artificial neural networks (ANNs) in both the digital and analog domains.
∗This work is supported in part by NSF XPS-1337198, NSF CNS-1116171, AFRL FA8750-15-2-0048, DARPA D13AP00042, HP Lab Innov. Res. Pgm, NSFC 61373026, and Tsinghua Univ. Init. Sci. Res. Pgm. Received and approved for public release by AFRL on 03/04/2015, case number 88ABW-2015-0833. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of AFRL or its contractors.
Examples include neural network accelerators for signal processing [5], digital approximate computing accelerators that leverage neural network algorithms [8], and heterogeneous systems built with GPUs and APUs for deep learning acceleration [9]. However, traditional CMOS technology has proven inefficient for neuromorphic system design, as dozens of transistors are usually required to build one neuron [5].
The discovery of nanoscale memristor devices [6] inspired an exciting approach to implementing neuromorphic systems. In particular, the similarity between the programmable resistance states of memristors and the variable synaptic strengths of biological synapses dramatically simplifies the circuit realization of neural network models. This specialty of memristors has been investigated and exploited in a few research works that focus on the circuit implementation of the matrix-vector multiplications in conventional approximate computing acceleration [16, 17].
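To make the analogy concrete, the following minimal sketch (our illustration, not circuitry from the paper; the conductance range and scaling are assumed values) models how a crossbar evaluates a matrix-vector product in a single step: input voltages drive the rows, weights are stored as memristor conductances, and each column current sums the products by Kirchhoff's current law. Signed weights are emulated with a differential pair of columns, a common convention.

```python
import numpy as np

def weights_to_conductances(W, g_min=1e-6, g_max=1e-3):
    """Map signed weights onto strictly positive memristor conductances.

    A signed weight is realized as the difference of two columns
    (G_plus - G_minus). The conductance range is illustrative only.
    """
    scale = (g_max - g_min) / np.abs(W).max()
    G_plus = g_min + scale * np.clip(W, 0, None)    # positive weight part
    G_minus = g_min + scale * np.clip(-W, 0, None)  # negative weight part
    return G_plus, G_minus, scale

def crossbar_mvm(v_in, G_plus, G_minus, scale):
    """One analog multiply-accumulate: column currents I_j = sum_i G_ij * V_i."""
    i_out = (G_plus - G_minus).T @ v_in  # differential column currents (KCL)
    return i_out / scale                 # rescale currents back to W^T v

# A 3-input, 2-output synaptic layer.
W = np.array([[0.5, -0.2],
              [0.1,  0.8],
              [-0.7, 0.3]])
v = np.array([0.2, -0.1, 0.4])           # row input voltages
Gp, Gm, s = weights_to_conductances(W)
print(crossbar_mvm(v, Gp, Gm, s))        # matches the digital reference
print(W.T @ v)
```

In an actual crossbar the entire product is produced in one parallel analog step, which is the source of the efficiency exploited by RENO.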
In this work, we propose RENO, a novel and efficient reconfigurable neuromorphic computing accelerator. RENO uses on-chip memristor-based crossbar (MBC) arrays to implement perceptron networks, aiming at the acceleration of ANN computations. Unlike many neuromorphic systems that perform computations on purely digital ALUs or on analog approximate computing units behind AD/DA interfaces, our design adopts a hybrid data representation: the computation within the MBC arrays and the signal communication among the MBC arrays are conducted in the analog domain, while the control information remains digital. Compared to existing implementations of digital ANN accelerators and approximate computing units, the key distinctions of RENO can be summarized as follows:
• An efficient memristor-based mixed-signal accelerator is designed to speed up neuromorphic computing and to support the implementation of a variety of neural network topologies;
• A mixed-signal interconnection network (M-Net) is proposed to carry computational signals among the MBC arrays in the analog domain (a toy contrast with a digital interconnect follows this list);
• An optimized configuration is derived by thoroughly analyzing the impact of various design parameters on system performance and accuracy.
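To see why keeping inter-array signals analog matters, the minimal sketch below (our own toy model, not RENO's circuit; the 4-bit resolution, sigmoid neuron, and weight values are assumptions) contrasts a two-layer perceptron whose hidden activations pass directly between two crossbar arrays with the same network routed through a digital interconnect, where the hop pays an AD/DA quantization.

```python
import numpy as np

def quantize(x, bits=4, x_max=1.0):
    """Model one AD/DA round trip: clip, then round to a uniform grid."""
    levels = 2 ** bits - 1
    xq = np.clip(x, -x_max, x_max)
    return np.round((xq + x_max) / (2 * x_max) * levels) / levels * 2 * x_max - x_max

def layer(x, W):
    """One perceptron layer: crossbar MVM followed by a sigmoid neuron."""
    return 1.0 / (1.0 + np.exp(-(W.T @ x)))

rng = np.random.default_rng(0)
W1, W2 = rng.normal(0, 0.5, (8, 6)), rng.normal(0, 0.5, (6, 4))
x = rng.uniform(-1, 1, 8)

# M-Net style: hidden activations stay analog between the two MBC arrays.
y_mnet = layer(layer(x, W1), W2)

# Digital interconnect: the inter-array hop is quantized through AD/DA.
y_dnet = layer(quantize(layer(x, W1)), W2)

print("output deviation introduced by the per-hop AD/DA:",
      np.abs(y_mnet - y_dnet).max())
```

Beyond the quantization error modeled here, each avoided conversion also saves the AD/DA latency and energy, which is consistent with RENO outperforming the digital-interconnect design in both execution time and energy at similar accuracy.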
RENO offers a cost-efficient and fault-tolerant ANN computation platform that complements the general-purpose computation of CPU cores. In our evaluations of RENO, we adopt a set of prevailing ANN benchmarks and two ANN topologies, multilayer perceptron (MLP) and auto-associative memory (AAM), to demonstrate the tradeoff between computation performance and accuracy across different RENO configurations. Simulation results show that compared to the baseline