Residual Networks of Residual Networks:
Multilevel Residual Networks
Ke Zhang, Member, IEEE, Miao Sun, Student Member, IEEE, Tony X. Han, Member, IEEE,
Xingfang Yuan, Student Member, IEEE, Liru Guo, and Tao Liu
Abstract—The residual-network family, with hundreds or even thousands of layers, dominates major image recognition tasks, but building a network by simply stacking residual blocks inevitably limits its optimization ability. This paper proposes a novel residual-network architecture, Residual networks of Residual networks (RoR), to dig into the optimization ability of residual networks. RoR substitutes optimizing the residual mapping of the residual mapping for optimizing the original residual mapping; in particular, it adds level-wise shortcut connections upon the original residual networks to promote the learning capability of residual networks. More importantly, RoR can be applied to various kinds of residual networks (Pre-ResNets and WRN) and significantly boosts their performance. Our experiments demonstrate the effectiveness and versatility of RoR, which achieves the best performance among all residual-network-like structures. Our RoR-3-WRN58-4 models achieve new state-of-the-art results on CIFAR-10, CIFAR-100 and SVHN, with test errors of 3.77%, 19.73% and 1.59%, respectively. These results outperform 1001-layer Pre-ResNets by 18.4% on CIFAR-10 and 13.1% on CIFAR-100.
Index Terms—Image classification, Residual networks, Resid-
ual networks of Residual networks, Shortcut, Stochastic Depth.
I. INTRODUCTION
CONVOLUTIONAL Neural Networks (CNNs) have made a significant impact on the computer vision community [1], and have been improving state-of-the-art results in many computer vision applications. Since AlexNet's [2] ground-breaking victory at the ImageNet Large Scale Visual Recognition Challenge 2012 (ILSVRC 2012) [4], deeper and deeper CNNs [2], [3], [5], [6], [7], [8], [9], [10], [11], [12] have been
This work is supported by National Natural Science Foundation of China
(Grants No. 61302163 and No. 61302105), Hebei Province Natural Science
Foundation (Grants No. F2015502062) and the Fundamental Research Funds
for the Central Universities.
K. Zhang is with the Department of Electronic and Communication
Engineering, North China Electric Power University, Baoding, Hebei, 071000
China e-mail: zhangkeit@ncepu.edu.cn.
M. Sun is with the Department of Electrical and Computer Engineering, University of Missouri, Columbia, MO, 65211 USA e-mail: msqz6@mail.missouri.edu.
T. X. Han is with the Department of Electrical and Computer Engineering, University of Missouri, Columbia, MO, 65211 USA e-mail: HanTX@missouri.edu.
X. Yuan is with the Department of Electrical and Computer Engineering, University of Missouri, Columbia, MO, 65211 USA e-mail: xyuan@mail.missouri.edu.
L. Guo is with the Department of Electronic and Communication Engineer-
ing, North China Electric Power University, Baoding, Hebei, 071000 China
e-mail: glr9292@126.com.
T. Liu is with the Department of Electronic and Communication Engineer-
ing, North China Electric Power University, Baoding, Hebei, 071000 China
e-mail: taoliu@ncepu.edu.cn.
Manuscript received , 2016; revised , 2016.
Fig. 1. The left image (dashed outline) is an original residual network, which contains a series of residual blocks; each residual block has one shortcut connection. The number (16, 32, or 64) of each residual block is the number of output feature maps. F(x) is the residual mapping and x is the identity mapping, so the original mapping is represented as F(x) + x. The right image (dashed outline) is our new residual networks of residual networks architecture with three levels. RoR is constructed by adding identity shortcuts level by level on top of the original residual networks.
proposed and achieved better performance on ImageNet or
other benchmark datasets. The results of these models revealed
the importance of network depth, as deeper networks lead to
superior results.
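To make the block structure of Fig. 1 concrete, the following is a minimal, illustrative PyTorch sketch (not the authors' released code; the class names and hyper-parameters are our own assumptions): a basic residual block computing f(F(x) + x), and an RoRLevel module that wraps a group of such blocks with one additional level-wise identity shortcut, which is how RoR adds shortcuts level by level.

    import torch.nn as nn

    class ResidualBlock(nn.Module):
        """Basic residual block: returns f(F(x) + x), with f = ReLU."""
        def __init__(self, channels):
            super().__init__()
            self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
            self.bn1 = nn.BatchNorm2d(channels)
            self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
            self.bn2 = nn.BatchNorm2d(channels)
            self.relu = nn.ReLU(inplace=True)

        def forward(self, x):
            out = self.relu(self.bn1(self.conv1(x)))  # first half of F(x)
            out = self.bn2(self.conv2(out))           # second half of F(x)
            return self.relu(out + x)                 # f(F(x) + x)

    class RoRLevel(nn.Module):
        """A group of residual blocks wrapped by one extra, level-wise shortcut."""
        def __init__(self, channels, num_blocks):
            super().__init__()
            self.blocks = nn.Sequential(*[ResidualBlock(channels)
                                          for _ in range(num_blocks)])
            self.relu = nn.ReLU(inplace=True)

        def forward(self, x):
            # The added identity shortcut skips the whole group of blocks, so the
            # group as a whole learns a residual on top of its inner residual blocks.
            return self.relu(self.blocks(x) + x)

A three-level RoR as in Fig. 1 would additionally place one root-level shortcut across all groups; wherever the number of feature maps changes (16 to 32, 32 to 64), the identity shortcut would have to be replaced by a projection (e.g., a 1x1 convolution), which this sketch omits.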
With a dramatic increase in depth, residual networks (ResNets) [12] achieved state-of-the-art performance in the ILSVRC 2015 classification, localization and detection tasks and the COCO detection and segmentation tasks. However, very deep models suffer from vanishing gradients and over-fitting, so the performance of thousand-layer ResNets is worse than that of hundred-layer ResNets. The Identity Mapping ResNets (Pre-ResNets) [13] then simplified residual-network training by rearranging each block into BN-ReLU-conv order. Pre-ResNets alleviate the vanishing-gradient problem, so the performance of thousand-layer Pre-ResNets can be further improved. The
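For reference, the BN-ReLU-conv order used by Pre-ResNets can be sketched as a pre-activation residual block; again this is an illustrative PyTorch sketch under our own naming, not the original implementation.

    import torch.nn as nn

    class PreActBlock(nn.Module):
        """Pre-activation residual block in BN-ReLU-conv order (Pre-ResNets [13])."""
        def __init__(self, channels):
            super().__init__()
            self.bn1 = nn.BatchNorm2d(channels)
            self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
            self.bn2 = nn.BatchNorm2d(channels)
            self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
            self.relu = nn.ReLU(inplace=True)

        def forward(self, x):
            # BN and ReLU are applied before each convolution ("pre-activation"),
            # and the shortcut is added without any nonlinearity afterwards, which
            # keeps the identity path clean and helps gradients flow in very deep nets.
            out = self.conv1(self.relu(self.bn1(x)))
            out = self.conv2(self.relu(self.bn2(out)))
            return out + x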