Learning Efficient Convolutional Networks through Network Slimming

Zhuang Liu¹*  Jianguo Li²  Zhiqiang Shen³  Gao Huang⁴  Shoumeng Yan²  Changshui Zhang¹
¹Tsinghua University  ²Intel Labs China  ³Fudan University  ⁴Cornell University
{liuzhuangthu, zhiqiangshen0214}@gmail.com, {jianguo.li, shoumeng.yan}@intel.com,
gh349@cornell.edu, zcs@mail.tsinghua.edu.cn
Abstract
The deployment of deep convolutional neural networks
(CNNs) in many real world applications is largely hindered
by their high computational cost. In this paper, we propose
a novel learning scheme for CNNs to simultaneously 1) re-
duce the model size; 2) decrease the run-time memory foot-
print; and 3) lower the number of computing operations,
without compromising accuracy. This is achieved by en-
forcing channel-level sparsity in the network in a simple but
effective way. Different from many existing approaches, the
proposed method directly applies to modern CNN architectures, introduces minimal overhead to the training process, and requires no special software/hardware accelerators for the resulting models. We call our approach network slimming: it takes wide and large networks as input models, automatically identifies insignificant channels during training, and prunes them afterwards, yielding thin and compact models with comparable accuracy. We empirically
demonstrate the effectiveness of our approach with several
state-of-the-art CNN models, including VGGNet, ResNet
and DenseNet, on various image classification datasets. For
VGGNet, a multi-pass version of network slimming gives a
20× reduction in model size and a 5× reduction in comput-
ing operations.
1. Introduction
In recent years, convolutional neural networks (CNNs)
have become the dominant approach for a variety of com-
puter vision tasks, e.g., image classification [22], object
detection [8], semantic segmentation [26]. Large-scale
datasets, high-end modern GPUs and new network architec-
tures allow the development of unprecedented large CNN
models. For instance, from AlexNet [22], VGGNet [31] and
GoogLeNet [34] to ResNets [14], the ImageNet Classifica-
tion Challenge winner models have evolved from 8 layers
to more than 100 layers.
∗This work was done when Zhuang Liu and Zhiqiang Shen were interns at Intel Labs China. Jianguo Li is the corresponding author.
However, larger CNNs, although with stronger representation power, are more resource-hungry. For instance, a 152-layer ResNet [14] has more than 60 million parameters and requires more than 20 giga floating-point operations (FLOPs) to perform inference on a single image of resolution 224×224. This is unlikely to be affordable on resource-constrained platforms such as mobile devices, wearables or Internet of Things (IoT) devices.
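To make such costs concrete, the arithmetic cost of a single convolutional layer can be estimated from its output resolution, channel counts, and kernel size. The sketch below is a back-of-the-envelope illustration, not part of the paper's method; the example layer shape (a 7×7, 64-filter, stride-2 stem convolution on a 224×224 RGB image) is a typical ResNet-style stem chosen here for illustration:

```python
def conv_macs(h_out, w_out, c_in, c_out, k):
    """Multiply-accumulate (MAC) count of one k x k convolution
    producing an h_out x w_out x c_out output from c_in input
    channels (bias terms omitted for simplicity)."""
    return h_out * w_out * c_in * c_out * k * k

# Illustrative example: a 7x7, 64-filter, stride-2 stem conv
# applied to a 224x224 RGB image produces a 112x112 output.
stem = conv_macs(112, 112, 3, 64, 7)
print(f"stem conv: {stem / 1e6:.1f}M MACs")  # roughly 118M MACs
```

Conventions differ between papers (one MAC is often counted as two FLOPs). Since the cost scales with the product of input and output channel counts, pruning channels, as network slimming does, reduces this product directly.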
The deployment of CNNs in real-world applications is mostly constrained by: 1) Model size: CNNs' strong representation power comes from their millions of trainable parameters. Those parameters, along with network structure
information, need to be stored on disk and loaded into mem-
ory during inference time. As an example, storing a typi-
cal CNN trained on ImageNet consumes more than 300MB
space, which is a big resource burden to embedded devices.
2) Run-time memory: During inference time, the interme-
diate activations/responses of CNNs could even take more
memory space than storing the model parameters, even with
batch size 1. This is not a problem for high-end GPUs, but
unaffordable for many applications with low computational
power. 3) Number of computing operations: The convolu-
tion operations are computationally intensive on high reso-
lution images. A large CNN may take several minutes to
process a single image on a mobile device, making it impractical to adopt in real applications.
Many works have been proposed to compress large
CNNs or directly learn more efficient CNN models for fast
inference. These include low-rank approximation [7], net-
work quantization [3, 12] and binarization [28, 6], weight
pruning [12], dynamic inference [16], etc. However, most
of these methods can only address one or two challenges
mentioned above. Moreover, some of the techniques require
specially designed software/hardware accelerators for exe-
cution speedup [28, 6, 12].
Another direction to reduce the resource consumption of large CNNs is to sparsify the network. Sparsity can be imposed at different levels of structure [2, 37, 35, 29, 25], which yields considerable model-size compression and inference speedup. However, these approaches generally re-
arXiv:1708.06519v1 [cs.CV] 22 Aug 2017