没有合适的资源?快使用搜索试试~ 我知道了~
首页语义分割神经网络ENet
资源详情
资源评论
资源推荐
ENet: A Deep Neural Network Architecture for
Real-Time Semantic Segmentation
Adam Paszke
Faculty of Mathematics, Informatics and Mechanics
University of Warsaw, Poland
a.paszke@students.mimuw.edu.pl
Abhishek Chaurasia, Sangpil Kim, Eugenio Culurciello
Electrical and Computer Engineering
Purdue University, USA
aabhish, sangpilkim, euge@purdue.edu
Abstract
The ability to perform pixel-wise semantic segmentation in real-time is of
paramount importance in mobile applications. Recent deep neural networks aimed
at this task have the disadvantage of requiring a large number of floating point oper-
ations and have long run-times that hinder their usability. In this paper, we propose
a novel deep neural network architecture named ENet (efficient neural network),
created specifically for tasks requiring low latency operation. ENet is up to 18
×
faster, requires 75
×
less FLOPs, has 79
×
less parameters, and provides similar or
better accuracy to existing models. We have tested it on CamVid, Cityscapes and
SUN datasets and report on comparisons with existing state-of-the-art methods,
and the trade-offs between accuracy and processing time of a network. We present
performance measurements of the proposed architecture on embedded systems and
suggest possible software improvements that could make ENet even faster.
1 Introduction
Recent interest in augmented reality wearables, home-automation devices, and self-driving vehicles
has created a strong need for semantic-segmentation (or visual scene-understanding) algorithms
that can operate in real-time on low-power mobile devices. These algorithms label each and every
pixel in the image with one of the object classes. In recent years, the availability of larger datasets
and computationally-powerful machines have helped deep convolutional neural networks (CNNs)
[
1
,
2
,
3
,
4
] surpass the performance of many conventional computer vision algorithms [
5
,
6
,
7
]. Even
though CNNs are increasingly successful at classification and categorization tasks, they provide coarse
spatial results when applied to pixel-wise labeling of images. Therefore, they are often cascaded with
other algorithms to refine the results, such as color based segmentation [
8
] or conditional random
fields [9], to name a few.
In order to both spatially classify and finely segment images, several neural network architectures
have been proposed, such as SegNet [
10
,
11
] or fully convolutional networks [
12
]. All these works
are based on a VGG16 [
13
] architecture, which is a very large model designed for multi-class
classification. These references propose networks with huge numbers of parameters, and long
inference times. In these conditions, they become unusable for many mobile or battery-powered
applications, which require processing images at rates higher than 10 fps.
In this paper, we propose a new neural network architecture optimized for fast inference and high
accuracy. Examples of images segmented using ENet are shown in Figure 1. In our work, we chose
arXiv:1606.02147v1 [cs.CV] 7 Jun 2016
若可再来
- 粉丝: 14
- 资源: 10
上传资源 快速赚钱
- 我的内容管理 收起
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
会员权益专享
最新资源
- zigbee-cluster-library-specification
- JSBSim Reference Manual
- c++校园超市商品信息管理系统课程设计说明书(含源代码) (2).pdf
- 建筑供配电系统相关课件.pptx
- 企业管理规章制度及管理模式.doc
- vb打开摄像头.doc
- 云计算-可信计算中认证协议改进方案.pdf
- [详细完整版]单片机编程4.ppt
- c语言常用算法.pdf
- c++经典程序代码大全.pdf
- 单片机数字时钟资料.doc
- 11项目管理前沿1.0.pptx
- 基于ssm的“魅力”繁峙宣传网站的设计与实现论文.doc
- 智慧交通综合解决方案.pptx
- 建筑防潮设计-PowerPointPresentati.pptx
- SPC统计过程控制程序.pptx
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功
评论0