DEEP FEATURE COMPRESSION FOR COLLABORATIVE OBJECT DETECTION
Hyomin Choi and Ivan V. Bajić
School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada
ABSTRACT
Recent studies have shown that the efficiency of deep neural networks in mobile applications can be significantly improved by distributing the computational workload between the mobile device and the cloud. This paradigm, termed collaborative intelligence, involves communicating feature data between the mobile and the cloud. The efficiency of such an approach can be further improved by lossy compression of feature data, which has not been examined to date. In this work we focus on collaborative object detection and study the impact of both near-lossless and lossy compression of feature data on its accuracy. We also propose a strategy for improving the accuracy under lossy feature compression. Experiments indicate that, using this strategy, the communication overhead can be reduced by up to 70% without sacrificing accuracy.
Index Terms— Deep feature compression, collaborative
intelligence, compression-augmentation, object detection
1. INTRODUCTION
Mobile and Internet-of-Things (IoT) [1] devices are increasingly relying on Artificial Intelligence (AI) engines to enable sophisticated applications such as personal digital assistants [2], self-driving vehicles, autonomous drones, smart cities, and so on. The AI engines themselves are generally built on deep learning models. The most common way of deploying such models is to place them in the cloud and have the sensor data (images, speech, etc.) uploaded from the mobile to the cloud for processing. This is referred to as the cloud-only approach. More recently, with smaller graphical processing units (GPUs) making their way into mobile/IoT devices, some deep models might be able to run on the mobile device, an approach referred to as mobile-only.
A recent study [3] has examined a spectrum of possibilities between the cloud-only and mobile-only extremes. Specifically, they considered splitting a deep network into two parts: the front end (consisting of an input layer and a number of subsequent layers), which runs on the mobile, and the back end (consisting of the remaining layers), which runs on the cloud. In this approach, termed collaborative intelligence, the front end computes features up to some layer in the network, then these features are uploaded to the cloud for the remainder of the computation. The authors examined the energy consumption and latency associated with performing computation in this way, for various split points in typical deep models. Their findings indicate that significant savings can be achieved in both energy and latency if the network is split appropriately. They also proposed an algorithm called Neurosurgeon to find the optimal split point, depending on whether energy or latency is to be minimized.
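As a toy illustration of such a split (our own sketch, not the implementation from [3]), a model can be viewed as a sequence of layer functions; the mobile runs the front end up to a chosen split index and uploads only that layer's output, while the cloud finishes the remaining layers. All layer shapes and the split index below are hypothetical:

```python
import numpy as np

# Toy "network": a list of layer functions standing in for conv/pool layers.
# Weights are fixed constants so the example is deterministic.
layers = [
    lambda x: np.maximum(x @ np.full((8, 6), 0.1), 0),  # layer 1: 8 -> 6, ReLU
    lambda x: np.maximum(x @ np.full((6, 4), 0.1), 0),  # layer 2: 6 -> 4, ReLU
    lambda x: x @ np.full((4, 2), 0.1),                 # layer 3: 4 -> 2
]

def run(x, layer_seq):
    """Apply a sequence of layers to input x."""
    for f in layer_seq:
        x = f(x)
    return x

split = 2  # hypothetical split point, as a Neurosurgeon-like profiler might pick

x = np.ones((1, 8))
features = run(x, layers[:split])          # front end, computed on the mobile
cloud_out = run(features, layers[split:])  # features "uploaded", back end on the cloud
full_out = run(x, layers)                  # reference: whole model in one place

# Splitting the computation does not change the result.
assert np.allclose(cloud_out, full_out)
```

The point of the split is that `features` may be much smaller than the raw input, so only a small payload crosses the uplink.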
The reason why collaborative intelligence can be more efficient than cloud-only and mobile-only approaches is that the feature data volume in deep convolutional neural networks (CNNs) typically decreases as we move from the input to the output. Executing the initial layers on the mobile costs some energy and time, but if the network is split appropriately, far less data needs to be uploaded to the cloud, which saves both transmission latency on the uplink and the energy used for radio transmission. Hence, on balance, there may be a net benefit in energy and/or latency. According to [3], depending on the resources available (GPU or CPU on the mobile, speed and energy for wireless transmission, etc.), optimal split points for CNNs tend to be deep in the network.
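This trend can be seen by tabulating per-layer feature volume in a typical CNN. The shapes below are hypothetical (loosely VGG-like, not taken from the paper), but they follow the usual pattern: spatial downsampling eventually outpaces channel growth, so deep layers carry far fewer values than the input:

```python
# Hypothetical (H, W, C) feature-map shapes along a VGG-like network;
# illustrative values only, not measurements from the paper.
shapes = {
    "input": (224, 224, 3),
    "conv1": (224, 224, 64),   # early layers can expand the data volume
    "pool2": (56, 56, 128),
    "pool4": (14, 14, 512),
    "pool5": (7, 7, 512),      # deep layers carry far fewer values
}

volumes = {name: h * w * c for name, (h, w, c) in shapes.items()}
for name, v in volumes.items():
    print(f"{name:>6}: {v:>10,} values")

# Splitting late in the network minimizes the data uploaded to the cloud.
assert volumes["pool5"] < volumes["input"]
```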
A recently released study [4] has extended the approach of [3] to include model training and additional network architectures. While the network is again split between the mobile and the cloud, in the framework proposed in [4] the data can move both ways between the mobile and the cloud in order to optimize the efficiency of both training and inference.
While [3, 4] have established the potential benefits of collaborative intelligence, the issue of efficient transfer of feature data between the mobile and the cloud is largely unexplored. Specifically, [3] does not consider feature compression at all, while [4] uses 8-bit quantization of feature data followed by lossless compression, but does not examine the impact of such processing on the application. Feature compression can further improve the efficiency of collaborative intelligence by minimizing the latency and energy of feature data transfer. The impact of compressing the input has been studied in several CNN applications [5, 6, 7], and the effects vary from case to case. However, to our knowledge, the impact of feature compression has not been studied yet.
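A rough sketch of the kind of pipeline [4] describes is uniform 8-bit quantization of a feature tensor followed by a lossless codec. The codec choice here (zlib) and the tensor shape are our assumptions for illustration; [4] does not necessarily use either:

```python
import numpy as np
import zlib

rng = np.random.default_rng(0)
# Stand-in feature tensor; real features would come from a split CNN layer.
# ReLU-like clipping makes it sparse, as post-activation features often are.
features = np.maximum(rng.standard_normal((64, 28, 28)), 0).astype(np.float32)

# Uniform 8-bit quantization over the tensor's dynamic range.
fmin, fmax = float(features.min()), float(features.max())
scale = (fmax - fmin) / 255.0 or 1.0  # guard against a constant tensor
q = np.round((features - fmin) / scale).astype(np.uint8)

# Lossless coding of the quantized bytes (zlib as an example codec).
compressed = zlib.compress(q.tobytes(), level=9)

# Cloud side: decompress and de-quantize; the reconstruction error is
# bounded by half a quantization step.
raw = zlib.decompress(compressed)
restored = np.frombuffer(raw, dtype=np.uint8).reshape(q.shape) * scale + fmin
assert np.abs(restored - features).max() <= scale / 2 + 1e-6

ratio = features.nbytes / len(compressed)
print(f"compression ratio vs. float32: {ratio:.1f}x")
```

This pipeline is near-lossless with respect to the quantized values; the open question the paper addresses is how such quantization error, and stronger lossy coding, affect detection accuracy downstream.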
In this work, we focus on a deep model for object detection and study the impact of feature compression on its accuracy. Section 2 presents preliminaries, while Section 3 describes the proposed methods. Experimental results and conclusions are presented in Sections 4 and 5, respectively.
arXiv:1802.03931v1 [cs.CV] 12 Feb 2018