Edge Computing: Dynamic Deep Neural Networks for Optimized Workload Allocation

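"A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing"

In modern information technology, edge computing is drawing growing attention as a complement to cloud computing. It moves data processing and application execution closer to the data source, that is, the device itself, to reduce latency and improve efficiency. However, edge devices such as autonomous robots and unmanned aerial vehicles face unstable communication and limited computing resources. This is especially true of battery-powered mobile devices, which must complete complex tasks such as deep neural network (DNN) computation on a limited energy budget. As accuracy requirements keep rising, modern DNN designs tend to use cascaded modularized layers, which significantly increases computational workload and resource occupancy and therefore drains the battery faster. One way to address this is to run a shallower network and offload part of the workload to backbone servers, but unstable communication channels then introduce significant latency overhead.

The paper "A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing" examines this problem and proposes a dynamic DNN design method. The method aims to allocate and adjust workload according to the edge device's real-time conditions and resource availability, balancing computational efficiency against energy consumption. By dynamically adjusting the DNN's structure and operations, the design can adapt to changing environmental conditions while minimizing communication latency and computational load. In a concrete implementation, such a dynamic DNN might include the following key components:

1. **State-awareness module**: monitors the device's current state, such as battery level, network connection quality, and compute utilization.
2. **Decision mechanism**: uses the collected state information to determine which tasks should run locally and which should be sent to the cloud.
3. **Adaptive network structure**: changes the depth and width of the network dynamically according to task demands and resource limits, possibly by activating or disabling certain layers.
4. **Optimization algorithm**: optimizes the workload allocation process for high performance and low latency, possibly including an online learning strategy that keeps improving the decisions.

With this approach, a dynamic DNN design has the potential to achieve optimal workload allocation in edge computing environments, maximizing device efficiency while reducing unnecessary energy consumption. This matters for the performance and endurance of autonomous mobile devices, particularly in applications with strict real-time and reliability requirements such as autonomous driving and remote monitoring.

Future research may further explore how to reduce the computational complexity and storage requirements of dynamic DNNs without sacrificing performance, and how to use the distributed nature of edge computing to process workloads cooperatively and optimize the system as a whole. Network instability and security are also key issues that dynamic DNN designs will need to address.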

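The four components listed above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `DeviceState` fields, the `choose_depth` and `decide_placement` helpers, and all numeric thresholds are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class DeviceState:
    """Snapshot of the edge device (hypothetical fields, each in 0.0 to 1.0)."""
    battery_level: float    # remaining charge
    link_quality: float     # estimated channel quality
    cpu_utilization: float  # current compute load


def choose_depth(state: DeviceState, min_depth: int = 2, max_depth: int = 8) -> int:
    """Adaptive structure: scale local depth by the scarcer resource."""
    headroom = min(state.battery_level, 1.0 - state.cpu_utilization)
    depth = min_depth + round(headroom * (max_depth - min_depth))
    return max(min_depth, min(max_depth, depth))


def decide_placement(state: DeviceState, confidence: float,
                     min_link: float = 0.3,
                     confidence_threshold: float = 0.8) -> str:
    """Decision mechanism: keep confident predictions, offload the rest."""
    # Offloading is not attempted when the channel is too poor (assumed cutoff).
    if state.link_quality < min_link:
        return "local"
    # A sufficiently confident local prediction is accepted as-is.
    if confidence >= confidence_threshold:
        return "local"
    return "offload"


state = DeviceState(battery_level=0.9, link_quality=0.9, cpu_utilization=0.2)
print(choose_depth(state), decide_placement(state, confidence=0.5))
```

The optimization component is left out on purpose: tuning `min_link` and `confidence_threshold` online would be one way to realize it, but the summary does not say how.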

A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing

Chi Lo, Yu-Yi Su, Chun-Yi Lee, and Shih-Chieh Chang

Dept. of Computer Science, National Tsing Hua University

No. 101, Sec. 2, Kuang-Fu Rd., Hsinchu, Taiwan 30013, R.O.C.

{chilo9212, wwball34}@gmail.com, {cylee, scchang}@cs.nthu.edu.tw

Abstract: Unreliable communication channels and limited computing resources at the edge end are two primary constraints of battery-powered movable devices, such as autonomous robots and unmanned aerial vehicles (UAVs). The impact is especially severe for those performing deep neural network (DNN) computations. With increasing demand for accuracy, the trend in modern DNN designs is the use of cascaded modularized layers. Implementing a deep network at the edge increases computational workloads and resource occupancy, leading to an increase in battery drain. Using a shallow network and offloading workloads to backbone servers, however, incurs significant latency overheads caused by unstable communication channels. Hence, dynamic DNN design techniques for efficient workload allocation are urgently required to manage the amount of workload transmissions while achieving the required accuracy. In this paper, we explore the use of an authentic operation (AO) unit and a dynamic network structure to enhance DNNs. The AO unit defines a set of stochastic threshold values for different DNN output classes and determines at runtime if an input has to be transferred to backbone servers for further analysis. The dynamic network structure adjusts its depth according to channel availability. Experiments have been comprehensively performed on several well-known DNN models and datasets. Our results show that, on average, the proposed techniques are able to reduce the amount of transmissions by up to 17% compared to previous methods under the same accuracy requirement.

Keywords: Deep neural network, workload allocation, edge computing, authentic operation, dynamic network structure
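The runtime check performed by the AO unit can be pictured with a short Python sketch. It is a simplification under stated assumptions: the abstract describes stochastic threshold values, while the sketch uses one fixed threshold per class, and it takes the maximum softmax probability as the confidence measure. The function names `softmax` and `ao_decision` and the example numbers are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw class scores."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ao_decision(logits, class_thresholds):
    """Return (predicted_class, confidence, offload).

    The input is flagged for offloading when the confidence of the
    predicted class falls below that class's threshold (an assumed rule).
    """
    probs = softmax(logits)
    predicted = max(range(len(probs)), key=probs.__getitem__)
    confidence = probs[predicted]
    offload = confidence < class_thresholds[predicted]
    return predicted, confidence, offload


# Hypothetical thresholds: class 0 is held to a stricter standard.
thresholds = [0.9, 0.6, 0.6]
print(ao_decision([4.0, 0.0, 0.0], thresholds))  # clear winner, kept at the edge
print(ao_decision([1.0, 0.8, 0.0], thresholds))  # ambiguous, sent to the server
```

Per-class thresholds let classes that the shallow network often confuses be offloaded more readily than classes it handles well.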

I. INTRODUCTION

Deep neural networks (DNNs) have emerged as a popular design paradigm in the area of image classification and object detection [1]–[12]. This is due to their capabilities to extract high-level and abstract features from raw data. A number of DNN architectures and training algorithms have been proposed to improve the accuracy of multilayer perceptrons (MLPs) and convolutional neural networks (CNNs) from different perspectives [3]–[5]. With increasing demand for high accuracy, there has been a trend in recent years to increase the number of layers of DNNs. Several state-of-the-art CNN architectures, such as AlexNet [2], Network In Network (NIN) [6], VGGNet [7], and GoogLeNet [8], contain from eight to dozens of hidden layers. Researchers have even further pushed the network size up to 152 layers [9], achieving an unprecedented error rate less than 5% on the famous ImageNet dataset [13]. It is believed that the deeper a network is, the higher the accuracy it delivers [7]–[9]. However, deeper neural networks usually require more computation, leading to higher workloads as well as resource occupancy compared to shallower ones. These computational workloads and resource requirements limit the scale of DNNs to be executed on energy-constrained embedded devices. Thus, an efficient method to manage the workloads of embedded systems performing DNN computations is urgently needed.

[Fig. 1: Workload sharing of DNN between edge and server. Diagram labels: robots and UAVs capture an image; built-in auxiliary network; "Trustworthy?" (Yes: obtain result); unreliable channel; remote principal network; obtain result.]

The concept of edge computing [14]–[16] is to perform data processing at the edge end, near the source of data. Edge-end embedded devices, such as unmanned aerial vehicles (UAVs) and autonomous robots, work synergistically with powerful servers to provide performance in modern edge computing systems. Edge-end devices usually have limited computational capability and resources, thus only shallower DNNs (denoted as auxiliary networks) can be accommodated, as compared to the deeper ones (denoted as principal networks) executed at the server end. In such systems, the edge devices may suffer from unstable communication channels. When the communication channels are fully accessible, edge devices can leverage both the auxiliary networks and principal networks to achieve high DNN accuracy. When the channels are unstable, edge devices can offload only a smaller share of their workloads to the server. As a result, utilizing the communication channels and efficiently allocating workloads between an auxiliary network and a principal network are of particular importance.

A promising strategy to deal with the above issues is to allocate difficult DNN workloads to the principal network at the remote server, while retaining easy ones at the edge. Whether to transfer an input from the edge to the server is determined by calculating its confidence level. Confidence level is used as a measure of the reliability of a prediction. The higher the confidence level is, the more trustworthy the prediction might be. Fig. 1 illustrates such a scenario. The input images captured by robots or UAVs are first analyzed

2017 IEEE 35th International Conference on Computer Design

DOI 10.1109/ICCD.2017.49

