Implementing Deep Learning and Inferencing on
Fog and Edge Computing Systems
Swarnava Dey (Author)
Embedded Systems and Robotics
TCS Research & Innovation
Kolkata, India
Email: swarnava.dey@tcs.com
Arijit Mukherjee (Author)
Embedded Systems and Robotics
TCS Research & Innovation
Kolkata, India
Email: mukherjee.arijit@tcs.com
Abstract—The case for leveraging the computing resources of
smart devices at the edge of the network was conceptualized almost
nine years ago. Since then, concepts such as Cloudlets and Fog
Computing have been instrumental in realizing computing at the network
edge, in physical proximity to the data sources, for building
more responsive, scalable and available Cloud based services.
An essential requirement in smartphone applications, the Internet of
Things (IoT), field robotics and similar domains is the ability to analyze
large amounts of data with reasonable latency. Deep Learning is fast
becoming the de facto choice for performing this data analytics
owing to its ability to reduce human intervention in such
workflows. The major deterrents to providing Deep Learning based
Cloud services are Cloud outages and relatively high latency.
In this article the role of Fog Computing in addressing
these issues is discussed, the current state of standardization in
Fog/Edge Computing is reviewed, and the importance of
optimum resource provisioning for running Edge Analytics is
highlighted. A detailed design and evaluation of the distribution
and parallelization aspects of an Edge based Deep Learning
framework built from off-the-shelf components is presented, along with
strategies for optimum resource provisioning on constrained edge
devices, based on experiments measuring the system resource (CPU, GPU
and RAM) consumption of a Deep Convolutional Neural Network.
I. INTRODUCTION
In today's digital world, dumb, rule-based network end-devices
are giving way to intelligent, semi-autonomous devices
that provide tailor-made, real-time services. These service
endpoints include smartphones, wearable devices, autonomous
vehicles, robots, drones and other embedded systems used
in domains such as healthcare, city services, engineering,
finance and entertainment. As these sensor-fitted devices,
standalone sensors and human beings generate large amounts
of contextual data, the opportunities to apply artificial
intelligence (AI) for rendering more intelligent, customized and
effective services are increasing. Though these data-driven,
machine-inferred services promise to bring a paradigm
shift to several application areas, the major challenge
remains managing the big data generated by the data sources
and applying AI for learning/inferencing in a distributed
fashion to obtain near-accurate, near real-time responses. Owing
to the fast-changing nature of services and the need for short
time-to-market, the focus is shifting from traditional hand-engineering
of machine learning features towards automated feature generation
via Deep Learning (DL) [3], [5]. In a typical supervised DL application,
input data with the desired output is fed to a complex neural network
(NN); processing and non-linear transformations at several layers
adjust the NN to form a model, which can later be used to predict,
classify or encode new sets of data.
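As a concrete illustration of this supervised workflow, the sketch below trains a small network on labelled images and then reuses the fitted model on unseen inputs. It is a minimal example using the Keras API shipped with TensorFlow; the layer sizes and the MNIST dataset are illustrative assumptions, not the network evaluated later in this paper.

# Minimal supervised DL sketch: labelled data in, trained model out,
# later reused to classify unseen inputs.  Layer sizes and the MNIST
# dataset are illustrative choices only.
import numpy as np
import tensorflow as tf

# Labelled training data (inputs with the desired outputs).
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
x_train = x_train[..., np.newaxis] / 255.0
x_test = x_test[..., np.newaxis] / 255.0

# A small deep network: stacked layers with non-linear transformations.
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(16, (5, 5), activation="relu",
                           input_shape=(28, 28, 1)),
    tf.keras.layers.MaxPooling2D((2, 2)),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(10, activation="softmax"),
])

# Training adjusts the weights so the network forms a predictive model.
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.fit(x_train, y_train, batch_size=128, epochs=2)

# The fitted model is then reused on previously unseen data.
predictions = model.predict(x_test[:5])
print(predictions.argmax(axis=1))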
Cloud based sensor data analytics frameworks are often
challenged by the low latency requirements of such applications
and by intermittent network connectivity caused by the high mobility
of the devices. To handle these issues, resource-rich devices
within the local access network can be utilized; this was first
established in [1], where service software was offloaded for
execution on a Cloudlet virtual machine. Fog Computing [2],
proposed in 2012, envisaged deploying services within the local
access network to augment Cloud based deployments. As
an ongoing activity, the Multi-access Edge Computing (MEC) [6]
initiative from ETSI is standardizing the deployment of
applications and services, such as video analytics and the
Internet of Things (IoT), at the Radio Access Network
(RAN) edge. With the availability of state-of-the-art distributed
DL frameworks such as TensorFlow [7] and scalable Cloud/Fog
based infrastructure, it is now possible to rapidly develop intelligent,
data-driven services.
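As a rough sketch of how such a framework lets a single computation span Cloud and edge nodes, the snippet below places TensorFlow variables on a Cloud parameter server while the compute runs on an edge worker. The host names, ports and the tiny graph are hypothetical, and each listed host would need to run a matching process for the cluster to come up; this uses the TensorFlow 1.x distributed runtime, not any mechanism specific to this paper.

# Sketch of distributing a TensorFlow graph across Cloud and edge nodes.
# Host names and ports are hypothetical placeholders.
import tensorflow as tf

cluster = tf.train.ClusterSpec({
    "ps":     ["cloud-node.example.com:2222"],      # parameter server (Cloud)
    "worker": ["edge-node-0.local:2222",            # edge workers
               "edge-node-1.local:2222"],
})

# Each machine runs this script with its own job name and task index;
# the cluster only becomes usable once all listed processes are up.
server = tf.train.Server(cluster, job_name="worker", task_index=0)

# Variables live on the parameter server; compute runs on this worker.
with tf.device("/job:ps/task:0"):
    w = tf.get_variable("w", shape=[784, 10])
with tf.device("/job:worker/task:0"):
    x = tf.placeholder(tf.float32, shape=[None, 784])
    logits = tf.matmul(x, w)

with tf.Session(server.target) as sess:
    sess.run(tf.global_variables_initializer())
    # sess.run(logits, feed_dict={x: batch}) would execute the split graph.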
In this work we address the problem of running DL based analytics
in a Cloud-Edge setup. We focus on two primary requirements for the
successful deployment of such services: a) analyzing the resource
requirements of applications, in terms of processing speed and memory,
for optimum resource utilization, and b) using this workload information
to partition the load optimally between networked resources so as to
minimize deployment cost, response time or some other parameter
important for the problem at hand.
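As a toy illustration of requirement (b), the sketch below chooses the layer after which an NN could be split between an edge device and the Cloud so that an estimated response time is minimized. The per-layer timings, output sizes and uplink bandwidth are invented numbers, and this is not the partitioning scheme evaluated later in the paper.

# Toy illustration of workload-aware partitioning between edge and Cloud.
# All numbers below are invented for illustration only.

# Estimated per-layer compute time (seconds) on the edge device and on
# the Cloud, plus the size (MB) of each layer's output that must be
# shipped upstream if the network is cut after that layer.
edge_time   = [0.020, 0.035, 0.030, 0.015, 0.010]
cloud_time  = [0.002, 0.004, 0.003, 0.002, 0.001]
out_size_mb = [3.1, 1.6, 0.8, 0.1, 0.01]
raw_input_mb = 5.0      # assumed size of the raw input sample
uplink_mb_per_s = 2.0   # assumed edge-to-Cloud bandwidth

def response_time(cut):
    """Cut after layer `cut` (0 = run everything in the Cloud)."""
    t_edge = sum(edge_time[:cut])
    shipped = out_size_mb[cut - 1] if cut > 0 else raw_input_mb
    t_net = shipped / uplink_mb_per_s
    t_cloud = sum(cloud_time[cut:])
    return t_edge + t_net + t_cloud

best = min(range(len(edge_time) + 1), key=response_time)
print("split after layer", best, "->", round(response_time(best), 3), "s")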
We perform a fine-grained analysis of the resource requirements for
offloaded execution of a representative application that implements DL
analytics on streaming data, considering NN size (depth, width, number
of layers), data throughput and NN hyperparameters (feature extraction
filter size, batch size) with respect to execution time and CPU, GPU and
memory requirements, for different levels of prediction/classification
accuracy.
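One simple way to collect such measurements, sketched below for execution time, CPU and RAM only, is to sample the process with psutil while timing repeated forward passes at different batch sizes. The stand-in network and batch sizes are illustrative assumptions, not the benchmarking harness used in our experiments; GPU utilization would need a separate tool such as nvidia-smi.

# Sketch: measure execution time, CPU and RAM of repeated CNN inference
# at different batch sizes.  The network here is a stand-in, not the
# model benchmarked in this paper.
import time
import numpy as np
import psutil
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, (3, 3), activation="relu",
                           input_shape=(224, 224, 3)),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10, activation="softmax"),
])

proc = psutil.Process()
for batch in (1, 4, 16):
    x = np.random.rand(batch, 224, 224, 3).astype(np.float32)
    proc.cpu_percent(None)            # reset the CPU usage counter
    start = time.time()
    for _ in range(10):               # repeated forward passes
        model.predict(x, batch_size=batch)
    elapsed = time.time() - start
    print("batch=%d  time=%.2fs  cpu=%.0f%%  rss=%.0f MB"
          % (batch, elapsed, proc.cpu_percent(None),
             proc.memory_info().rss / 2**20))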
We present a set of benchmarking methods and results, and we hope
that these will help in provisioning optimum resources at the network
edge for designing effective DL analytics frameworks capable of
handling large volumes of streaming data. We also discuss the
distribution and parallelization strategies using