没有合适的资源?快使用搜索试试~ 我知道了~
首页多核系统中节能实时任务调度的节点缩放分析
多核系统中节能实时任务调度的节点缩放分析
0 下载量 10 浏览量
更新于2024-08-27
收藏 506KB PDF 举报
本文档标题为《节点规模分析:针对功率感知的实时任务调度》,发表于2015年的IEEE Transactions on Computers期刊。随着多核处理器的普及,动态电压缩放(DVS)技术被广泛应用在性能与能耗之间寻求平衡。研究焦点在于针对相同架构的多核系统中的实时任务,探讨如何实现一种功率感知的高效调度策略。 作者们提出了一种名为NodeScaling模型的方法,旨在解决在电源管理下实时任务的调度问题。他们发现,对于一个给定的任务集,存在一个理论上的速度上限,这个速度下的能耗最低。换句话说,当任务运行速度达到这个上限时,系统的总体能耗将达到最小。这个发现对于理解和优化能源效率至关重要。 文章的核心贡献是证明了任务集中的最大任务利用率umax(即所有任务使用的计算能力占系统总能力的比例)与功率效率的关系。通过分析,他们揭示了umax对系统能耗的影响,以及如何通过合理的任务调度策略来最大化umax,同时保持在能耗最优的范围内。 此外,文中还可能探讨了影响节点规模、任务优先级、任务依赖性等因素对功率感知调度的影响,以及如何通过算法设计来动态调整处理器频率,以适应实时任务的实时性和能量约束。最后,鉴于文章尚未经过最终编辑,可能还包含了初步的仿真结果、比较现有调度算法的性能优势以及未来研究方向的讨论。 这篇论文为理解和优化现代多核系统中的实时任务调度提供了一个理论框架,有助于电力高效的计算平台设计,对节能计算和绿色计算有着实际的应用价值。
资源详情
资源推荐
0018-9340 (c) 2015 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See
http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation
information: DOI 10.1109/TC.2015.2485229, IEEE Transactions on Computers
IEEE TRANSACTIONS ON COMPUTERS, VOL. X, NO. X, JUNE 2015 3
done on the semi-partitioned scheme which assigns
statically most tasks to one fixed core as in partitioned
scheduling, while a few number of tasks are split
into several subtasks, which are assigned to different
cores. Thus a scheduling algorithm for a multi-core
system in this paper represents a pair of single-core
scheduling algorithm and allocation algorithm. An
example for single-core algorithm is the well-known
RM algorithm or EDF algorithm [19], while FF (First
Fit) or WF (Worst Fit) algorithm can be considered as
examples of allocation algorithm.
Power Consumption Model. Normally the power
consumption of the core is represented through the
speed (frequency) and the voltage of the core. The
core’s power consumption function P consists of 2
parts: a static part existing even when there is no
workload (it’s due to the leakage) and a dynamic part
which is related to the frequency [20] .
P = DV
2
dd
s + P
s
(1)
In the function 1, P
s
is the static part, and s =
k
(V
dd
−V
t
)
2
V
dd
. D, V
t
, V
dd
and k denote respectively the
effective switch capacitance, the threshold voltage,
the supply voltage, and a hardware-design-specific
constant. Note that V
dd
≥ V
t
≥ 0, k > 0, and
D > 0, V
dd
is usually proportional to the speed s. Thus
in the following analysis, we make the simplifying
assumption that the dynamic part scales by a factor of
s
3
, and P
s
is a constant. This simplification is justified
by the close match between the data sheet curves
of real DVS processors and the analytical curves [6].
Actually we can have:
P (s) = Cs
3
+ P
s
(2)
where C and P
s
are constants.
The total power of a multi-core processor is simply
a sum of the power dissipated in each core: P
tot
=
P
P
i
(s). The execution requirement c
i
executed in a
time interval is linear of the core’s speed, and the
energy consumed for a core to execute a task at the
core’s speed s for t time units is t · P (s). Suppose a
task set is assigned to n cores in an identical multi-
core system, and there is not speed change during
time [t
0
, t
1
]. The total energy consumed in this time
interval for the task set is: E
tot
=
P
n
i=1
P
i
(s) ·(t
1
− t
0
).
3 NODE SCALING MODEL STRUCTURE
Today’s multicores are complex systems of cores,
caches, interconnects, memory controllers, multiple-
domain clocking, and other components. The power
consumption of each part in a multi-core processor
must be measured to precisely estimate cores’ power
efficiency. In order to model most of the existing
multi-core processors, Sniper [21] and McPAT [22]
are used to simulate the power consumption of pro-
cessors varying with the processor’s frequency. As a
next generation parallel, high-speed and accurate x86
simulator, Sniper simulator is based on the interval
core model and the Graphite simulation infrastruc-
ture, allowing for fast and accurate simulation when
exploring different homogeneous and heterogeneous
multi-core architectures. Sniper integrates with Mc-
PAT which is a power and area modeling framework
to estimate the program’s and processor’s power
consumption. In this article, the realistic Nehalem
systems are modeled and simulated to act as the target
multi-core system.
For a given real-time task set τ, an existing schedul-
ing algorithm should provide a feasible mapping for
τ on an identical multi-core system π, including the
number of required cores, the core’s speed (normal
equals to s
max
) and the assignment of tasks in each
core. This mapping is defined as the initial state IT
π
=
(m, s), which includes the number of required cores
m and the core’s speed s. Then we extend the number
of cores to m
0
, m
0
≥ m, and reduce the speed of cores
to s
0
in the condition of keeping the schedulability of
the task set τ. The extension results in the extended
state ET
π
= (m
0
, s
0
). Trying to maximize the energy
consumption disparity between the initial state and
the extended state, we can find the suitable m
0
and s
0
which result in the minimal energy consumption for
the given task set τ .
The structure of Node Scaling model is shown in
Fig. 1. We suppose the cores of a multi-core processor
connected by a network with a 2-dimension torus
topology. For a real-time task set τ , the initial state is
represented as the four dark gray cores in the dotted
line rectangle. Then Node Scaling model is used to
compute the extended core number and the reduced
speed. Thus Node Scaling scheduler reserves more
cores (the dark cores in the dashed line rectangle) and
reduces the core’s speed to slow down the system.
Therefore, the conditions to ensure the feasibility of
a task set in the case of scaling the number of cores
and reducing the core’s speed, and the extended state
which results in a maximal system energy saving are
two key problems need to be solved in Node Scaling
model.
4 SCHEDULABILITY TEST
In Node Scaling model, a real-time task set is sup-
posed to execute on the initial state(IT
π
) and the
extended state(ET
π
) in order to maximize the energy
consumption disparity between the two states. We
must find the conditions which ensure all of tasks
can be accomplished before their deadlines without
missing the timing constraints when we extend IT
π
to ET
π
.
Funk et al. [23] studied the problem of exact test
for determining whether a given periodic task set is
feasible in an uniform multiprocessor platform. Since
an identical multi-core system can be considered as
剩余13页未读,继续阅读
weixin_38535221
- 粉丝: 3
- 资源: 936
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- 十种常见电感线圈电感量计算公式详解
- 军用车辆:CAN总线的集成与优势
- CAN总线在汽车智能换档系统中的作用与实现
- CAN总线数据超载问题及解决策略
- 汽车车身系统CAN总线设计与应用
- SAP企业需求深度剖析:财务会计与供应链的关键流程与改进策略
- CAN总线在发动机电控系统中的通信设计实践
- Spring与iBATIS整合:快速开发与比较分析
- CAN总线驱动的整车管理系统硬件设计详解
- CAN总线通讯智能节点设计与实现
- DSP实现电动汽车CAN总线通讯技术
- CAN协议网关设计:自动位速率检测与互连
- Xcode免证书调试iPad程序开发指南
- 分布式数据库查询优化算法探讨
- Win7安装VC++6.0完全指南:解决兼容性与Office冲突
- MFC实现学生信息管理系统:登录与数据库操作
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功