多核系统中节能实时任务调度的节点缩放分析

研究论文

10 浏览量更新于2024-08-27 收藏 506KB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

资源详情

资源推荐

http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation

information: DOI 10.1109/TC.2015.2485229, IEEE Transactions on Computers

IEEE TRANSACTIONS ON COMPUTERS, VOL. X, NO. X, JUNE 2015 3

done on the semi-partitioned scheme which assigns

statically most tasks to one ﬁxed core as in partitioned

scheduling, while a few number of tasks are split

into several subtasks, which are assigned to different

cores. Thus a scheduling algorithm for a multi-core

system in this paper represents a pair of single-core

scheduling algorithm and allocation algorithm. An

example for single-core algorithm is the well-known

RM algorithm or EDF algorithm [19], while FF (First

Fit) or WF (Worst Fit) algorithm can be considered as

examples of allocation algorithm.

Power Consumption Model. Normally the power

consumption of the core is represented through the

speed (frequency) and the voltage of the core. The

core’s power consumption function P consists of 2

parts: a static part existing even when there is no

workload (it’s due to the leakage) and a dynamic part

which is related to the frequency [20] .

P = DV

s + P

(1)

In the function 1, P

is the static part, and s =

−V

)

. D, V

, V

and k denote respectively the

effective switch capacitance, the threshold voltage,

the supply voltage, and a hardware-design-speciﬁc

constant. Note that V

≥ V

≥ 0, k > 0, and

D > 0, V

is usually proportional to the speed s. Thus

in the following analysis, we make the simplifying

assumption that the dynamic part scales by a factor of

, and P

is a constant. This simpliﬁcation is justiﬁed

by the close match between the data sheet curves

of real DVS processors and the analytical curves [6].

Actually we can have:

P (s) = Cs

+ P

(2)

where C and P

are constants.

The total power of a multi-core processor is simply

a sum of the power dissipated in each core: P

tot

(s). The execution requirement c

executed in a

time interval is linear of the core’s speed, and the

energy consumed for a core to execute a task at the

core’s speed s for t time units is t · P (s). Suppose a

task set is assigned to n cores in an identical multi-

core system, and there is not speed change during

time [t

, t

]. The total energy consumed in this time

interval for the task set is: E

tot

i=1

(s) ·(t

− t

3 NODE SCALING MODEL STRUCTURE

Today’s multicores are complex systems of cores,

caches, interconnects, memory controllers, multiple-

domain clocking, and other components. The power

consumption of each part in a multi-core processor

must be measured to precisely estimate cores’ power

efﬁciency. In order to model most of the existing

multi-core processors, Sniper [21] and McPAT [22]

are used to simulate the power consumption of pro-

cessors varying with the processor’s frequency. As a

next generation parallel, high-speed and accurate x86

simulator, Sniper simulator is based on the interval

core model and the Graphite simulation infrastruc-

ture, allowing for fast and accurate simulation when

exploring different homogeneous and heterogeneous

multi-core architectures. Sniper integrates with Mc-

PAT which is a power and area modeling framework

to estimate the program’s and processor’s power

consumption. In this article, the realistic Nehalem

systems are modeled and simulated to act as the target

multi-core system.

For a given real-time task set τ, an existing schedul-

ing algorithm should provide a feasible mapping for

τ on an identical multi-core system π, including the

number of required cores, the core’s speed (normal

equals to s

max

) and the assignment of tasks in each

core. This mapping is deﬁned as the initial state IT

(m, s), which includes the number of required cores

m and the core’s speed s. Then we extend the number

of cores to m

, m

≥ m, and reduce the speed of cores

to s

in the condition of keeping the schedulability of

the task set τ. The extension results in the extended

state ET

= (m

, s

). Trying to maximize the energy

consumption disparity between the initial state and

the extended state, we can ﬁnd the suitable m

and s

which result in the minimal energy consumption for

the given task set τ .

The structure of Node Scaling model is shown in

Fig. 1. We suppose the cores of a multi-core processor

connected by a network with a 2-dimension torus

topology. For a real-time task set τ , the initial state is

represented as the four dark gray cores in the dotted

line rectangle. Then Node Scaling model is used to

compute the extended core number and the reduced

speed. Thus Node Scaling scheduler reserves more

cores (the dark cores in the dashed line rectangle) and

reduces the core’s speed to slow down the system.

Therefore, the conditions to ensure the feasibility of

a task set in the case of scaling the number of cores

and reducing the core’s speed, and the extended state

which results in a maximal system energy saving are

two key problems need to be solved in Node Scaling

model.

4 SCHEDULABILITY TEST

In Node Scaling model, a real-time task set is sup-

posed to execute on the initial state(IT

) and the

extended state(ET

) in order to maximize the energy

consumption disparity between the two states. We

must ﬁnd the conditions which ensure all of tasks

can be accomplished before their deadlines without

missing the timing constraints when we extend IT

to ET

Funk et al. [23] studied the problem of exact test

for determining whether a given periodic task set is

feasible in an uniform multiprocessor platform. Since

an identical multi-core system can be considered as

剩余13页未读，继续阅读

weixin_38535221

粉丝: 3
资源: 936

多核系统中节能实时任务调度的节点缩放分析

Stretchability-aware block scaling for image retargeting

Power scaling for cognitive radio.pdf

Kang 等 - 2023 - Scaling up GANs for Text-to-Image Synthesis.pdf

api-ms-win-shcore-scaling-l1-1-1.dll

N-dependent scaling of grand-averaged estimates是什么意思

'train': ( "{cmd_mpi:s} nnp-scaling 100 > nnp-scaling-stdout.log 2> nnp-scaling-stdout.err; " "{cmd_mpi:s} nnp-train > nnp-train-stdout.log 2> nnp-train-stdout.err"), 'predict': '{cmd_mpi:s} nnp-dataset 0 > nnp-dataset-stdout.log 2> nnp-dataset-stdout.err'

python scaling

CloudSim可以参考哪些文献

为什么对ResNet34模型单独使用Linear scaling learning rate和共同使用Large-batch training与 Linear scaling learning rate和Large-batch training与No bias decay时模型性能反而下降

spyder中报这样的错怎么解决WARNING: C:/buildkite-agent/builds/buildkite-windows-cpu-autoscaling-group-i-0fc7796c793e6356f-1/xgboost/xgboost-ci-windows/src/objective/regression_obj.cu:213: reg:linear is now deprecated in favor of reg:squarederror.

power scaling

LSTM kubernetes

z-scaling用R语言的计算方式

oracle rac mpp

python提取脑电功率谱密度特征

在ROS Noetic版本的MoveIt中，使用Iterative Parabolic Time Parameterization算法对轨迹重规划的python函数

最新资源