多核处理器的架构与门级功率模型结合：提升低功耗与仿真效率

188 浏览量更新于2024-08-26 收藏 226KB PDF 举报

随着信息技术的飞速发展，多核处理器已经成为现代计算系统中的关键组件。低功耗已经成为了衡量多核处理器性能的一个重要指标，尤其在追求高效能的同时，如何有效地管理和降低能耗成为了设计师们亟待解决的问题。随着多核处理器的复杂性不断增加，准确而高效的功耗估算变得至关重要，这不仅涉及到系统级别的架构设计，也涉及到底层的电路实现。本文提出了一种新颖的APowerModelCombined，即结合了架构级和门级的多核处理器功率模型。该模型将复杂的多核处理器分解为一系列可配置的建筑块，如处理器核心、缓存、总线等，每个建筑块都有其特定的参数化RTL（ Register Transfer Level）设计。通过这个模型，研究人员可以精细地模拟每个建筑块在不同工作负载下的行为，进而估算出它们的门级功率消耗。在这个过程中，作者使用参数化的RTL来建模，这种方法允许动态调整电路特性，以适应不同的工作环境和优化策略。通过这种方法，模型能够提供更贴近实际运行状况的功率预测，从而提高估算的准确性。接着，将这些估算值转化为lookup tables（查找表），便于在架构模拟器中进行快速查询和整合，使得整个系统的功耗分析更为便捷。实验结果显示，这种结合了架构级和门级的功率模型在峰值功率估计方面表现出极高的精度。相比于单纯的门级或架构级估算方法，它显著提升了仿真性能，使得设计师能够在早期设计阶段就能获得更可靠的功耗预测，从而在优化系统性能的同时，有效控制功耗，符合低功耗设计的趋势。总结来说，本文提出的APowerModelCombined为多核处理器的功耗管理提供了一个强大的工具，它通过跨层次的分析，既考虑了系统的整体结构，又兼顾了电路的细节，从而实现了高效且精确的功耗估算。这对于提升多核处理器的能源效率和优化系统设计具有重要意义。未来的研究可能进一步优化模型的参数化方法，或者将其应用到更多的硬件设计流程中，以推动低功耗计算技术的发展。

A Power Model Combined of Architectural Level

and Gate Level for Multicore Processors

Manman Peng

Key Laboratory for Embedded and Network Computing

of Hunan Province

Hunan University

Changsha, China

pengmanman@hnu.edu.cn

Yan Hu

Key Laboratory for Embedded and Network Computing

of Hunan Province

Hunan University

Changsha, China

hnu.huyan@gmail.com

Abstract—Low power consumption is becoming a critical

factor for multicore processors. As the multicore processor

design complexity increases, power estimation for multicore

processors has gained more importance. This paper presents a

new power model combined of architectural level and gate

level for multicore processors. The model maps the multicore

processors to a combination of building blocks, and estimates

the gate-level power of these blocks using parameterized RTL.

Then, the power numbers are made in the form of look-up

tables, and integrated in architecture simulators. The

experiments show that for peak power estimation, an excellent

accuracy has been reached and simulation performance is

greatly improved compared to the gate level.

Keywords-power modeling; gate level; architectural level;

Multicore processors

I. INT ROD UC ION

As VLSI technology is developing rapidly in

complexity and density, power consumption of chips has

become a major concern in the state of the art of high-

performance CPU design. To consume less power but still

get better overall performance, Industry has already shifted

gears to deploy architectures with multiple cores [2] and

large last-level caches [3], so the power consumption of

multi-core and many-core processors deserves to get more

attention.

According to the different design phases of the chip,

the analysis methods of overall power consumption is

divided into the following categories: architecture-level,

RTL-level, gate-level (netlist-level) and transistor-level. As

the power model is refined from the highest level of

abstraction to the lowest, the accuracy and detail of

functional and timing information increase. At the two

extremes, there are many power models/tools [4, 5, 6] have

been proposed. Architecture-level power models [4, 5] are

fast, but ignore the impact of specific circuit implementation

of various factors on power consumption. Gate-level power

models [6] can give precise power dissipation of a circuit

driven by certain input vectors. However, due to the nature

of its simulation process, gate-level estimators always have

the slowest speed, and the circuit netlists must be known

before any simulation could be performed.

In this paper, we introduce a new power model for

multicore processors which is a combination of gate level

and architectural level. Our model uses a series of EDA

tools and DesignWare Library [8] which is a collection of

the industry's most widely used, silicon-proven reusable

intellectual property blocks to get the gate-level power.

Then, these power numbers are made in the form of a

lookup table, and integrated in Gem5 [12] to provide power

estimates. For the components that we can’t get the RTL

hardware description, we still use analytical methods.

Dynamic, short-circuit, and leakage power are all modeled.

The rest of the paper is organized as follows: section 2

discusses related work, section 3 presents building blocks of

a multicore processor, section 4 provides a detailed

description of our power estimation methodology and its

flow, section 5 describes the validation experimental results

and section 6 discusses our conclusions.

II.

RELATED WORK

Wattch [4] is a widely-used architecture-level power

analysis and optimization tool. The power estimation of

Wattch is that the tool integrates parameterized power

models of common structures present in modern superscalar

microprocessors into Simplescalar. Wattch only models

dynamic power consumption and fall the main processor

units into four categories: array structures, fully associative

content-addressable memories, combinational logic and

wires and clocking. The limitation of Wattch is that they do

not necessarily model all of the miscellaneous logic present

in real microprocessors, and use simple linear scaling

models based on 0.8um technology that are inaccurate to

make predictions for current and future deep-submicron

technology nodes.

CACTI [13] is an integrated cache and memory access

time, cycle time, area, leakage, and dynamic power model.

It uses device models based on the industry-standard ITRS

2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications

DOI 10.1109/TrustCom.2013.204

1652

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38652196

粉丝: 2
资源: 939

多核处理器的架构与门级功率模型结合：提升低功耗与仿真效率

片上多核处理器架构指南

针对40100G高速以太网多核处理器架构的研究与改进.pdf

gem5-gpu A heterogeneous CPU-GPU Simulator

大学计算机基础：处理器构造的技术探秘

计算机系统架构

【软件架构设计】：构建高效数组操作Python库的策略与技巧

【SQL查询优化全攻略】：Hackerrank数据库挑战专家级攻略

java基础GUI框架完成的贪吃蛇小游戏.zip

安卓期末大作业-Android跑步计数app期末大作业源码（高分项目）

C#毕业设计-基于ASP.NET的教师公寓管理系统源码.zip

最新资源