Hardware-in-the-Loop Simulation for Heterogeneous CPU-GPU Systems

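As embedded systems demand ever more performance and efficiency, multi-core CPU/GPU heterogeneous platforms are attracting growing attention. A conventional full system simulator analyzes internal system behavior by running the complete software stack on simulated CPUs and other devices. Full system simulators for CPU/GPU heterogeneous platforms are scarce, however, and existing GPU simulators are so computationally intensive that they run real application software extremely slowly, which limits development efficiency and accuracy.

The paper proposes a hardware-in-the-loop (HIL) simulation technique to address this problem. The authors, researchers from Seoul National University and the University of Seoul together with a research team at Samsung Electronics, focus on a full system simulator that integrates GPU hardware seamlessly. The key contribution is a new interfacing mechanism between the CPU simulator and a development board, which lets the real GPU hardware take part in the simulation as it runs and so improves both fidelity and speed. Because the GPU workload executes on actual hardware rather than in a pure software model, developers can make more precise performance predictions and optimizations at an early stage.

Through HIL simulation, the researchers aim to give software developers for CPU/GPU heterogeneous platforms a more efficient and accurate development environment, in which they can quickly evaluate how an application behaves on real hardware without modifying that hardware. The approach is relevant to system-level design of hardware accelerators, parallel programming optimization, driver development, and energy analysis. The paper may also cover details such as cross-platform compatibility, reproducibility of simulation results, and performance benchmarking in practical use. The technique is expected to advance heterogeneous computing platforms, and in Android and other mobile or embedded systems the integration of technologies such as HSA (Heterogeneous System Architecture) and DSE (Dynamic Software Environment) may also play a part in improving overall system performance and user experience. In short, the paper presents a new approach to development for CPU/GPU heterogeneous platforms that combines the strengths of hardware emulation and system simulation and could become an important tool for embedded system design and optimization.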

Hardware-in-the-loop Simulation for CPU/GPU Heterogeneous Platforms

Youngsub Ko, Taeyoung Kim, Youngmin Yi, Myungsun Kim, Soonhoi Ha

School of Electrical Engineering and Computer Science, Seoul National University, Seoul, Korea
DMC R&D, Samsung Electronics, Suwon, Korea
{kys4464, tykim, sha}@iris.snu.ac.kr, mskim@redwood.snu.ac.kr

School of Electrical and Computer Engineering, University of Seoul, Seoul, Korea
ymyi@uos.ac.kr

ABSTRACT

Multi-core CPU/GPU heterogeneous platforms have become popular in embedded systems. A full system simulator is typically used to observe the internal system behavior by running complete software stacks without modification on simulation models of CPUs and other devices in the system. However, there are few known full system simulators for CPU/GPU heterogeneous platforms, and existing GPU simulators are prohibitively slow for running application software. In this paper, we propose a hardware-in-the-loop simulation technique that integrates GPU hardware into a full system simulator. We devise a novel interfacing mechanism between the CPU simulator and the development board on which the GPU hardware is integrated. In the experiments, we took Exynos 4412 as a case study: the gem5 simulator mainly simulates the quad-core ARM CPU of the platform, and an Exynos development board runs the Mali GPU hardware. We successfully ran Android apps on the proposed hardware-in-the-loop simulation framework at up to 1.5 M cycles per second.

Keywords

HIL Simulation, CPU/GPU Heterogeneous platform, Mali GPU

1. INTRODUCTION

With the ever-increasing demand for computation in embedded systems, a mobile GPU has become an essential component of most of them. Many SoCs integrate both a CPU and a GPU: Tegra from NVIDIA, Snapdragon from Qualcomm, and Exynos from Samsung, to name a few. These chips are widely used on platforms ranging from automobiles to high-performance smartphones and tablet PCs. Since low power consumption is the major design constraint in most computer systems these days, the trend towards CPU/GPU heterogeneous platforms will continue, along with an increasing number of cores in CPUs and GPUs.

For architectural exploration of the system, as well as for debugging and performance monitoring, a full system simulator is typically used, on which complete software stacks can run without modification. A full system simulator consists of simulation models of CPUs, memories, and a communication network, as well as peripherals. While CPU architectures have been studied for a long time and many simulators are available for different CPUs, there are only a few GPU simulators, and their simulation speed is prohibitively slow for running application software: gpgpu-sim [1] and Barra [2] are NVIDIA GPU simulators whose simulation speed is only dozens of kilo-cycles per second. They get even slower as the number of cores in a GPU increases. To the best of our knowledge, there is no publicly available simulator for widely used mobile GPUs such as Mali from ARM, PowerVR from Imagination Technologies, and Adreno from Qualcomm.

To make full system simulation feasible for CPU/GPU heterogeneous architectures, we propose a hardware-in-the-loop (HIL) simulation technique that integrates existing GPU hardware into a full system simulator. Enabling hardware-in-the-loop simulation with a CPU simulator and an existing GPU hardware development board poses several challenges, of which we list three major ones. First, unlike a simulator or a typical FPGA emulator for HW IPs, GPU hardware cannot be conveniently stopped and resumed, so integrating a development board into a CPU simulator is itself a challenging problem. Second, since we model a system with on-chip memory shared by the CPU and the GPU using a separate CPU simulator and GPU board, we must synchronize the duplicated shared memory models and maintain coherence between them. Third, the simulator must coordinate carefully with the real GPU hardware to preserve functional correctness. For example, interrupts between processors must be modeled correctly, without violating causality or introducing deadlock between the models.
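The second challenge, keeping two copies of the shared memory consistent, can be illustrated with a minimal sketch. It assumes page-granularity dirty tracking and an explicit flush at each synchronization point; the class `MirroredMemory`, the 4 KiB `PAGE_SIZE`, and the `flush_to` method are hypothetical names chosen for illustration and are not taken from the paper, whose actual mechanism is not described in this section.

```python
PAGE_SIZE = 4096  # assumed page size, for illustration only


class MirroredMemory:
    """One side's copy of the shared memory, with dirty-page tracking.

    Hypothetical model: the simulator and the board each hold one instance.
    """

    def __init__(self, size):
        self.data = bytearray(size)
        self.dirty = set()  # indices of pages written since the last flush

    def write(self, addr, payload):
        if addr < 0 or addr + len(payload) > len(self.data):
            raise ValueError("write outside the modeled memory")
        self.data[addr:addr + len(payload)] = payload
        first = addr // PAGE_SIZE
        last = (addr + len(payload) - 1) // PAGE_SIZE
        self.dirty.update(range(first, last + 1))

    def read(self, addr, length):
        return bytes(self.data[addr:addr + length])

    def flush_to(self, other):
        """Copy every dirty page into `other`; return how many were copied."""
        pages = sorted(self.dirty)
        for page in pages:
            start = page * PAGE_SIZE
            end = min(start + PAGE_SIZE, len(self.data))
            other.data[start:end] = self.data[start:end]
        self.dirty.clear()
        return len(pages)


# Usage: a CPU-side write that straddles a page boundary is pushed to the
# board-side copy at the next synchronization point.
sim_mem = MirroredMemory(3 * PAGE_SIZE)
board_mem = MirroredMemory(3 * PAGE_SIZE)
sim_mem.write(PAGE_SIZE - 2, b"abcd")
copied = sim_mem.flush_to(board_mem)
```

Copying only dirty pages keeps the transfer proportional to what changed since the last synchronization, which matters when the two copies are connected by a comparatively slow link.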

To overcome these challenges, we devised a novel interfacing mechanism between a CPU simulator and a development board on which the GPU hardware is integrated. To the best of our knowledge, this is the first hardware-in-the-loop simulation framework for a CPU/GPU heterogeneous embedded system that can run complete software stacks without modification. The proposed technique requires no modification of the GPU drivers, the Linux kernel, or Android. The interaction between the CPU simulator and the GPU hardware takes place at calls to the GPU drivers, which can be detected easily without any instrumentation. This allows easy porting of the simulation interfaces to different GPUs and also enables more efficient synchronization between the simulator and the board.
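One way to detect driver calls without instrumenting the software is for the simulator to compare the program counter against known driver entry addresses. The sketch below assumes exactly that; the class `DriverCallInterceptor`, the address `0x1000`, and the function name `gpu_job_submit` are hypothetical placeholders, not the paper's interface or real Mali driver symbols.

```python
class DriverCallInterceptor:
    """Forward simulated calls to GPU driver entry points to the board.

    Hypothetical sketch: `entry_points` maps an entry address to a driver
    function name, and `forward` stands in for the link to the board.
    """

    def __init__(self, entry_points, forward):
        self.entry_points = dict(entry_points)
        self.forward = forward
        self.log = []  # names of the intercepted calls, in order

    def on_instruction(self, pc, args=()):
        """Call for each simulated instruction; True if a call was forwarded."""
        name = self.entry_points.get(pc)
        if name is None:
            return False  # ordinary instruction, stays in the simulator
        self.log.append(name)
        self.forward(name, args)
        return True


# Usage: only the instruction at a registered entry address is forwarded.
sent_to_board = []
interceptor = DriverCallInterceptor(
    {0x1000: "gpu_job_submit"},  # hypothetical address and name
    lambda name, args: sent_to_board.append((name, args)),
)
interceptor.on_instruction(0x2000)
interceptor.on_instruction(0x1000, (42,))
```

Because the check happens inside the simulator, the guest kernel and drivers stay unmodified, and supporting another GPU would mainly mean supplying a different table of entry points.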

The framework provides instruction-level accuracy for the computation workload, so the simulation runs fast enough to explore the design space of the system with the existing GPU hardware. For instance, we can vary the number of CPU cores or change the type of GPU and quickly evaluate the performance impact. On the other hand,

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org.

DAC '14, June 01 - 05 2014, San Francisco, CA, USA
Publication rights licensed to ACM. ACM 978-1-4503-2730-5/14/06$15.00.
http://dx.doi.org/10.1145/2593069.2593149
