GPU工作原理与现代图形处理革命

需积分: 9 127 浏览量更新于2024-09-17 收藏 4MB PDF 举报

GPU（图形处理单元）工作原理详解随着20世纪90年代早期计算机游戏市场的爆炸性增长，消费者对实时3D图形的需求激增，图形处理单元（GPU）应运而生。在此之前，3D图形还停留在科幻范畴，然而到本世纪初，几乎每台新电脑都内置了专门用于提供高性能、视觉丰富且互动的3D体验的GPU。这一转变是消费者需求、制造业进步和技术潜能释放的综合结果。传统的固定功能3D图形管道被GPU的并行计算能力所取代，使其不再局限于单一的图形渲染任务，而是向着通用的并行计算引擎演进。现代GPU能够直接在硬件上执行多种并行算法，那些充分利用底层计算能力的算法可以实现惊人的性能提升。可以说，GPU已经成为普及的桌面级并行计算机，极大地扩展了计算能力。 GPU的工作原理主要包括一个图形处理管道，这是一个高度优化的流水线架构，分为多个阶段：顶点着色器（Vertex Shader）、几何着色器（Geometry Shader）、片段着色器（Fragment Shader）以及可能的后处理阶段，如光栅化（Rasterization）、纹理采样（Texture Sampling）和最终的渲染输出。这个管道的工作流程通常是顺序执行，但每个阶段都是并行执行的，因为每个像素或顶点都可以独立处理。在图形渲染过程中，GPU接收来自CPU的指令，包括3D模型、纹理和光照等信息。首先，顶点着色器对输入的3D模型的顶点进行变换和着色，生成顶点属性。接着，几何着色器对这些顶点进行进一步处理，例如剪裁和合并，形成多边形。片段着色器对每个像素进行颜色计算，结合纹理映射和光照效果，确定最终的像素颜色。最后，光栅化将像素转换为屏幕上的可见图像，而纹理采样则负责从纹理贴图中获取所需的纹理数据。 GPU的优势在于其并行处理能力，它能够同时处理大量的像素和顶点，这使得它在大规模渲染场景、物理模拟、机器学习和深度学习等高计算密集型任务中发挥关键作用。而且，随着技术的进步，GPU的性能持续增长，与CPU之间的性能差距还在扩大，这意味着未来更多的应用程序将利用GPU的算力，推动计算行业的创新和发展。 GPU已经从最初的图形加速器转变为现代计算平台的核心组件，其灵活的架构和强大的并行处理能力使得它成为驱动许多现代数字娱乐、科学计算和人工智能领域发展的关键技术。理解GPU的工作原理对于任何从事图形设计、游戏开发、机器学习或高性能计算的人来说都是至关重要的。

96 Computer

HOW THINGS WORK

n the early 1990s, ubiquitous

interactive 3D graphics was still

the stuff of science ﬁction. By the

end of the decade, nearly every

new computer contained a graph-

ics processing unit (GPU) dedicated to

providing a high-performance, visu-

ally rich, interactive 3D experience.

This dramatic shift was the in-

evitable consequence of consumer

demand for videogames, advances in

manufacturing technology, and the

exploitation of the inherent paral-

lelism in the feed-forward graphics

pipeline. Today, the raw computa-

tional power of a GPU dwarfs that of

the most powerful CPU, and the gap is

steadily widening.

Furthermore, GPUs have moved

away from the traditional ﬁxed-func-

tion 3D graphics pipeline toward

a flexible general-purpose compu-

tational engine. Today, GPUs can

implement many parallel algorithms

directly using graphics hardware.

Well-suited algorithms that leverage

all the underlying computational

horsepower often achieve tremendous

speedups. Truly, the GPU is the first

widely deployed commodity desktop

parallel computer.

THE GRAPHICS PIPELINE

The task of any 3D graphics system

is to synthesize an image from a

description of a scene—60 times per

second for real-time graphics such as

videogames. This scene contains the

geometric primitives to be viewed as

well as descriptions of the lights illu-

minating the scene, the way that each

object reﬂects light, and the viewer’s

position and orientation.

GPU designers traditionally have

expressed this image-synthesis process

as a hardware pipeline of specialized

stages. Here, we provide a high-level

overview of the classic graphics

pipeline; our goal is to highlight those

aspects of the real-time rendering cal-

culation that allow graphics applica-

tion developers to exploit modern

GPUs as general-purpose parallel

computation engines.

Pipeline input

Most real-time graphics systems

assume that everything is made of tri-

angles, and they ﬁrst carve up any more

complex shapes, such as quadrilaterals

or curved surface patches, into trian-

gles. The developer uses a computer

graphics library (such as OpenGL or

Direct3D) to provide each triangle to

the graphics pipeline one vertex at a

time; the GPU assembles vertices into

triangles as needed.

Model transformations

A GPU can specify each logical

object in a scene in its own locally

defined coordinate system, which is

convenient for objects that are natu-

rally deﬁned hierarchically. This con-

venience comes at a price: before

rendering, the GPU must first trans-

form all objects into a common coor-

dinate system. To ensure that triangles

aren’t warped or twisted into curved

shapes, this transformation is limited

to simple affine operations such as

rotations, translations, scalings, and

the like.

As the “Homogeneous Coordinates”

sidebar explains, by representing each

vertex in homogeneous coordinates,

the graphics system can perform the

entire hierarchy of transformations

simultaneously with a single matrix-

vector multiply. The need for efﬁcient

hardware to perform ﬂoating-point

vector arithmetic for millions of ver-

tices each second has helped drive the

GPU parallel-computing revolution.

The output of this stage of the

pipeline is a stream of triangles, all

expressed in a common 3D coordinate

system in which the viewer is located

at the origin, and the direction of view

is aligned with the z-axis.

Lighting

Once each triangle is in a global

coordinate system, the GPU can com-

pute its color based on the lights in the

scene. As an example, we describe the

calculations for a single-point light

source (imagine a very small lightbulb).

The GPU handles multiple lights by

summing the contributions of each

individual light. The traditional graph-

ics pipeline supports the Phong light-

ing equation (B-T. Phong, “Illumina-

tion for Computer-Generated Images,”

Comm. ACM, June 1975, pp. 311-

317), a phenomenological appearance

model that approximates the look of

plastic. These materials combine a dull

diffuse base with a shiny specular high-

How GPUs

Work

David Luebke, NVIDIA Research

Greg Humphreys, University of Virginia

GPUs have moved away from

the traditional ﬁxed-function

3D graphics pipeline toward

a ﬂexible general-purpose

computational engine.

r2How.qxp 23/1/07 12:44 PM Page 96

下载后可阅读完整内容，剩余4页未读，立即下载

GISsirclyx

粉丝: 38
资源: 2

GPU工作原理与现代图形处理革命

Introduction-to-GPUs.pdf

How Does a GPU Shader Work (2018)-计算机科学

Packt.Hands-On.GPU.Computing.with.Python.1789341078.epub

基于微信小程序的社区门诊管理系统php.zip

白色大气风格的设计师作品模板下载.zip

工程经济学自考必备软件下载

UML课程设计报告.doc

白色大气风格响应式彩绘精品水果网站模板.zip

白色简洁风格的别墅整站网站模板.zip

白色简洁风格的APP展示动态源码下载.zip

最新资源