深入理解x86指令集：现代软件开发的基石

需积分: 50 182 浏览量更新于2024-09-12 收藏 134KB PDF 举报

"x86指令手册是一份详细阐述x86指令集架构的参考资料，旨在帮助读者理解并能编写简单的汇编程序，以及流畅地阅读反汇编的二进制代码。这份手册采用GNU工具（如assembler as和debugger gdb）所使用的汇编语法。虽然不同的工具可能有不同的表示方式，但只要关注当前使用的是哪种工具，这些差异并不影响理解。" x86指令集架构是计算机体系结构中的一个重要部分，特别是在x86平台上进行低级别编程时。它包含了丰富的指令，这使得x86被归类为复杂指令集计算（CISC）架构，与简化指令集计算（RISC）相比，x86的指令数量更多，功能更复杂。这种复杂性部分源于对向后兼容性的考虑，对于现代程序员来说，许多历史遗留的特性可能并不直接相关。 x86指令集的基础包括以下几个核心概念： 1. **寄存器**：在x86架构中，有一系列高速内存区域，称为寄存器，它们用于存储数据和指令。常见的通用寄存器有EAX、EBX、ECX、EDX，以及ESP（堆栈指针）和EIP（指令指针）。这些寄存器在执行指令时起着关键作用，因为访问寄存器的速度比访问内存快得多。 2. **数据类型**：x86支持多种数据类型，包括字节（byte）、字（word）、双字（doubleword）和四字（quadword）。它们分别对应8位、16位、32位和64位的数据。在64位的x86-64架构中，还增加了更大的数据类型。 3. **内存**：x86架构中的内存模型允许程序在主内存中存储和检索数据。通过地址来访问内存，寄存器通常用来存储这些地址。内存操作指令包括加载（load）和存储（store），还有更复杂的内存访问模式，如串操作（string operations）。汇编语言是x86指令集的直接映射，它提供了一种人可读的方式来编写机器代码。例如，`MOV`指令用于在寄存器和内存之间或寄存器之间转移数据，`ADD`指令用于加法运算，`JMP`用于跳转到其他指令地址等。使用汇编语言编程时，理解这些基本指令至关重要。在x86汇编中，还有其他高级特性，如条件转移指令（如`JNE`，不相等时跳转）、循环控制（如`LOOP`）以及调用和返回指令（`CALL`和`RET`），这些允许实现更复杂的控制流程。此外，x86指令集还包括了对浮点运算的支持，通过浮点寄存器（如XMM和YMM寄存器）和专门的浮点运算指令（如`SQRTSD`，平方根双精度浮点数）。在现代软件中，这些特性在图形处理、科学计算和高性能计算等领域有着广泛的应用。掌握x86指令手册可以帮助程序员深入理解底层计算机工作原理，编写高效的代码，以及调试程序时能够解读反汇编代码。虽然学习x86汇编可能会相对复杂，但对于计算机系统和软件工程的专业人士来说，这是一项宝贵的技能。

The x86 Instruction Set Architecture

CS232: Computer Architecture II

This set of notes provides an overview of the x86 instruction set architecture and its use in modern software. The goal

is to familiarize you with the ISA to the point that you can code simple programs and can read disassembled binary

code comfortably. Substantial portions of the ISA are ignored completely for the sake of simplicity. The notes use the

assembly notation used by the GNU tools, including the assembler as (used by the compiler gcc) and the debugger

gdb. Other tools may deﬁne other notations, but such things are merely cosmetic so long as you pay attention to what

you are using at the time.

The Basics: Registers, Data Types, and Memory

You may have heard or seen the term “Reduced Instruction Set Computing,” or RISC, and its counterpart, “Complex

Instruction Set Computing,” or CISC. While these terms were never entirely clear and have been further muddied by

years of marketing, the x86 ISA is certainly vastly more complex than that of MIPS. On the other hand, much of

the complexity has to do with backwards compatibility, which is mostly irrelevant to someone writing code today.

Furthermore, we need use only a limited subset of the ISA in this class.

Modern ﬂavors of x86—also called IA32, or Intel Architecture 32—have eight 32-bit integer registers. The registers

are not entirely general-purpose, meaning that some instructions limit your choice of register operands to fewer than

eight. A couple of other special-purpose 32-bit registers are also available—namely the instruction pointer (program

counter) and the ﬂags (condition codes), and we shall ignore the ﬂoating-point and multimedia registers. Unlike most

RISC machines, the registers have names stemming from their historical special purposes, as described below.

%eax accumulator (for adding, multiplying, etc.)

%ebx base (address of array in memory)

%ecx count (of loop iterations)

%edx data (e.g., second operand for binary operations)

%esi source index (for string copy or array access)

%edi destination index (for string copy or array access)

%ebp base pointer (base of current stack frame)

%esp stack pointer (top of stack)

%eip instruction pointer (program counter)

%eflags ﬂags (condition codes and other things)

high

AH AL

EAX

8 016 15 7

8−bit

low

EAX

EBX

ECX

EDX

ESI

EDI

EBP

ESP

32−bit

16−bit

The character “%” is used to denote a register in assembly code and is not considered a part of the register name itself;

note also that register names are not case sensitive. The letter “E” in each name indicates that the “extended” version

of the register is desired (extended from 16 bits). Registers can also be used to store 16- and 8-bit values, which is

useful when writing smaller values to memory or I/O ports. As shown to the right above, the low 16 bits of a register

are accessed by dropping the “E” from the register name, e.g., %si. Finally, the two 8-bit halves of the low 16 bits of

the ﬁrst four registers can be used as 8-bit registers by replacing “X” with “H” (high) or “L” (low).

The x86 ISA supports both 2’s complement and unsigned integers in widths of 32, 16, and 8 bits, single and double-

precision IEEE ﬂoating-point, 80-bit Intel ﬂoating-point, ASCII strings, and binary-coded decimal (BCD). Most in-

structions are independent of data type, but some require that you select the proper instruction for the data types of the

operands. Try multiplying 32-bit representations of -1 and 1 to produce a 64-bit result, for example.

Use of memory is more ﬂexible in x86 than in MIPS: in addition to load and store operations, many x86 operations

accept memory locations as operands. For example, a single instruction serves to read the value in a memory location,

add a constant, and store the sum back to the memory location. With x86, memory is 8-bit (byte) addressable and uses

32-bit addresses, although few machines today fully populate this 4 GB address space.

One aspect of x86’s treatment of memory may confuse you: it is little endian. Little endian means that if you store a

32-bit register into memory and then look at the four bytes of memory one by one, you will ﬁnd the little end of the

32 bits ﬁrst, followed by the next eight bits, then the next, and ﬁnally the high eight bits of the stored value. Thus

0x12345678 becomes 0x78, 0x56, 0x34, 0x12 in consecutive memory locations. Obviously, values read from memory

下载后可阅读完整内容，剩余7页未读，立即下载

magic_fox

粉丝: 0
资源: 14

深入理解x86指令集：现代软件开发的基石

x86汇编指令集大全（带注释）.pdf

x86 汇编指令速查手册

X86汇编指令集X86汇编指令集

x86 指令手册

x86doc:Intel x86 指令文档的 HTML 表示

x86中断手册等资料

英文的sun的x86汇编手册

80x86ins.rar_80X86_80x86数据手册_asm 386_x86_x86汇编手册

Intel指令手册

英特尔开发者手册卷2：x86指令详解与查找表指南

最新资源