
The Unicode Standard, Version 10.0
Archived Code Charts
This file contains the complete set of character code tables and list of character names for The Unicode Standard, Version 10.0.
This file will not be updated with errata, or when additional characters are assigned to the Unicode Standard.
See http://www.unicode.org/errata/ for an up-to-date list of errata.
See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts.
See http://www.unicode.org/charts/PDF/Unicode-10.0/ for charts showing only the characters added in Unicode 10.0.
Disclaimer
These charts are provided as the online reference to the character contents of the Unicode Standard, Version 10.0 but do
not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete
understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode
Standard, Version 10.0, online at http://www.unicode.org/versions/Unicode10.0.0/, as well as Unicode Standard Annexes
#9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, and #45, the other Unicode Technical Reports and Standards, and
the Unicode Character Database, which are available online.
See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/
A thorough understanding of the information contained in these additional sources is required for a successful
implementation.
Fonts
The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be
expected in actual fonts. The particular fonts used in these charts were provided to the Unicode Consortium by a number
of different font designers, who own the rights to the fonts.
See http://www.unicode.org/charts/fonts.html for a list.
Terms of Use
You may freely use these code charts for personal or internal business uses only. You may not incorporate them either
wholly or in part into any product or publication, or otherwise distribute them without express written permission from
the Unicode Consortium. However, you may provide links to these charts.
The fonts and font data used in production of these code charts may NOT be extracted, or used in any other way in any
product or publication, without permission or license granted by the typeface owner(s).
The Unicode Consortium is not liable for errors or omissions in this file or the standard itself. Information on characters
added to the Unicode Standard since the publication of the most recent version of the Unicode Standard, as well as on
characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site.
See http://www.unicode.org/pending/pending.html and http://www.unicode.org/alloc/Pipeline.html.
Copyright © 1991-2017 Unicode, Inc. All rights reserved.

The Unicode Standard 10.0, Copyright © 1991-2017 Unicode, Inc. All rights reserved.
C0 Controls and Basic Latin
Range: 0000–007F

    000  001  002  003  004  005  006  007
0   NUL  DLE  SP   0    @    P    `    p
1   SOH  DC1  !    1    A    Q    a    q
2   STX  DC2  "    2    B    R    b    r
3   ETX  DC3  #    3    C    S    c    s
4   EOT  DC4  $    4    D    T    d    t
5   ENQ  NAK  %    5    E    U    e    u
6   ACK  SYN  &    6    F    V    f    v
7   BEL  ETB  '    7    G    W    g    w
8   BS   CAN  (    8    H    X    h    x
9   HT   EM   )    9    I    Y    i    y
A   LF   SUB  *    :    J    Z    j    z
B   VT   ESC  +    ;    K    [    k    {
C   FF   FS   ,    <    L    \    l    |
D   CR   GS   -    =    M    ]    m    }
E   SO   RS   .    >    N    ^    n    ~
F   SI   US   /    ?    O    _    o    DEL

The code point of a cell is its column heading followed by its row digit; for example, column 004, row 1 is 0041 (A). Cells in columns 000 and 001, and the cells for 0020 (SP) and 007F (DEL), hold abbreviations for characters that have no visible glyph.
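The grid above is addressed by column and row: the column heading supplies the leading hex digits of a code point and the row supplies the last digit. The following is a minimal Python sketch of that addressing scheme; the function names `cell_code_point` and `render_block` are illustrative and not part of the standard, and non-printable cells are shown as a dot rather than the abbreviations used in the printed chart.

```python
def cell_code_point(column: int, row: int) -> int:
    """Combine a column heading (e.g. 0x004) and a row digit (e.g. 0x1) into a code point."""
    if not 0 <= row <= 0xF:
        raise ValueError("row must be a single hex digit")
    return (column << 4) | row


def render_block(first_column: int = 0x000, last_column: int = 0x007) -> str:
    """Render a block as a 16-row grid; non-printable cells are shown as '.'."""
    columns = range(first_column, last_column + 1)
    # Header line: one three-digit hex heading per column.
    lines = ["  " + " ".join(f"{col:03X}" for col in columns)]
    for row in range(16):
        cells = []
        for col in columns:
            ch = chr(cell_code_point(col, row))
            cells.append((ch if ch.isprintable() else ".").ljust(3))
        lines.append(f"{row:X} " + " ".join(cells).rstrip())
    return "\n".join(lines)


print(render_block())
```

Calling `cell_code_point(0x004, 0x1)` gives `0x41`, the code point of LATIN CAPITAL LETTER A, which matches the cell at column 004, row 1.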

C0 controls
Alias names are those for ISO/IEC 6429:1992. Commonly used alternative aliases are also shown.
0000 <control>
= NULL
0001 <control>
= START OF HEADING
0002 <control>
= START OF TEXT
0003 <control>
= END OF TEXT
0004 <control>
= END OF TRANSMISSION
0005 <control>
= ENQUIRY
0006 <control>
= ACKNOWLEDGE
0007 <control>
= BELL
0008 <control>
= BACKSPACE
0009 <control>
= CHARACTER TABULATION
= horizontal tabulation (HT), tab
000A <control>
= LINE FEED (LF)
= new line (NL), end of line (EOL)
000B <control>
= LINE TABULATION
= vertical tabulation (VT)
000C <control>
= FORM FEED (FF)
000D <control>
= CARRIAGE RETURN (CR)
000E <control>
= SHIFT OUT
• known as LOCKING-SHIFT ONE in 8-bit environments
000F <control>
= SHIFT IN
• known as LOCKING-SHIFT ZERO in 8-bit environments
0010 <control>
= DATA LINK ESCAPE
0011 <control>
= DEVICE CONTROL ONE
0012 <control>
= DEVICE CONTROL TWO
0013 <control>
= DEVICE CONTROL THREE
0014 <control>
= DEVICE CONTROL FOUR
0015 <control>
= NEGATIVE ACKNOWLEDGE
0016 <control>
= SYNCHRONOUS IDLE
0017 <control>
= END OF TRANSMISSION BLOCK
0018 <control>
= CANCEL
0019 <control>
= END OF MEDIUM
001A <control>
= SUBSTITUTE
→ FFFD replacement character
001B <control>
= ESCAPE
001C <control>
= INFORMATION SEPARATOR FOUR
= file separator (FS)
001D <control>
= INFORMATION SEPARATOR THREE
= group separator (GS)
001E <control>
= INFORMATION SEPARATOR TWO
= record separator (RS)
001F <control>
= INFORMATION SEPARATOR ONE
= unit separator (US)
ASCII punctuation and symbols
Based on ISO/IEC 646.
0020 SPACE
• sometimes considered a control code
• other space characters: 2000–200A
→ 00A0 no-break space
→ 200B zero width space
→ 2060 word joiner
→ 3000 ideographic space
→ FEFF zero width no-break space
0021 ! EXCLAMATION MARK
= factorial
= bang
→ 00A1 ¡ inverted exclamation mark
→ 01C3 ǃ latin letter retroflex click
→ 203C ‼ double exclamation mark
→ 203D ‽ interrobang
→ 2762 ❢ heavy exclamation mark ornament
0022 " QUOTATION MARK
• neutral (vertical), used as opening or closing quotation mark
• preferred characters in English for paired quotation marks are 201C “ & 201D ”
• 05F4 ״ is preferred for gershayim when writing Hebrew
→ 02BA ʺ modifier letter double prime
→ 030B combining double acute accent
→ 030E combining double vertical line above
→ 05F4 ״ hebrew punctuation gershayim
→ 2033 ″ double prime
→ 3003 〃 ditto mark
0023 # NUMBER SIGN
= pound sign, hash, crosshatch, octothorpe
→ 2114 ℔ l b bar symbol
→ 2317 ⌗ viewdata square
→ 266F ♯ music sharp sign
0024 $ DOLLAR SIGN
= milréis, escudo
• used for many peso currencies in Latin America and elsewhere
• glyph may have one or two vertical bars
• other currency symbol characters start at 20A0 ₠
→ 00A4 ¤ currency sign
→ 20B1 ₱ peso sign
→ 1F4B2 💲 heavy dollar sign
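The information separators 001C to 001F listed above (FS, GS, RS, US) can delimit structured data without colliding with printable text such as commas. The following is a minimal Python sketch, assuming a simple two-level layout in which RS separates records and US separates the units (fields) inside a record; the helper names `encode_records`, `decode_records`, and `is_c0_control` are hypothetical and not defined by the standard.

```python
import unicodedata
from typing import List

# C0 information separators, from widest to narrowest scope.
FS, GS, RS, US = "\x1c", "\x1d", "\x1e", "\x1f"


def encode_records(records: List[List[str]]) -> str:
    """Join fields with US and records with RS."""
    return RS.join(US.join(fields) for fields in records)


def decode_records(payload: str) -> List[List[str]]:
    """Split on RS into records, then on US into fields."""
    return [record.split(US) for record in payload.split(RS)]


def is_c0_control(ch: str) -> bool:
    """True for the C0 control range 0000 to 001F."""
    return ord(ch) <= 0x1F


records = [["Doe, Jane", "42"], ["Roe, Rich", "17"]]
payload = encode_records(records)
assert decode_records(payload) == records

# Control characters have general category Cc and no character name of their own,
# so unicodedata.name() needs a default to avoid raising ValueError.
print(unicodedata.category(RS), unicodedata.name(RS, "<control>"))
```

Because the fields here contain commas, a comma-separated encoding would need quoting rules; the separator controls avoid that as long as the data itself never contains them.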

0025 % PERCENT SIGN
→ 066A arabic percent sign
→ 2030 ‰ per mille sign
→ 2031 ‱ per ten thousand sign
→ 2052 ⁒ commercial minus sign
0026 & AMPERSAND
→ 204A ⁊ tironian sign et
→ 214B ⅋ turned ampersand
→ 1F674 🙴 heavy ampersand ornament
0027 ' APOSTROPHE
= apostrophe-quote (1.0)
= APL quote
• neutral (vertical) glyph with mixed usage
• 2019 ’ is preferred for apostrophe
• preferred characters in English for paired quotation marks are 2018 ‘ & 2019 ’
• 05F3 ׳ is preferred for geresh when writing Hebrew
→ 02B9 ʹ modifier letter prime
→ 02BC ʼ modifier letter apostrophe
→ 02C8 ˈ modifier letter vertical line
→ 0301 combining acute accent
→ 05F3 ׳ hebrew punctuation geresh
→ 2032 ′ prime
→ A78C ꞌ latin small letter saltillo
0028 ( LEFT PARENTHESIS
= opening parenthesis (1.0)
0029 ) RIGHT PARENTHESIS
= closing parenthesis (1.0)
• see discussion on semantics of paired bracketing characters
002A * ASTERISK
= star (on phone keypads)
→ 066D arabic five pointed star
→ 204E ⁎ low asterisk
→ 2217 ∗ asterisk operator
→ 26B9 ⚹ sextile
→ 2731 ✱ heavy asterisk
002B + PLUS SIGN
→ 2795 ➕ heavy plus sign
002C , COMMA
= decimal separator
→ 060C arabic comma
→ 201A ‚ single low-9 quotation mark
→ 2E41 ⹁ reversed comma
→ 3001 、 ideographic comma
002D - HYPHEN-MINUS
= hyphen or minus sign
• used for either hyphen or minus sign
→ 2010 ‐ hyphen
→ 2011 non-breaking hyphen
→ 2012 ‒ figure dash
→ 2013 – en dash
→ 2043 ⁃ hyphen bullet
→ 2212 − minus sign
→ 10191 𐆑 roman uncia sign
002E . FULL STOP
= period, dot, decimal point
• may be rendered as a raised decimal point in old style numbers
→ 06D4 arabic full stop
→ 2E3C ⸼ stenographic full stop
→ 3002 。 ideographic full stop
002F / SOLIDUS
= slash, virgule
→ 01C0 ǀ latin letter dental click
→ 0338 combining long solidus overlay
→ 2044 ⁄ fraction slash
→ 2215 ∕ division slash
ASCII digits
0030 0 DIGIT ZERO
⁓ 0030 FE00 short diagonal stroke form
0031 1 DIGIT ONE
0032 2 DIGIT TWO
0033 3 DIGIT THREE
0034 4 DIGIT FOUR
0035 5 DIGIT FIVE
0036 6 DIGIT SIX
0037 7 DIGIT SEVEN
0038 8 DIGIT EIGHT
0039 9 DIGIT NINE
ASCII punctuation and symbols
003A : COLON
• also used to denote division or scale; for that mathematical use 2236 ∶ is preferred
→ 0589 ։ armenian full stop
→ 05C3 ׃ hebrew punctuation sof pasuq
→ 2236 ∶ ratio
→ A789 ꞉ modifier letter colon
003B ; SEMICOLON
• this, and not 037E ;, is the preferred character for ‘Greek question mark’
→ 037E ; greek question mark
→ 061B arabic semicolon
→ 204F ⁏ reversed semicolon
003C < LESS-THAN SIGN
→ 2039 ‹ single left-pointing angle quotation mark
→ 2329 〈 left-pointing angle bracket
→ 27E8 ⟨ mathematical left angle bracket
→ 3008 〈 left angle bracket
003D = EQUALS SIGN
• other related characters: 2241 ≁ – 2263 ≣
→ 2260 ≠ not equal to
→ 2261 ≡ identical to
→ A78A ꞊ modifier letter short equals sign
→ 10190 𐆐 roman sextans sign
003E > GREATER-THAN SIGN
→ 203A › single right-pointing angle quotation mark
→ 232A 〉 right-pointing angle bracket
→ 27E9 ⟩ mathematical right angle bracket
→ 3009 〉 right angle bracket
003F ? QUESTION MARK
→ 00BF ¿ inverted question mark
→ 037E ; greek question mark
→ 061F arabic question mark
→ 203D ‽ interrobang
→ 2048 ⁈ question exclamation mark
→ 2049 ⁉ exclamation question mark
0040 @ COMMERCIAL AT
= at sign
Uppercase Latin alphabet
0041 A LATIN CAPITAL LETTER A
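Many entries above cross-reference characters that look like an ASCII character but have a different code point, such as 2212 minus sign for 002D, 2044 fraction slash for 002F, and 037E greek question mark for 003B. The following is a minimal Python sketch of folding a few of those look-alikes to ASCII before comparison or parsing. The table `ASCII_FALLBACKS` and the function `to_ascii_punctuation` are hypothetical names, the mapping is a small hand-picked subset of the cross-references, and folding is a lossy choice that is not recommended by the charts themselves.

```python
import unicodedata

# Hand-picked subset of the cross-references listed in the names list; not exhaustive.
ASCII_FALLBACKS = {
    "\u2010": "-",  # hyphen
    "\u2011": "-",  # non-breaking hyphen
    "\u2012": "-",  # figure dash
    "\u2013": "-",  # en dash
    "\u2212": "-",  # minus sign
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark
    "\u2044": "/",  # fraction slash
    "\u2215": "/",  # division slash
    "\u2236": ":",  # ratio
}
_TABLE = str.maketrans(ASCII_FALLBACKS)


def to_ascii_punctuation(text: str) -> str:
    """Fold selected look-alike punctuation to its ASCII counterpart."""
    # NFC already maps 037E (greek question mark) to 003B (semicolon),
    # because 037E is canonically equivalent to 003B.
    text = unicodedata.normalize("NFC", text)
    return text.translate(_TABLE)


print(to_ascii_punctuation("10 \u2212 3"))
```

Normalization handles 037E without an explicit table entry; the other characters are distinct in every normalization form, so they need the explicit mapping.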

0042 B LATIN CAPITAL LETTER B
→ 212C ℬ script capital b
0043 C LATIN CAPITAL LETTER C
→ 2102 ℂ double-struck capital c
→ 212D ℭ black-letter capital c
0044 D LATIN CAPITAL LETTER D
0045 E LATIN CAPITAL LETTER E
→ 2107 ℇ euler constant
→ 2130 ℰ script capital e
0046 F LATIN CAPITAL LETTER F
→ 2131 ℱ script capital f
→ 2132 Ⅎ turned capital f
0047 G LATIN CAPITAL LETTER G
0048 H LATIN CAPITAL LETTER H
→ 210B ℋ script capital h
→ 210C ℌ black-letter capital h
→ 210D ℍ double-struck capital h
0049 I LATIN CAPITAL LETTER I
• Turkish and Azerbaijani use 0131 ı for lowercase
→ 0130 İ latin capital letter i with dot above
→ 0406 І cyrillic capital letter byelorussian-ukrainian i
→ 04C0 Ӏ cyrillic letter palochka
→ 2110 ℐ script capital i
→ 2111 ℑ black-letter capital i
→ 2160 Ⅰ roman numeral one
004A J LATIN CAPITAL LETTER J
004B K LATIN CAPITAL LETTER K
→ 212A K kelvin sign
004C L LATIN CAPITAL LETTER L
→ 2112 ℒ script capital l
004D M LATIN CAPITAL LETTER M
→ 2133 ℳ script capital m
004E N LATIN CAPITAL LETTER N
→ 2115 ℕ double-struck capital n
004F O LATIN CAPITAL LETTER O
0050 P LATIN CAPITAL LETTER P
→ 2119 ℙ double-struck capital p
0051 Q LATIN CAPITAL LETTER Q
→ 211A ℚ double-struck capital q
0052 R LATIN CAPITAL LETTER R
→ 211B ℛ script capital r
→ 211C ℜ black-letter capital r
→ 211D ℝ double-struck capital r
0053 S LATIN CAPITAL LETTER S
0054 T LATIN CAPITAL LETTER T
0055 U LATIN CAPITAL LETTER U
0056 V LATIN CAPITAL LETTER V
→ 2164 Ⅴ roman numeral five
0057 W LATIN CAPITAL LETTER W
0058 X LATIN CAPITAL LETTER X
0059 Y LATIN CAPITAL LETTER Y
005A Z LATIN CAPITAL LETTER Z
→ 2124 ℤ double-struck capital z
→ 2128 ℨ black-letter capital z
ASCII punctuation and symbols
005B [ LEFT SQUARE BRACKET
= opening square bracket (1.0)
• other bracket characters: 27E6 ⟦ – 27EB ⟫, 2983 ⦃ – 2998 ⦘, 3008 〈 – 301B 〛
005C \ REVERSE SOLIDUS
= backslash
→ 20E5 combining reverse solidus overlay
→ 2216 ∖ set minus
005D ] RIGHT SQUARE BRACKET
= closing square bracket (1.0)
005E ^ CIRCUMFLEX ACCENT
• this is a spacing character
→ 02C4 ˄ modifier letter up arrowhead
→ 02C6 ˆ modifier letter circumflex accent
→ 0302 combining circumflex accent
→ 2038 ‸ caret
→ 2303 ⌃ up arrowhead
005F _ LOW LINE
= spacing underscore (1.0)
• this is a spacing character
→ 02CD ˍ modifier letter low macron
→ 0331 combining macron below
→ 0332 combining low line
→ 2017 ‗ double low line
0060 ` GRAVE ACCENT
• this is a spacing character
→ 02CB ˋ modifier letter grave accent
→ 0300 combining grave accent
→ 2035 ‵ reversed prime
Lowercase Latin alphabet
0061 a LATIN SMALL LETTER A
0062 b LATIN SMALL LETTER B
0063 c LATIN SMALL LETTER C
0064 d LATIN SMALL LETTER D
0065 e LATIN SMALL LETTER E
→ 212E ℮ estimated symbol
→ 212F ℯ script small e
0066 f LATIN SMALL LETTER F
0067 g LATIN SMALL LETTER G
→ 0261 ɡ latin small letter script g
→ 210A ℊ script small g
0068 h LATIN SMALL LETTER H
→ 04BB һ cyrillic small letter shha
→ 210E ℎ planck constant
0069 i LATIN SMALL LETTER I
• Turkish and Azerbaijani use 0130 İ for uppercase
→ 0131 ı latin small letter dotless i
→ 1D6A4 𝚤 mathematical italic small dotless i
006A j LATIN SMALL LETTER J
→ 0237 ȷ latin small letter dotless j
→ 1D6A5 𝚥 mathematical italic small dotless j
006B k LATIN SMALL LETTER K
006C l LATIN SMALL LETTER L
→ 2113 ℓ script small l
→ 1D4C1 𝓁 mathematical script small l
006D m LATIN SMALL LETTER M
006E n LATIN SMALL LETTER N
→ 207F ⁿ superscript latin small letter n
006F o LATIN SMALL LETTER O
→ 2134 ℴ script small o
0070 p LATIN SMALL LETTER P
0071 q LATIN SMALL LETTER Q
0072 r LATIN SMALL LETTER R
0073 s LATIN SMALL LETTER S
0074 t LATIN SMALL LETTER T
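The notes on 0049 and 0069 above say that Turkish and Azerbaijani pair I with 0131 ı and i with 0130 İ, which differs from the default case mapping. Python's built-in `str.upper()` and `str.lower()` are not locale-aware, so the following minimal sketch applies the two special pairs by hand before falling back to the default mapping. The helper names `turkish_upper` and `turkish_lower` are hypothetical, and the sketch covers only the dotted and dotless i, not any other language-specific rule.

```python
DOTTED_CAPITAL_I = "\u0130"   # İ latin capital letter i with dot above
DOTLESS_SMALL_I = "\u0131"    # ı latin small letter dotless i


def turkish_upper(text: str) -> str:
    """Uppercase with i -> İ and ı -> I, then the default mapping for the rest."""
    text = text.replace("i", DOTTED_CAPITAL_I).replace(DOTLESS_SMALL_I, "I")
    return text.upper()


def turkish_lower(text: str) -> str:
    """Lowercase with İ -> i and I -> ı, then the default mapping for the rest."""
    text = text.replace(DOTTED_CAPITAL_I, "i").replace("I", DOTLESS_SMALL_I)
    return text.lower()


# The default mapping pairs I with i, which is wrong for Turkish text.
print("ISI".lower(), turkish_lower("ISI"))
print("istanbul".upper(), turkish_upper("istanbul"))
```

The special pairs are replaced first so that the default mapping never sees a character it would convert differently.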