@@@ The Unicode Standard 3.2
@@@+ Draft U32M020305.lst
 Minor annotation edits for final release.
 Addition of a few more Khmer annotations.
 This file is semi-automatically derived from UnicodeData.txt and
 a set of manually created annotations using a script to select
 or suppress information from the data file. The rules used
 for this process are aimed at readability for the human reader,
 at the expense of some details; therefore, this file should not
 be parsed for machine-readable information.
@@ 0000 C0 Controls and Basic Latin (Basic Latin) 007F
@ C0 controls
@+ Alias names are those for ISO/IEC 6429:1992. Commonly used alternative aliases are also shown.
0000 <control>
 = NULL
0001 <control>
 = START OF HEADING
0002 <control>
 = START OF TEXT
0003 <control>
 = END OF TEXT
0004 <control>
 = END OF TRANSMISSION
0005 <control>
 = ENQUIRY
0006 <control>
 = ACKNOWLEDGE
0007 <control>
 = BELL
0008 <control>
 = BACKSPACE
0009 <control>
 = CHARACTER TABULATION
 = horizontal tabulation (HT), tab
000A <control>
 = LINE FEED (LF)
 = new line (NL), end of line (EOL)
000B <control>
 = LINE TABULATION
000C <control>
 = FORM FEED (FF)
000D <control>
 = CARRIAGE RETURN (CR)
000E <control>
 = SHIFT OUT
000F <control>
 = SHIFT IN
0010 <control>
 = DATA LINK ESCAPE
0011 <control>
 = DEVICE CONTROL ONE
0012 <control>
 = DEVICE CONTROL TWO
0013 <control>
 = DEVICE CONTROL THREE
0014 <control>
 = DEVICE CONTROL FOUR
0015 <control>
 = NEGATIVE ACKNOWLEDGE
0016 <control>
 = SYNCHRONOUS IDLE
0017 <control>
 = END OF TRANSMISSION BLOCK
0018 <control>
 = CANCEL
0019 <control>
 = END OF MEDIUM
001A <control>
 = SUBSTITUTE
 x (replacement character - FFFD)
001B <control>
 = ESCAPE
001C <control>
 = INFORMATION SEPARATOR FOUR
 = file separator (FS)
001D <control>
 = INFORMATION SEPARATOR THREE
 = group separator (GS)
001E <control>
 = INFORMATION SEPARATOR TWO
 = record separator (RS)
001F <control>
 = INFORMATION SEPARATOR ONE
 = unit separator (US)
@ ASCII
0020 SPACE
 * sometimes considered a control code
 * other space characters: 2000-200A
 x (no-break space - 00A0)
 x (zero width space - 200B)
 x (word joiner - 2060)
 x (ideographic space - 3000)
 x (zero width no-break space - FEFF)
0021 EXCLAMATION MARK
 = factorial
 = bang
 x (inverted exclamation mark - 00A1)
 x (latin letter retroflex click - 01C3)
 x (double exclamation mark - 203C)
 x (interrobang - 203D)
 x (heavy exclamation mark ornament - 2762)
0022 QUOTATION MARK
 * neutral (vertical), used as opening or closing quotation mark
 * preferred characters in English for paired quotation marks are 201C & 201D
 x (modifier letter double prime - 02BA)
 x (combining double acute accent - 030B)
 x (combining double vertical line above - 030E)
 x (double prime - 2033)
 x (ditto mark - 3003)
0023 NUMBER SIGN
 = pound sign, hash, crosshatch, octothorpe
0024 DOLLAR SIGN
 = milreis, escudo
 * glyph may have one or two vertical bars
 * other currency symbol characters: 20A0-20AF
 x (currency sign - 00A4)
0025 PERCENT SIGN
 x (arabic percent sign - 066A)
 x (per mille sign - 2030)
 x (per ten thousand sign - 2031)
 x (commercial minus sign - 2052)
0026 AMPERSAND
0027 APOSTROPHE
 = APOSTROPHE-QUOTE
 = APL quote
 * neutral (vertical) glyph having mixed usage
 * preferred character for apostrophe is 2019
 * preferred characters in English for paired quotation marks are 2018 & 2019
 x (modifier letter prime - 02B9)
 x (modifier letter apostrophe - 02BC)
 x (modifier letter vertical line - 02C8)
 x (combining acute accent - 0301)
 x (prime - 2032)
0028 LEFT PARENTHESIS
 = OPENING PARENTHESIS
0029 RIGHT PARENTHESIS
 = CLOSING PARENTHESIS
 * see discussion on semantics of paired bracketing characters
002A ASTERISK
 = star (on phone keypads)
 x (arabic five pointed star - 066D)
 x (low asterisk - 204E)
 x (asterisk operator - 2217)
 x (heavy asterisk - 2731)
002B PLUS SIGN
002C COMMA
 = decimal separator
 x (arabic comma - 060C)
 x (single low-9 quotation mark - 201A)
 x (ideographic comma - 3001)
002D HYPHEN-MINUS
 = hyphen or minus sign
 * used for either hyphen or minus sign
 x (hyphen - 2010)
 x (non-breaking hyphen - 2011)
 x (figure dash - 2012)
 x (en dash - 2013)
 x (minus sign - 2212)
002E FULL STOP
 = PERIOD
 = dot, decimal point
 * may be rendered as a raised decimal point in old style numbers
 x (arabic full stop - 06D4)
 x (ideographic full stop - 3002)
002F SOLIDUS
 = SLASH
 = virgule, shilling (British)
 x (latin letter dental click - 01C0)
 x (combining long solidus overlay - 0338)
 x (fraction slash - 2044)
 x (division slash - 2215)
0030 DIGIT ZERO
0031 DIGIT ONE
0032 DIGIT TWO
0033 DIGIT THREE
0034 DIGIT FOUR
0035 DIGIT FIVE
0036 DIGIT SIX
0037 DIGIT SEVEN
0038 DIGIT EIGHT
0039 DIGIT NINE
003A COLON
 x (armenian full stop - 0589)
 x (hebrew punctuation sof pasuq - 05C3)
 x (ratio - 2236)
003B SEMICOLON
 x (greek question mark - 037E)
 x (arabic semicolon - 061B)
 x (reversed semicolon - 204F)
003C LESS-THAN SIGN
 x (single left-pointing angle quotation mark - 2039)
 x (left-pointing angle bracket - 2329)
 x (left angle bracket - 3008)
003D EQUALS SIGN
 * other related characters: 2241-2263
 x (not equal to - 2260)
 x (identical to - 2261)
003E GREATER-THAN SIGN
 x (single right-pointing angle quotation mark - 203A)
 x (right-pointing angle bracket - 232A)
 x (right angle bracket - 3009)
003F QUESTION MARK
 x (inverted question mark - 00BF)
 x (greek question mark - 037E)
 x (arabic question mark - 061F)
 x (interrobang - 203D)
 x (question exclamation mark - 2048)
 x (exclamation question mark - 2049)
0040 COMMERCIAL AT
0041 LATIN CAPITAL LETTER A
0042 LATIN CAPITAL LETTER B
 x (script capital b - 212C)
0043 LATIN CAPITAL LETTER C
 x (double-struck capital c - 2102)
 x (black-letter capital c - 212D)
0044 LATIN CAPITAL LETTER D
0045 LATIN CAPITAL LETTER E
 x (euler constant - 2107)
 x (script capital e - 2130)
0046 LATIN CAPITAL LETTER F
 x (script capital f - 2131)
 x (turned capital f - 2132)
0047 LATIN CAPITAL LETTER G
 * invented circa 300 BCE by Spurius Carvilius Ruga, who added a stroke to the letter C
0048 LATIN CAPITAL LETTER H
 x (script capital h - 210B)
 x (black-letter capital h - 210C)
 x (double-struck capital h - 210D)
0049 LATIN CAPITAL LETTER I
 * Turkish and Azerbaijani use 0131 for lowercase
 x (latin capital letter i with dot above - 0130)
 x (cyrillic capital letter byelorussian-ukrainian i - 0406)
 x (cyrillic letter palochka - 04C0)
 x (script capital i - 2110)
 x (black-letter capital i - 2111)
 x (roman numeral one - 2160)
004A LATIN CAPITAL LETTER J
004B LATIN CAPITAL LETTER K
 x (kelvin sign - 212A)
004C LATIN CAPITAL LETTER L
 x (script capital l - 2112)
004D LATIN CAPITAL LETTER M
 x (script capital m - 2133)
004E LATIN CAPITAL LETTER N
 x (double-struck capital n - 2115)
004F LATIN CAPITAL LETTER O
0050 LATIN CAPITAL LETTER P
 x (double-struck capital p - 2119)
0051 LATIN CAPITAL LETTER Q
 x (double-struck capital q - 211A)
0052 LATIN CAPITAL LETTER R
 x (script capital r - 211B)
 x (black-letter capital r - 211C)
 x (double-struck capital r - 211D)
0053 LATIN CAPITAL LETTER S
0054 LATIN CAPITAL LETTER T
0055 LATIN CAPITAL LETTER U
0056 LATIN CAPITAL LETTER V
0057 LATIN CAPITAL LETTER W
0058 LATIN CAPITAL LETTER X
0059 LATIN CAPITAL LETTER Y
005A LATIN CAPITAL LETTER Z
 x (double-struck capital z - 2124)
 x (black-letter capital z - 2128)
005B LEFT SQUARE BRACKET
 = OPENING SQUARE BRACKET
 * other bracket characters: 3008-301B
005C REVERSE SOLIDUS
 = BACKSLASH
 x (combining reverse solidus overlay - 20E5)
 x (set minus - 2216)
005D RIGHT SQUARE BRACKET
 = CLOSING SQUARE BRACKET
005E CIRCUMFLEX ACCENT
 * this is a spacing character
 x (modifier letter up arrowhead - 02C4)
 x (modifier letter circumflex accent - 02C6)
 x (combining circumflex accent - 0302)
 x (up arrowhead - 2303)
005F LOW LINE
 = SPACING UNDERSCORE
 * this is a spacing character
 x (modifier letter low macron - 02CD)
 x (combining macron below - 0331)
 x (combining low line - 0332)
 x (double low line - 2017)
0060 GRAVE ACCENT
 * this is a spacing character
 x (modifier letter grave accent - 02CB)
 x (combining grave accent - 0300)
 x (reversed prime - 2035)
0061 LATIN SMALL LETTER A
0062 LATIN SMALL LETTER B
0063 LATIN SMALL LETTER C
0064 LATIN SMALL LETTER D
0065 LATIN SMALL LETTER E
 x (estimated symbol - 212E)
 x (script small e - 212F)
0066 LATIN SMALL LETTER F
0067 LATIN SMALL LETTER G
 x (latin small letter script g - 0261)
 x (script small g - 210A)
0068 LATIN SMALL LETTER H
 x (cyrillic small letter shha - 04BB)
 x (planck constant - 210E)
0069 LATIN SMALL LETTER I
 * Turkish and Azerbaijani use 0130 for uppercase
 x (latin small letter dotless i - 0131)
006A LATIN SMALL LETTER J
006B LATIN SMALL LETTER K
006C LATIN SMALL LETTER L
 x (script small l - 2113)
006D LATIN SMALL LETTER M
006E LATIN SMALL LETTER N
 x (superscript latin small letter n - 207F)
006F LATIN SMALL LETTER O
 x (script small o - 2134)
0070 LATIN SMALL LETTER P
0071 LATIN SMALL LETTER Q
0072 LATIN SMALL LETTER R
0073 LATIN SMALL LETTER S
0074 LATIN SMALL LETTER T
0075 LATIN SMALL LETTER U
0076 LATIN SMALL LETTER V
0077 LATIN SMALL LETTER W
0078 LATIN SMALL LETTER X
0079 LATIN SMALL LETTER Y
007A LATIN SMALL LETTER Z
 x (latin small letter z with stroke - 01B6)
007B LEFT CURLY BRACKET
 = OPENING CURLY BRACKET
 = opening brace
007C VERTICAL LINE
 = VERTICAL BAR
 * used in pairs to indicate absolute value
 x (latin letter dental click - 01C0)
 x (hebrew punctuation paseq - 05C0)
 x (divides - 2223)
 x (light vertical bar - 2758)
007D RIGHT CURLY BRACKET
 = CLOSING CURLY BRACKET
 = closing brace
007E TILDE
 * this is a spacing character
 x (small tilde - 02DC)
 x (combining tilde - 0303)
 x (tilde operator - 223C)
 x (fullwidth tilde - FF5E)
007F <control>
 = DELETE
@@ 0080 C1 Controls and Latin-1 Supplement (Latin-1 Supplement) 00FF
@ C1 controls
@+ Alias names are those for ISO/IEC 6429:1992.
0080 <control>
0081 <control>
0082 <control>
 = BREAK PERMITTED HERE
0083 <control>
 = NO BREAK HERE
0084 <control>
0085 <control>
 = NEXT LINE (NEL)
0086 <control>
 = START OF SELECTED AREA
0087 <control>
 = END OF SELECTED AREA
0088 <control>
 = CHARACTER TABULATION SET
0089 <control>
 = CHARACTER TABULATION WITH JUSTIFICATION
008A <control>
 = LINE TABULATION SET
008B <control>
 = PARTIAL LINE FORWARD
008C <control>
 = PARTIAL LINE BACKWARD
008D <control>
 = REVERSE LINE FEED
008E <control>
 = SINGLE SHIFT TWO
008F <control>
 = SINGLE SHIFT THREE
0090 <control>
 = DEVICE CONTROL STRING
0091 <control>
 = PRIVATE USE ONE
0092 <control>
 = PRIVATE USE TWO
0093 <control>
 = SET TRANSMIT STATE
0094 <control>
 = CANCEL CHARACTER
0095 <control>
 = MESSAGE WAITING
0096 <control>
 = START OF GUARDED AREA
0097 <control>
 = END OF GUARDED AREA
0098 <control>
 = START OF STRING
0099 <control>
009A <control>
 = SINGLE CHARACTER INTRODUCER
009B <control>
 = CONTROL SEQUENCE INTRODUCER
009C <control>
 = STRING TERMINATOR
009D <control>
 = OPERATING SYSTEM COMMAND
009E <control>
 = PRIVACY MESSAGE
009F <control>
 = APPLICATION PROGRAM COMMAND
@ ISO 8859-1 (aka Latin-1)
00A0 NO-BREAK SPACE
 x (space - 0020)
 x (figure space - 2007)
 x (narrow no-break space - 202F)
 x (word joiner - 2060)
 x (zero width no-break space - FEFF)
 # <noBreak> 0020
00A1 INVERTED EXCLAMATION MARK
 * Spanish, Asturian, Galician
 x (exclamation mark - 0021)
00A2 CENT SIGN
00A3 POUND SIGN
 = pound sterling, Irish punt
 x (lira sign - 20A4)
00A4 CURRENCY SIGN
 = Filzlaus, Ricardi-Sonne (German names)
 * other currency symbol characters: 20A0-20AF
 x (dollar sign - 0024)
00A5 YEN SIGN
 = yuan sign
 * glyph may have one or two crossbars
00A6 BROKEN BAR
 = BROKEN VERTICAL BAR
 = parted rule (in typography)
00A7 SECTION SIGN
 * paragraph sign in some European usage
00A8 DIAERESIS
 * this is a spacing character
 x (combining diaeresis - 0308)
 # 0020 0308
00A9 COPYRIGHT SIGN
 x (sound recording copyright - 2117)
00AA FEMININE ORDINAL INDICATOR
 * Spanish
 # <super> 0061
00AB LEFT-POINTING DOUBLE ANGLE QUOTATION MARK *
 = LEFT POINTING GUILLEMET
 = chevrons (in typography)
 * usually opening, sometimes closing
 x (much less-than - 226A)
 x (left double angle bracket - 300A)
00AC NOT SIGN
 = angled dash (in typography)
 x (reversed not sign - 2310)
00AD SOFT HYPHEN
 = discretionary hyphen
 x (mongolian todo soft hyphen - 1806)
00AE REGISTERED SIGN
 = REGISTERED TRADE MARK SIGN
00AF MACRON
 = overline, APL overbar
 * this is a spacing character
 x (modifier letter macron - 02C9)
 x (combining macron - 0304)
 x (combining overline - 0305)
 # 0020 0304
00B0 DEGREE SIGN
 * this is a spacing character
 x (ring above - 02DA)
 x (combining ring above - 030A)
 x (superscript zero - 2070)
 x (ring operator - 2218)
00B1 PLUS-MINUS SIGN
 x (minus-or-plus sign - 2213)
00B2 SUPERSCRIPT TWO
 = squared
 * other superscript digit characters: 2070-2079
 x (superscript one - 00B9)
 # <super> 0032
00B3 SUPERSCRIPT THREE
 = cubed
 x (superscript one - 00B9)
 # <super> 0033
00B4 ACUTE ACCENT
 * this is a spacing character
 x (modifier letter prime - 02B9)
 x (modifier letter acute accent - 02CA)
 x (combining acute accent - 0301)
 x (prime - 2032)
 # 0020 0301
00B5 MICRO SIGN
 # 03BC greek small letter mu
00B6 PILCROW SIGN
 = PARAGRAPH SIGN
 * section sign in some European usage
 x (reversed pilcrow sign - 204B)
 x (curved stem paragraph sign ornament - 2761)
00B7 MIDDLE DOT
 = midpoint (in typography)
 = Georgian comma
 = Greek middle dot
 x (greek ano teleia - 0387)
 x (bullet - 2022)
 x (one dot leader - 2024)
 x (hyphenation point - 2027)
 x (bullet operator - 2219)
 x (dot operator - 22C5)
 x (katakana middle dot - 30FB)
00B8 CEDILLA
 * this is a spacing character
 * other spacing accent characters: 02D8-02DB
 x (combining cedilla - 0327)
 # 0020 0327
00B9 SUPERSCRIPT ONE
 x (superscript two - 00B2)
 x (superscript three - 00B3)
 # <super> 0031
00BA MASCULINE ORDINAL INDICATOR
 * Spanish
 # <super> 006F
00BB RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK *
 = RIGHT POINTING GUILLEMET
 * usually closing, sometimes opening
 x (much greater-than - 226B)
 x (right double angle bracket - 300B)
00BC VULGAR FRACTION ONE QUARTER
 * bar may be horizontal or slanted
 * other fraction characters: 2153-215E
 # 0031 2044 0034
00BD VULGAR FRACTION ONE HALF
 * bar may be horizontal or slanted
 # 0031 2044 0032
00BE VULGAR FRACTION THREE QUARTERS
 * bar may be horizontal or slanted
 # 0033 2044 0034
00BF INVERTED QUESTION MARK
 = turned question mark
 * Spanish
 x (question mark - 003F)
00C0 LATIN CAPITAL LETTER A WITH GRAVE
 : 0041 0300
00C1 LATIN CAPITAL LETTER A WITH ACUTE
 : 0041 0301
00C2 LATIN CAPITAL LETTER A WITH CIRCUMFLEX
 : 0041 0302
00C3 LATIN CAPITAL LETTER A WITH TILDE
 : 0041 0303
00C4 LATIN CAPITAL LETTER A WITH DIAERESIS
 : 0041 0308
00C5 LATIN CAPITAL LETTER A WITH RING ABOVE
 x (angstrom sign - 212B)
 : 0041 030A
00C6 LATIN CAPITAL LETTER AE (ash) *
 = LATIN CAPITAL LIGATURE AE
00C7 LATIN CAPITAL LETTER C WITH CEDILLA
 : 0043 0327
00C8 LATIN CAPITAL LETTER E WITH GRAVE
 : 0045 0300
00C9 LATIN CAPITAL LETTER E WITH ACUTE
 : 0045 0301
00CA LATIN CAPITAL LETTER E WITH CIRCUMFLEX
 : 0045 0302
00CB LATIN CAPITAL LETTER E WITH DIAERESIS
 : 0045 0308
00CC LATIN CAPITAL LETTER I WITH GRAVE
 : 0049 0300
00CD LATIN CAPITAL LETTER I WITH ACUTE
 : 0049 0301
00CE LATIN CAPITAL LETTER I WITH CIRCUMFLEX
 : 0049 0302
00CF LATIN CAPITAL LETTER I WITH DIAERESIS
 : 0049 0308
00D0 LATIN CAPITAL LETTER ETH (Icelandic)
 x (latin small letter eth - 00F0)
 x (latin capital letter d with stroke - 0110)
 x (latin capital letter african d - 0189)
00D1 LATIN CAPITAL LETTER N WITH TILDE
 : 004E 0303
00D2 LATIN CAPITAL LETTER O WITH GRAVE
 : 004F 0300
00D3 LATIN CAPITAL LETTER O WITH ACUTE
 : 004F 0301
00D4 LATIN CAPITAL LETTER O WITH CIRCUMFLEX
 : 004F 0302
00D5 LATIN CAPITAL LETTER O WITH TILDE
 : 004F 0303
00D6 LATIN CAPITAL LETTER O WITH DIAERESIS
 : 004F 0308
00D7 MULTIPLICATION SIGN
 = z notation Cartesian product
00D8 LATIN CAPITAL LETTER O WITH STROKE
 = LATIN CAPITAL LETTER O SLASH
 x (empty set - 2205)
00D9 LATIN CAPITAL LETTER U WITH GRAVE
 : 0055 0300
00DA LATIN CAPITAL LETTER U WITH ACUTE
 : 0055 0301
00DB LATIN CAPITAL LETTER U WITH CIRCUMFLEX
 : 0055 0302
00DC LATIN CAPITAL LETTER U WITH DIAERESIS
 : 0055 0308
00DD LATIN CAPITAL LETTER Y WITH ACUTE
 : 0059 0301
00DE LATIN CAPITAL LETTER THORN (Icelandic)
00DF LATIN SMALL LETTER SHARP S (German)
 = Eszett
 * German
 * uppercase is "SS"
 * in origin a ligature of 017F and 0073
 x (greek small letter beta - 03B2)
00E0 LATIN SMALL LETTER A WITH GRAVE
 : 0061 0300
00E1 LATIN SMALL LETTER A WITH ACUTE
 : 0061 0301
00E2 LATIN SMALL LETTER A WITH CIRCUMFLEX
 : 0061 0302
00E3 LATIN SMALL LETTER A WITH TILDE
 * Portuguese
 : 0061 0303
00E4 LATIN SMALL LETTER A WITH DIAERESIS
 : 0061 0308
00E5 LATIN SMALL LETTER A WITH RING ABOVE
 * Danish, Norwegian, Swedish, Walloon
 : 0061 030A
00E6 LATIN SMALL LETTER AE (ash) *
 = LATIN SMALL LIGATURE AE