|
1 This is a Unicode converter test file containing Unicode data. Its encoding is |
|
2 determined by the second-to-last dot-separated component of the filename. For |
|
3 example, if this file is named foo.utf8.txt, its encoding is UTF-8; if this file |
|
4 is named foo.utf16le.txt, its encoding is UTF-16LE. This file is marked as |
|
5 binary in Mozilla's version control system so that it's not accidentally |
|
6 "mangled". |
|
7 |
|
8 The contents of each file must differ ONLY by encoding, so if you edit this file |
|
9 you must edit all files with the name of this file (with the encoding-specific |
|
10 part changed). |
|
11 |
|
12 == BEGIN UNICODE TEST DATA == |
|
13 |
|
14 == U+000000 -- U+00007F == |
|
15 |
|
16 BELL: "" |
|
17 DATA LINK ESCAPE: "" |
|
18 DELETE: "" |
|
19 |
|
20 == U+000080 -- U+0007FF == |
|
21 |
|
22 CONTROL: "" |
|
23 NO-BREAK SPACE: " " |
|
24 POUND SIGN: "£" |
|
25 YEN SIGN: "¥" |
|
26 CURRENCY SIGN: "¢" |
|
27 LATIN SMALL LETTER SCHWA: "ə" |
|
28 LATIN LETTER BILABIAL PERCUSSIVE: "ʬ" |
|
29 |
|
30 == U+000800 -- U+00FFFF == |
|
31 |
|
32 BUGINESE LETTER TA: "ᨈ" |
|
33 BUGINESE LETTER DA: "ᨉ" |
|
34 AIRPLANE: "✈" |
|
35 ZERO WIDTH NO-BREAK SPACE: "" |
|
36 |
|
37 |
|
38 == U+010000 -- U+10FFFF == |
|
39 |
|
40 SHAVIAN LETTER IAN: "𐑾" |
|
41 MUSICAL SYMBOL ONE HUNDRED TWENTY-EIGHTH NOTE: "𝅘𝅥𝅲" |
|
42 CJK UNIFIED IDEOGRAPH-20000: "𠀀" |
|
43 (private use U+10FEFF): "" |