This is a Unicode converter test file containing Unicode data. Its encoding is
determined by the second-to-last dot-separated component of the filename. For
example, if this file is named foo.utf8.txt, its encoding is UTF-8; if this file
is named foo.utf16le.txt, its encoding is UTF-16LE. This file is marked as
binary in Mozilla's version control system so that it's not accidentally
"mangled".

The contents of each file must differ ONLY by encoding, so if you edit this file
you must edit all files with the name of this file (with the encoding-specific
part changed).

== BEGIN UNICODE TEST DATA ==

== U+000000 -- U+00007F ==

BELL: ""
DATA LINK ESCAPE: ""
DELETE: ""

== U+000080 -- U+0007FF ==

CONTROL: "€"
NO-BREAK SPACE: " "
POUND SIGN: "£"
YEN SIGN: "¥"
CURRENCY SIGN: "¢"
LATIN SMALL LETTER SCHWA: "ə"
LATIN LETTER BILABIAL PERCUSSIVE: "ʬ"

== U+000800 -- U+00FFFF ==

BUGINESE LETTER TA: "ᨈ"
BUGINESE LETTER DA: "ᨉ"
AIRPLANE: "✈"
ZERO WIDTH NO-BREAK SPACE: ""


== U+010000 -- U+10FFFF ==

SHAVIAN LETTER IAN: "𐑾"
MUSICAL SYMBOL ONE HUNDRED TWENTY-EIGHTH NOTE: "𝅘𝅥𝅲"
CJK UNIFIED IDEOGRAPH-20000: "𠀀"
(private use U+10FEFF): "􏻿"