intl/uconv/tests/unit/data/unicode-conversion.utf8.txt

Wed, 31 Dec 2014 07:22:50 +0100

author
Michael Schloh von Bennewitz <michael@schloh.com>
date
Wed, 31 Dec 2014 07:22:50 +0100
branch
TOR_BUG_3246
changeset 4
fc2d59ddac77
permissions
-rw-r--r--

Correct previous dual key logic pending first delivery installment.

michael@0 1 This is a Unicode converter test file containing Unicode data. Its encoding is
michael@0 2 determined by the second-to-last dot-separated component of the filename. For
michael@0 3 example, if this file is named foo.utf8.txt, its encoding is UTF-8; if this file
michael@0 4 is named foo.utf16le.txt, its encoding is UTF-16LE. This file is marked as
michael@0 5 binary in Mozilla's version control system so that it's not accidentally
michael@0 6 "mangled".
michael@0 7
michael@0 8 The contents of each file must differ ONLY by encoding, so if you edit this file
michael@0 9 you must edit all files with the name of this file (with the encoding-specific
michael@0 10 part changed).
michael@0 11
michael@0 12 == BEGIN UNICODE TEST DATA ==
michael@0 13
michael@0 14 == U+000000 -- U+00007F ==
michael@0 15
michael@0 16 BELL: ""
michael@0 17 DATA LINK ESCAPE: ""
michael@0 18 DELETE: ""
michael@0 19
michael@0 20 == U+000080 -- U+0007FF ==
michael@0 21
michael@0 22 CONTROL: "€"
michael@0 23 NO-BREAK SPACE: " "
michael@0 24 POUND SIGN: "£"
michael@0 25 YEN SIGN: "¥"
michael@0 26 CURRENCY SIGN: "¢"
michael@0 27 LATIN SMALL LETTER SCHWA: "ə"
michael@0 28 LATIN LETTER BILABIAL PERCUSSIVE: "ʬ"
michael@0 29
michael@0 30 == U+000800 -- U+00FFFF ==
michael@0 31
michael@0 32 BUGINESE LETTER TA: "ᨈ"
michael@0 33 BUGINESE LETTER DA: "ᨉ"
michael@0 34 AIRPLANE: "✈"
michael@0 35 ZERO WIDTH NO-BREAK SPACE: ""
michael@0 36
michael@0 37
michael@0 38 == U+010000 -- U+10FFFF ==
michael@0 39
michael@0 40 SHAVIAN LETTER IAN: "𐑾"
michael@0 41 MUSICAL SYMBOL ONE HUNDRED TWENTY-EIGHTH NOTE: "𝅘𝅥𝅲"
michael@0 42 CJK UNIFIED IDEOGRAPH-20000: "𠀀"
michael@0 43 (private use U+10FEFF): "􏻿"

mercurial