intl/uconv/tests/unit/data/unicode-conversion.utf8.txt

Wed, 31 Dec 2014 07:22:50 +0100

author
Michael Schloh von Bennewitz <michael@schloh.com>
date
Wed, 31 Dec 2014 07:22:50 +0100
branch
TOR_BUG_3246
changeset 4
fc2d59ddac77
permissions
-rw-r--r--

Correct previous dual key logic pending first delivery installment.

     1 This is a Unicode converter test file containing Unicode data.  Its encoding is
     2 determined by the second-to-last dot-separated component of the filename.  For
     3 example, if this file is named foo.utf8.txt, its encoding is UTF-8; if this file
     4 is named foo.utf16le.txt, its encoding is UTF-16LE.  This file is marked as
     5 binary in Mozilla's version control system so that it's not accidentally
     6 "mangled".
     8 The contents of each file must differ ONLY by encoding, so if you edit this file
     9 you must edit all files with the name of this file (with the encoding-specific
    10 part changed).
    12 == BEGIN UNICODE TEST DATA ==
    14 == U+000000 -- U+00007F ==
    16 BELL:              ""
    17 DATA LINK ESCAPE:  ""
    18 DELETE:            ""
    20 == U+000080 -- U+0007FF ==
    22 CONTROL:                           "€"
    23 NO-BREAK SPACE:                    " "
    24 POUND SIGN:                        "£"
    25 YEN SIGN:                          "¥"
    26 CURRENCY SIGN:                     "¢"
    27 LATIN SMALL LETTER SCHWA:          "ə"
    28 LATIN LETTER BILABIAL PERCUSSIVE:  "ʬ"
    30 == U+000800 -- U+00FFFF ==
    32 BUGINESE LETTER TA:         "ᨈ"
    33 BUGINESE LETTER DA:         "ᨉ"
    34 AIRPLANE:                   "✈"
    35 ZERO WIDTH NO-BREAK SPACE:  ""
    38 == U+010000 -- U+10FFFF ==
    40 SHAVIAN LETTER IAN:                             "𐑾"
    41 MUSICAL SYMBOL ONE HUNDRED TWENTY-EIGHTH NOTE:  "𝅘𝅥𝅲"
    42 CJK UNIFIED IDEOGRAPH-20000:                    "𠀀"
    43 (private use U+10FEFF):                         "􏻿"

mercurial