1.1 --- /dev/null Thu Jan 01 00:00:00 1970 +0000 1.2 +++ b/dom/encoding/domainsfallbacks.properties Wed Dec 31 06:09:35 2014 +0100 1.3 @@ -0,0 +1,167 @@ 1.4 +# This Source Code Form is subject to the terms of the Mozilla Public 1.5 +# License, v. 2.0. If a copy of the MPL was not distributed with this 1.6 +# file, You can obtain one at http://mozilla.org/MPL/2.0/. 1.7 + 1.8 +# This file contains educated guesses about which top-level domains are 1.9 +# likely to host legacy content that assumes a non-windows-1252 encoding. 1.10 +# Punycode TLDs are included on the theory that legacy content might appear 1.11 +# behind those relatively new TLDs if DNS just points to a legacy server. 1.12 +# 1.13 +# Encodings for which a confident-enough educated guess is missing are 1.14 +# listed in nonparticipatingdomains.properties. Domains that are listed 1.15 +# neither there nor here get windows-1252 as the associated fallback. 1.16 +# 1.17 +# The list below includes Arabic-script TLDs not on IANA list but on the 1.18 +# ICANN list: 1.19 +# http://www.icann.org/en/resources/idn/fast-track/string-evaluation-completion 1.20 +# Otherwise, the list includes non-windows-1252-affilited country TLDs from 1.21 +# https://data.iana.org/TLD/tlds-alpha-by-domain.txt 1.22 +# 1.23 +# The guesses are assigned as follows: 1.24 +# * If the country has a dominant country-affiliated language and that language 1.25 +# is part of the languages to fallbacks mapping, use the encoding for that 1.26 +# language from that mapping. 1.27 +# * Use windows-1256 for countries that have a dominant Arabic-script 1.28 +# language or whose all languages are Arabic-script languages. 1.29 +# * Use windows-1251 likewise but for Cyrillic script. 1.30 + 1.31 +ae=windows-1256 1.32 +xn--mgbaam7a8h=windows-1256 1.33 + 1.34 +af=windows-1256 1.35 + 1.36 +bg=windows-1251 1.37 + 1.38 +bh=windows-1256 1.39 + 1.40 +by=windows-1251 1.41 + 1.42 +cn=gbk 1.43 +xn--fiqs8s=gbk 1.44 +# Assume that Traditional Chinese TLD is meant to work if URL input happens to 1.45 +# be in the traditional mode. Expect content to be simplified anyway. 1.46 +xn--fiqz9s=gbk 1.47 + 1.48 +cz=windows-1250 1.49 + 1.50 +dz=windows-1256 1.51 +xn--lgbbat1ad8j=windows-1256 1.52 + 1.53 +ee=windows-1257 1.54 + 1.55 +eg=windows-1256 1.56 +xn--wgbh1c=windows-1256 1.57 + 1.58 +gr=ISO-8859-7 1.59 + 1.60 +hk=Big5-HKSCS 1.61 +xn--j6w193g=Big5-HKSCS 1.62 + 1.63 +hr=windows-1250 1.64 + 1.65 +hu=ISO-8859-2 1.66 + 1.67 +iq=windows-1256 1.68 + 1.69 +ir=windows-1256 1.70 +xn--mgba3a4f16a=windows-1256 1.71 + 1.72 +jo=windows-1256 1.73 +xn--mgbayh7gpa=windows-1256 1.74 + 1.75 +jp=Shift_JIS 1.76 + 1.77 +kg=windows-1251 1.78 + 1.79 +kp=EUC-KR 1.80 + 1.81 +kr=EUC-KR 1.82 +xn--3e0b707e=EUC-KR 1.83 + 1.84 +kw=windows-1256 1.85 + 1.86 +kz=windows-1251 1.87 +xn--80ao21a=windows-1251 1.88 + 1.89 +lb=windows-1256 1.90 + 1.91 +lt=windows-1257 1.92 + 1.93 +lv=windows-1257 1.94 + 1.95 +ma=windows-1256 1.96 +xn--mgbc0a9azcg=windows-1256 1.97 + 1.98 +mk=windows-1251 1.99 + 1.100 +mn=windows-1251 1.101 +xn--l1acc=windows-1251 1.102 + 1.103 +mo=Big5 1.104 + 1.105 +# my 1.106 +xn--mgbx4cd0ab=windows-1256 1.107 + 1.108 +om=windows-1256 1.109 +xn--mgb9awbf=windows-1256 1.110 + 1.111 +#pk 1.112 +xn--mgbai9azgqp6j=windows-1256 1.113 + 1.114 +pl=ISO-8859-2 1.115 + 1.116 +ps=windows-1256 1.117 +xn--ygbi2ammx=windows-1256 1.118 + 1.119 +qa=windows-1256 1.120 +xn--wgbl6a=windows-1256 1.121 + 1.122 +rs=windows-1251 1.123 +xn--90a3ac=windows-1251 1.124 + 1.125 +ru=windows-1251 1.126 +xn--p1ai=windows-1251 1.127 + 1.128 +sa=windows-1256 1.129 +xn--mgberp4a5d4ar=windows-1256 1.130 + 1.131 +sd=windows-1256 1.132 +xn--mgbpl2fh=windows-1256 1.133 + 1.134 +sg=gbk 1.135 +xn--yfro4i67o=gbk 1.136 + 1.137 +si=ISO-8859-2 1.138 + 1.139 +sk=windows-1250 1.140 + 1.141 +su=windows-1251 1.142 + 1.143 +sy=windows-1256 1.144 +xn--mgbtf8fl=windows-1256 1.145 + 1.146 +th=windows-874 1.147 +xn--o3cw4h=windows-874 1.148 + 1.149 +tj=windows-1251 1.150 + 1.151 +tn=windows-1256 1.152 +xn--pgbs0dh=windows-1256 1.153 + 1.154 +tr=windows-1254 1.155 + 1.156 +tw=Big5 1.157 +# Assume that the Simplified Chinese TLD is meant to work when URL input 1.158 +# happens in the simplified mode. Assume content is tradition anyway. 1.159 +xn--kprw13d=Big5 1.160 +xn--kpry57d=Big5 1.161 + 1.162 +ua=windows-1251 1.163 +xn--j1amh=windows-1251 1.164 + 1.165 +uz=windows-1251 1.166 + 1.167 +vn=windows-1258 1.168 + 1.169 +ye=windows-1256 1.170 +xn--mgb2ddes=windows-1256