If you are going to do any analysis of the differences, please take a look also at my message of <br><br><table cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td style="font-size: 80%; text-indent: 4px;" nowrap="nowrap">
From: <b id="_user_mark.davis@icu-project.org">Mark Davis <<a href="mailto:mark.davis@icu-project.org">mark.davis@icu-project.org</a>></b></td><td style="font-size: 65%;" align="right" nowrap="nowrap">Mailed-By: <b>
<a href="http://gmail.com">gmail.com</a></b></td></tr></tbody></table><br><div class="mhl">Subject: <b>Re: Languages and Scripts</b></div><br>which contained a listing of differences between the CLDR data and other Unicode data that we would like to reconcile. I repeat the data below.
<br><br>In particular, I had asked on this list if anyone knew anything about the couple of dozen purported languages at the end for which Magda couldn't find 639-3 language codes. Hearing no response, we will probably just remove them from the Unicode site as bogus.
<br><br>[abq] ; Abaza ; [Cyrl] ;<br>[ady] ; Adygei ; [Cyrl] ;<br>[ain] ; Ainu ; [Kana]; [Latn] ;<br>[amo] ; Amo ; [Latn] ;<br>[av] ; Avar (Avaric?) ; [Cyrl] ;<br>[awa] ; Awadhi ; [Deva] ;<br>[ba] ; Bashkir ; [Cyrl] ;
<br>[bbc] ; Batak toba ; [Batk]*; [Latn] ;<br>[bfq] ; Badaga ; [Taml] ;<br>[bft] ; Balti ; [Deva] ; [Removed Balti]<br>[bfy] ; Bagheli ; [Deva] ;<br>[bh] ; Bihari ; [Deva] ;<br>[bhb] ; Bhili ; [Deva] ;
<br>[bho] ; Bhojpuri ; [Deva] ;<br>[bjj] ; Kanauji ; [Deva] ;<br>[bku] ; Buhid ; [Buhd] ;<br>[br] ; Breton ; [Latn] ;<br>[bra] ; Braj bhasha ; [Deva] ;<br>[btk] ; Batak ; [Batk]*, [Latn] ;<br>[btv] ; Bateri (aka Bhatneri) ; [Deva] ;
<br>[ccp] ; Chakma ; [Beng]; ; [Removed Chakma]<br>[ce] ; Chechen ; [Cyrl] ;<br>[chm] ; Mari ; [Cyrl]; [Latn] ;<br>[cjs] ; Shor ; [Cyrl] ;<br>[co] ; Corsican ; [Latn] ;<br>[cop] ; Coptic ; [Arab]; [Copt] ; "[Added Grek, but is that right now?]"
<br>[cr] ; Cree ; [Cans]; [Latn] ;<br>[cv] ; Chuvash ; [Cyrl] ;<br>[dar] ; Dargwa ; [Cyrl] ;<br>[en] ; English ; [Latn] ; "[Had Shavian and Deseret, but those never<br>had any significant usage]"
<br>[evn] ; Evenki ; [Cyrl] ;<br>[gag] ; Gagauz ; [Cyrl] ;<br>[gbm] ; Garhwali ; [Deva]<br>[gd] ; Gaelic ; [Latn]<br>[gld] ; Nanai ; [Cyrl]<br>[gon] ; Gondi ; [Deva]; [Telu]<br>[grt] ; Garo ; [Beng]
<br>[hmn] ; Hmong ; [Latn]; [Hmng]*<br>[hnn] ; Hanunóo ; [Latn]; [Hano]<br>[hoc] ; Ho ; [Deva]<br>[hoj] ; Harauti ; [Deva]<br>[hop] ; Hopi ; [Latn]<br>[hy] ; Armenian ; [Armn]; [Syrc]*<br>[ibb] ; Ibibio ; [Latn]
<br>[id] ; Indonesian ; "[Arab]*, [Latn]"<br>[ik] ; Iñupiaq ; [Latn]<br>[inh] ; Ingush ; [Arab]; [Latn]<br>[jv] ; Javanese ; [Latn]; [Java]*<br>[kaa] ; Karakalpak ; [Cyrl]<br>[kac]? ; Kachchi ; [Deva]
<br>[kbd] ; Kabardian ; [Cyrl]<br>[kca] ; Khanty ; [Cyrl]<br>[kdt] ; Kuy ; Thai<br>[kha] ; Khasi ; [Latn]; [Beng]<br>[kht] ; Khamti ; [Mymr]<br>[kr] ; Kanuri ; [Latn]<br>[krc] ; Karachay ; [Cyrl]
<br>[krl] ; Karelian ; [Latn]; [Cyrl]<br>[kv] ; Komi ; [Cyrl]; [Latn]<br>[ky] ; Kirghiz ; [Arab]*; [Latn]; [Cyrl]<br>[lad] ; Ladino ; [Hebr]<br>[lbe] ; Lak ; [Cyrl]<br>[lcp] ; "Lawa, western" ; Thai
<br>[lep] ; Lepcha ; [Lepc]*<br>[lez] ; Lezghian (Lezghi?) ; [Cyrl]<br>[li]? ; Limbu ; [Deva]; [Limb]<br>[lis] ; Lisu ; "Lisu (Fraser)*, [Latn]"<br>[lmn] ; Lambadi ; [Telu]<br>[lut] ; Lushootseed ; [Latn]
<br>[lwl] ; "Lawa, eastern" ; Thai<br>[mnc] ; Manchu ; [Mong]<br>[mni] ; Meitei ; "Meetai Mayek*, [Beng]"<br>[mns] ; Mansi ; [Cyrl]<br>[mnw] ; Mon ; [Mymr]<br>[muw] ; Mundari ; [Beng]; [Deva]
<br>[mwr] ; Marwari ; [Deva]<br>[nbf] ; Naxi ; Naxi*<br>[new] ; Newari ; "[Deva]; Ranjana, Parachalit"<br>[nog] ; Nogai ; [Cyrl]<br>[nv] ; Navajo ; [Latn]<br>[om] ; Oromo ; [Ethi]*; [Latn] ; "[According to wikipedia, Ethi usage is old"
<br>[os] ; Ossetic ; [Cyrl]; [Latn] ;<br>[pi] ; Pali ; [Sinh]; [Deva]; [Thai] ;<br>[prd] ; Parsi-dari ; [Arab] ;<br>[prg] ; Prussian ; [Latn] ;<br>[ro] ; Romanian ; [Latn]; [Cyrl]* ;<br>[rom] ; Romany ; [Cyrl]; [Latn] ;
<br>[sa] ; Sanskrit ; [Sinh]; [Deva]; etc. ; [CLDR doesn't do 'etc': what<br>is the exact list: all modern Indic scripts + Sinhala?]<br>[sah] ; Yakut ; [Cyrl] ;<br>[sat] ; Santali ; [Deva]; [Beng]; [Orya]; [Olck]* ;
<br>[sel] ; Selkup ; [Cyrl] ;<br>[shn] ; Shan ; [Mymr] ;<br>[smi] ; Sami ; [Cyrl]; [Latn] ; [Is Cyrl common?]<br>[smp] ; Samaritan ; [Hebr] ; [Removed Samaritan*<br>[sn] ; Shona ; [Latn] ;<br>[syl] ; Sylhetti ; [Sylo]; [Beng] ;
<br>[tab] ; Tabasaran ; [Cyrl] ;<br>[tbw] ; Tagbanwa ; [Latn]; ; [Removed Tagbanwa]<br>[tcy] ; Tulu ; [Knda] ;<br>[tl] ; Tagalog ; [Latn]; [Tglg] ; [It appears that Tglg is not in modern use]
<br>[tru] ; Turoyo ; [Syrc] ;<br>[ttt] ; Tat ; [Cyrl] ;<br>[tut] ; Altai (Altaic?) ; [Cyrl] ;<br>[ty] ; Tahitian ; [Latn] ;<br>[udm] ; Udmurt ; [Cyrl]; [Latn] ;<br>[ug] ; Uighur ; [Arab]; [Latn]; [Cyrl] ; "[adds Latn, Cyrl; is the
<br>latter common? Removed Uighur]"<br>[vi] ; Vietnamese ; [Latn]; Chu Nom ; [Chu Nom would be Hani*]<br>[xal] ; Kalmyk ; [Cyrl] ;<br>[xsr] ; Sherpa ; [Deva] ;<br>[yrk] ; Nenets ; [Cyrl] ;
<br><br># Missing Language Codes<br><br>??? ; Aisor ; [Cyrl]<br>??? ; Assyrian (modern) ; [Syrc]<br>??? ; Bahasa ; [Latn]<br>??? ; Balear ; [Latn]<br>??? ; Balkar ; [Cyrl]<br>??? ; Bugis ; [Bugi]
<br>??? ; Buryat ; [Cyrl]<br>??? ; Cham ; [Cham]*<br>??? ; Chhattisgarhi ; [Deva]<br>??? ; Chukchi ; [Cyrl]<br>??? ; Dungan ; [Cyrl]<br>??? ; Edo ; [Latn]<br>??? ; Garshuni ; [Syrc]
<br>??? ; Gascon ; [Latn]<br>??? ; Judezmo ; [Hebr]<br>??? ; Kankan ; [Deva]<br>??? ; Khakass ; [Cyrl]<br>??? ; Koryak ; [Cyrl]<br>??? ; Lapp ; [Latn]<br>??? ; Mordvin ; [Cyrl]
<br>??? ; Naga ; [Latn]; [Beng]<br>??? ; Riang ; [Beng]<br>??? ; Swadaya ; [Syrc]<br>??? ; Tamazight ; [Tfng], [Latn]<br>??? ; Tsalagi ; (see Cherokee)<br>??? ; Tuva ; [Cyrl]<br>??? ; Udekhe ; [Cyrl]
<br><br><div><span class="gmail_quote">On 9/27/06, <b class="gmail_sendername">Frank Ellermann</b> <<a href="mailto:nobody@xyzzy.claranet.de">nobody@xyzzy.claranet.de</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
John Cowan wrote:<br><br>> I'll post (some of) the current state of things shortly.<br><br>Here's a list of 67 differences between LTRU and CLDR 1.4<br><br>Lines where LTRU has no script and CLDR more than one are<br>most probably sane, but anything else should be checked:
<br><br>lang aa ltru cldr Latn<br>lang ak ltru cldr Latn<br>lang az ltru cldr Arab Cyrl Latn<br>lang bo ltru cldr Tibt<br>lang cr ltru cldr Cans Latn<br>lang ee ltru cldr Latn<br>lang fy ltru cldr Latn
<br>lang gd ltru cldr Latn<br>lang ha ltru cldr Arab Latn<br>lang ho ltru cldr Latn<br>lang ia ltru cldr Latn<br>lang ig ltru cldr Latn<br>lang iu ltru cldr Cans Cyrl Latn<br>lang ja ltru cldr Hani Hira Kana
<br>lang ko ltru cldr Hang Hani<br>lang ks ltru cldr Arab Deva<br>lang ku ltru cldr Arab Cyrl Latn<br>lang kw ltru cldr Latn<br>lang ky ltru cldr Arab Cyrl<br>lang mi ltru cldr Latn<br>
lang mn ltru cldr Cyrl Mong<br>lang mo ltru Latn cldr Cyrl Latn<br>lang ms ltru Latn cldr Arab Latn<br>lang oc ltru cldr Latn<br>lang os ltru cldr Latn<br>lang pa ltru Guru cldr Arab Guru<br>lang rm ltru cldr Latn
<br>lang sa ltru cldr Deva<br>lang sd ltru cldr Arab Deva<br>lang se ltru cldr Latn<br>lang sh ltru cldr Latn<br>lang sr ltru cldr Cyrl Latn<br>lang tg ltru cldr Arab Cyrl Latn<br>lang tk ltru cldr Arab Cyrl Latn
<br>lang tr ltru Latn cldr Arab Latn<br>lang tt ltru cldr Cyrl<br>lang ug ltru cldr Arab<br>lang uz ltru cldr Arab Cyrl Latn<br>lang yo ltru cldr Latn<br>lang zh ltru cldr Bopo Hani Hans Hant
<br>lang bal ltru cldr Arab Latn<br>lang byn ltru cldr Ethi<br>lang cch ltru cldr Latn<br>lang chr ltru cldr Cher Latn<br>lang cop ltru cldr Arab Copt<br>lang fil ltru cldr Latn<br>lang fiu ltru cldr Latn
<br>lang fur ltru cldr Latn<br>lang gaa ltru cldr Latn<br>lang gez ltru cldr Ethi<br>lang gsw ltru cldr Latn<br>lang haw ltru cldr Latn<br>lang kaj ltru cldr Latn<br>lang kam ltru cldr Latn
<br>lang kcg ltru cldr Latn<br>lang kfo ltru cldr Latn<br>lang kpe ltru cldr Latn<br>lang sid ltru cldr Latn<br>lang sma ltru cldr Latn<br>lang smi ltru cldr Latn<br>lang smj ltru cldr Latn
<br>lang smn ltru cldr Latn<br>lang sms ltru cldr Latn<br>lang syr ltru cldr Syrc<br>lang tet ltru cldr Latn<br>lang tig ltru cldr Ethi<br>lang wal ltru cldr Ethi<br><br><br>_______________________________________________
<br>Ietf-languages mailing list<br><a href="mailto:Ietf-languages@alvestrand.no">Ietf-languages@alvestrand.no</a><br><a href="http://www.alvestrand.no/mailman/listinfo/ietf-languages">http://www.alvestrand.no/mailman/listinfo/ietf-languages
</a><br></blockquote></div><br>