frr, fy, ngo, tt
Mark Davis
mark.davis at icu-project.org
Thu Sep 28 00:39:49 CEST 2006
If you are going to do any analysis of the differences, please take a look
also at my message of
From: *Mark Davis <mark.davis at icu-project.org>*Mailed-By: *gmail.com*
Subject: *Re: Languages and Scripts*
which contained a listing of differences between the CLDR data and other
Unicode data that we would like to reconcile. I repeat the data below.
In particular, I had asked on this list if anyone knew anything about the
couple of dozen purported languages at the end for which Magda couldn't find
639-3 language codes. Hearing no response, we will probably just remove them
from the Unicode site as bogus.
[abq] ; Abaza ; [Cyrl] ;
[ady] ; Adygei ; [Cyrl] ;
[ain] ; Ainu ; [Kana]; [Latn] ;
[amo] ; Amo ; [Latn] ;
[av] ; Avar (Avaric?) ; [Cyrl] ;
[awa] ; Awadhi ; [Deva] ;
[ba] ; Bashkir ; [Cyrl] ;
[bbc] ; Batak toba ; [Batk]*; [Latn] ;
[bfq] ; Badaga ; [Taml] ;
[bft] ; Balti ; [Deva] ; [Removed Balti]
[bfy] ; Bagheli ; [Deva] ;
[bh] ; Bihari ; [Deva] ;
[bhb] ; Bhili ; [Deva] ;
[bho] ; Bhojpuri ; [Deva] ;
[bjj] ; Kanauji ; [Deva] ;
[bku] ; Buhid ; [Buhd] ;
[br] ; Breton ; [Latn] ;
[bra] ; Braj bhasha ; [Deva] ;
[btk] ; Batak ; [Batk]*, [Latn] ;
[btv] ; Bateri (aka Bhatneri) ; [Deva] ;
[ccp] ; Chakma ; [Beng]; ; [Removed Chakma]
[ce] ; Chechen ; [Cyrl] ;
[chm] ; Mari ; [Cyrl]; [Latn] ;
[cjs] ; Shor ; [Cyrl] ;
[co] ; Corsican ; [Latn] ;
[cop] ; Coptic ; [Arab]; [Copt] ; "[Added Grek, but is that
right now?]"
[cr] ; Cree ; [Cans]; [Latn] ;
[cv] ; Chuvash ; [Cyrl] ;
[dar] ; Dargwa ; [Cyrl] ;
[en] ; English ; [Latn] ; "[Had Shavian and Deseret, but those
never
had any significant usage]"
[evn] ; Evenki ; [Cyrl] ;
[gag] ; Gagauz ; [Cyrl] ;
[gbm] ; Garhwali ; [Deva]
[gd] ; Gaelic ; [Latn]
[gld] ; Nanai ; [Cyrl]
[gon] ; Gondi ; [Deva]; [Telu]
[grt] ; Garo ; [Beng]
[hmn] ; Hmong ; [Latn]; [Hmng]*
[hnn] ; Hanunóo ; [Latn]; [Hano]
[hoc] ; Ho ; [Deva]
[hoj] ; Harauti ; [Deva]
[hop] ; Hopi ; [Latn]
[hy] ; Armenian ; [Armn]; [Syrc]*
[ibb] ; Ibibio ; [Latn]
[id] ; Indonesian ; "[Arab]*, [Latn]"
[ik] ; Iñupiaq ; [Latn]
[inh] ; Ingush ; [Arab]; [Latn]
[jv] ; Javanese ; [Latn]; [Java]*
[kaa] ; Karakalpak ; [Cyrl]
[kac]? ; Kachchi ; [Deva]
[kbd] ; Kabardian ; [Cyrl]
[kca] ; Khanty ; [Cyrl]
[kdt] ; Kuy ; Thai
[kha] ; Khasi ; [Latn]; [Beng]
[kht] ; Khamti ; [Mymr]
[kr] ; Kanuri ; [Latn]
[krc] ; Karachay ; [Cyrl]
[krl] ; Karelian ; [Latn]; [Cyrl]
[kv] ; Komi ; [Cyrl]; [Latn]
[ky] ; Kirghiz ; [Arab]*; [Latn]; [Cyrl]
[lad] ; Ladino ; [Hebr]
[lbe] ; Lak ; [Cyrl]
[lcp] ; "Lawa, western" ; Thai
[lep] ; Lepcha ; [Lepc]*
[lez] ; Lezghian (Lezghi?) ; [Cyrl]
[li]? ; Limbu ; [Deva]; [Limb]
[lis] ; Lisu ; "Lisu (Fraser)*, [Latn]"
[lmn] ; Lambadi ; [Telu]
[lut] ; Lushootseed ; [Latn]
[lwl] ; "Lawa, eastern" ; Thai
[mnc] ; Manchu ; [Mong]
[mni] ; Meitei ; "Meetai Mayek*, [Beng]"
[mns] ; Mansi ; [Cyrl]
[mnw] ; Mon ; [Mymr]
[muw] ; Mundari ; [Beng]; [Deva]
[mwr] ; Marwari ; [Deva]
[nbf] ; Naxi ; Naxi*
[new] ; Newari ; "[Deva]; Ranjana, Parachalit"
[nog] ; Nogai ; [Cyrl]
[nv] ; Navajo ; [Latn]
[om] ; Oromo ; [Ethi]*; [Latn] ; "[According to wikipedia, Ethi usage
is old"
[os] ; Ossetic ; [Cyrl]; [Latn] ;
[pi] ; Pali ; [Sinh]; [Deva]; [Thai] ;
[prd] ; Parsi-dari ; [Arab] ;
[prg] ; Prussian ; [Latn] ;
[ro] ; Romanian ; [Latn]; [Cyrl]* ;
[rom] ; Romany ; [Cyrl]; [Latn] ;
[sa] ; Sanskrit ; [Sinh]; [Deva]; etc. ; [CLDR doesn't do 'etc': what
is the exact list: all modern Indic scripts + Sinhala?]
[sah] ; Yakut ; [Cyrl] ;
[sat] ; Santali ; [Deva]; [Beng]; [Orya]; [Olck]* ;
[sel] ; Selkup ; [Cyrl] ;
[shn] ; Shan ; [Mymr] ;
[smi] ; Sami ; [Cyrl]; [Latn] ; [Is Cyrl common?]
[smp] ; Samaritan ; [Hebr] ; [Removed Samaritan*
[sn] ; Shona ; [Latn] ;
[syl] ; Sylhetti ; [Sylo]; [Beng] ;
[tab] ; Tabasaran ; [Cyrl] ;
[tbw] ; Tagbanwa ; [Latn]; ; [Removed Tagbanwa]
[tcy] ; Tulu ; [Knda] ;
[tl] ; Tagalog ; [Latn]; [Tglg] ; [It appears that Tglg is not
in modern use]
[tru] ; Turoyo ; [Syrc] ;
[ttt] ; Tat ; [Cyrl] ;
[tut] ; Altai (Altaic?) ; [Cyrl] ;
[ty] ; Tahitian ; [Latn] ;
[udm] ; Udmurt ; [Cyrl]; [Latn] ;
[ug] ; Uighur ; [Arab]; [Latn]; [Cyrl] ; "[adds Latn, Cyrl;
is the
latter common? Removed Uighur]"
[vi] ; Vietnamese ; [Latn]; Chu Nom ; [Chu Nom would be Hani*]
[xal] ; Kalmyk ; [Cyrl] ;
[xsr] ; Sherpa ; [Deva] ;
[yrk] ; Nenets ; [Cyrl] ;
# Missing Language Codes
??? ; Aisor ; [Cyrl]
??? ; Assyrian (modern) ; [Syrc]
??? ; Bahasa ; [Latn]
??? ; Balear ; [Latn]
??? ; Balkar ; [Cyrl]
??? ; Bugis ; [Bugi]
??? ; Buryat ; [Cyrl]
??? ; Cham ; [Cham]*
??? ; Chhattisgarhi ; [Deva]
??? ; Chukchi ; [Cyrl]
??? ; Dungan ; [Cyrl]
??? ; Edo ; [Latn]
??? ; Garshuni ; [Syrc]
??? ; Gascon ; [Latn]
??? ; Judezmo ; [Hebr]
??? ; Kankan ; [Deva]
??? ; Khakass ; [Cyrl]
??? ; Koryak ; [Cyrl]
??? ; Lapp ; [Latn]
??? ; Mordvin ; [Cyrl]
??? ; Naga ; [Latn]; [Beng]
??? ; Riang ; [Beng]
??? ; Swadaya ; [Syrc]
??? ; Tamazight ; [Tfng], [Latn]
??? ; Tsalagi ; (see Cherokee)
??? ; Tuva ; [Cyrl]
??? ; Udekhe ; [Cyrl]
On 9/27/06, Frank Ellermann <nobody at xyzzy.claranet.de> wrote:
>
> John Cowan wrote:
>
> > I'll post (some of) the current state of things shortly.
>
> Here's a list of 67 differences between LTRU and CLDR 1.4
>
> Lines where LTRU has no script and CLDR more than one are
> most probably sane, but anything else should be checked:
>
> lang aa ltru cldr Latn
> lang ak ltru cldr Latn
> lang az ltru cldr Arab Cyrl Latn
> lang bo ltru cldr Tibt
> lang cr ltru cldr Cans Latn
> lang ee ltru cldr Latn
> lang fy ltru cldr Latn
> lang gd ltru cldr Latn
> lang ha ltru cldr Arab Latn
> lang ho ltru cldr Latn
> lang ia ltru cldr Latn
> lang ig ltru cldr Latn
> lang iu ltru cldr Cans Cyrl Latn
> lang ja ltru cldr Hani Hira Kana
> lang ko ltru cldr Hang Hani
> lang ks ltru cldr Arab Deva
> lang ku ltru cldr Arab Cyrl Latn
> lang kw ltru cldr Latn
> lang ky ltru cldr Arab Cyrl
> lang mi ltru cldr Latn
> lang mn ltru cldr Cyrl Mong
> lang mo ltru Latn cldr Cyrl Latn
> lang ms ltru Latn cldr Arab Latn
> lang oc ltru cldr Latn
> lang os ltru cldr Latn
> lang pa ltru Guru cldr Arab Guru
> lang rm ltru cldr Latn
> lang sa ltru cldr Deva
> lang sd ltru cldr Arab Deva
> lang se ltru cldr Latn
> lang sh ltru cldr Latn
> lang sr ltru cldr Cyrl Latn
> lang tg ltru cldr Arab Cyrl Latn
> lang tk ltru cldr Arab Cyrl Latn
> lang tr ltru Latn cldr Arab Latn
> lang tt ltru cldr Cyrl
> lang ug ltru cldr Arab
> lang uz ltru cldr Arab Cyrl Latn
> lang yo ltru cldr Latn
> lang zh ltru cldr Bopo Hani Hans Hant
> lang bal ltru cldr Arab Latn
> lang byn ltru cldr Ethi
> lang cch ltru cldr Latn
> lang chr ltru cldr Cher Latn
> lang cop ltru cldr Arab Copt
> lang fil ltru cldr Latn
> lang fiu ltru cldr Latn
> lang fur ltru cldr Latn
> lang gaa ltru cldr Latn
> lang gez ltru cldr Ethi
> lang gsw ltru cldr Latn
> lang haw ltru cldr Latn
> lang kaj ltru cldr Latn
> lang kam ltru cldr Latn
> lang kcg ltru cldr Latn
> lang kfo ltru cldr Latn
> lang kpe ltru cldr Latn
> lang sid ltru cldr Latn
> lang sma ltru cldr Latn
> lang smi ltru cldr Latn
> lang smj ltru cldr Latn
> lang smn ltru cldr Latn
> lang sms ltru cldr Latn
> lang syr ltru cldr Syrc
> lang tet ltru cldr Latn
> lang tig ltru cldr Ethi
> lang wal ltru cldr Ethi
>
>
> _______________________________________________
> Ietf-languages mailing list
> Ietf-languages at alvestrand.no
> http://www.alvestrand.no/mailman/listinfo/ietf-languages
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.alvestrand.no/pipermail/ietf-languages/attachments/20060927/152460e5/attachment-0001.html
More information about the Ietf-languages
mailing list