frr, fy, ngo, tt

Mark Davis mark.davis at icu-project.org
Thu Sep 28 00:39:49 CEST 2006


If you are going to analyze the differences, please also take a look at my
earlier message:

From: Mark Davis <mark.davis at icu-project.org>
Mailed-By: gmail.com
Subject: Re: Languages and Scripts

which contained a listing of differences between the CLDR data and other
Unicode data that we would like to reconcile. I repeat the data below.

In particular, I had asked on this list if anyone knew anything about the
couple of dozen purported languages at the end for which Magda couldn't find
639-3 language codes. Hearing no response, we will probably just remove them
from the Unicode site as bogus.

[abq] ; Abaza ; [Cyrl] ;
[ady] ; Adygei ;        [Cyrl] ;
[ain] ; Ainu ;  [Kana]; [Latn] ;
[amo] ; Amo ;   [Latn] ;
[av] ;  Avar (Avaric?) ;        [Cyrl] ;
[awa] ; Awadhi ;        [Deva] ;
[ba] ;  Bashkir ;       [Cyrl] ;
[bbc] ; Batak toba ;    [Batk]*; [Latn] ;
[bfq] ; Badaga ;        [Taml] ;
[bft] ; Balti ; [Deva] ;        [Removed Balti]
[bfy] ; Bagheli ;       [Deva] ;
[bh] ;  Bihari ;        [Deva] ;
[bhb] ; Bhili ; [Deva] ;
[bho] ; Bhojpuri ;      [Deva] ;
[bjj] ; Kanauji ;       [Deva] ;
[bku] ; Buhid ; [Buhd] ;
[br] ;  Breton ;        [Latn] ;
[bra] ; Braj bhasha ;   [Deva] ;
[btk] ; Batak ; [Batk]*, [Latn] ;
[btv] ; Bateri (aka Bhatneri)  ;        [Deva] ;
[ccp] ; Chakma ;        [Beng]; ;       [Removed Chakma]
[ce] ;  Chechen ;       [Cyrl] ;
[chm] ; Mari ;  [Cyrl]; [Latn] ;
[cjs] ; Shor ;  [Cyrl] ;
[co] ;  Corsican ;      [Latn] ;
[cop] ; Coptic ;        [Arab]; [Copt] ;        "[Added Grek, but is that right now?]"
[cr] ;  Cree ;  [Cans]; [Latn] ;
[cv] ;  Chuvash ;       [Cyrl] ;
[dar] ; Dargwa ;        [Cyrl] ;
[en] ;  English ;       [Latn] ;        "[Had Shavian and Deseret, but those never had any significant usage]"
[evn] ; Evenki ;        [Cyrl] ;
[gag] ; Gagauz ;        [Cyrl] ;
[gbm] ; Garhwali ;      [Deva]
[gd] ;  Gaelic ;        [Latn]
[gld] ; Nanai ; [Cyrl]
[gon] ; Gondi ; [Deva]; [Telu]
[grt] ; Garo ;  [Beng]
[hmn] ; Hmong ; [Latn]; [Hmng]*
[hnn] ; Hanunóo ;       [Latn]; [Hano]
[hoc] ; Ho ;    [Deva]
[hoj] ; Harauti ;       [Deva]
[hop] ; Hopi ;  [Latn]
[hy] ;  Armenian ;      [Armn]; [Syrc]*
[ibb] ; Ibibio ;        [Latn]
[id] ;  Indonesian ;    "[Arab]*, [Latn]"
[ik] ;  Iñupiaq ;       [Latn]
[inh] ; Ingush ;        [Arab]; [Latn]
[jv] ;  Javanese ;      [Latn]; [Java]*
[kaa] ; Karakalpak ;    [Cyrl]
[kac]? ;        Kachchi ;       [Deva]
[kbd] ; Kabardian ;     [Cyrl]
[kca] ; Khanty ;        [Cyrl]
[kdt] ; Kuy ;   Thai
[kha] ; Khasi ; [Latn]; [Beng]
[kht] ; Khamti ;        [Mymr]
[kr] ;  Kanuri ;        [Latn]
[krc] ; Karachay ;      [Cyrl]
[krl] ; Karelian ;      [Latn]; [Cyrl]
[kv] ;  Komi ;  [Cyrl]; [Latn]
[ky] ;  Kirghiz ;       [Arab]*; [Latn]; [Cyrl]
[lad] ; Ladino ;        [Hebr]
[lbe] ; Lak ;   [Cyrl]
[lcp] ; "Lawa, western" ;       Thai
[lep] ; Lepcha ;        [Lepc]*
[lez] ; Lezghian (Lezghi?) ;    [Cyrl]
[li]? ; Limbu ; [Deva]; [Limb]
[lis] ; Lisu ;  "Lisu (Fraser)*, [Latn]"
[lmn] ; Lambadi ;       [Telu]
[lut] ; Lushootseed ;   [Latn]
[lwl] ; "Lawa, eastern" ;       Thai
[mnc] ; Manchu ;        [Mong]
[mni] ; Meitei ;        "Meetai Mayek*, [Beng]"
[mns] ; Mansi ; [Cyrl]
[mnw] ; Mon ;   [Mymr]
[muw] ; Mundari ;       [Beng]; [Deva]
[mwr] ; Marwari ;       [Deva]
[nbf] ; Naxi ;  Naxi*
[new] ; Newari ;        "[Deva]; Ranjana, Parachalit"
[nog] ; Nogai ; [Cyrl]
[nv] ;  Navajo ;        [Latn]
[om] ;  Oromo ; [Ethi]*; [Latn] ;       "[According to Wikipedia, Ethi usage is old]"
[os] ;  Ossetic ;       [Cyrl]; [Latn] ;
[pi] ;  Pali ;  [Sinh]; [Deva]; [Thai] ;
[prd] ; Parsi-dari ;    [Arab] ;
[prg] ; Prussian ;      [Latn] ;
[ro] ;  Romanian ;      [Latn]; [Cyrl]* ;
[rom] ; Romany ;        [Cyrl]; [Latn] ;
[sa] ;  Sanskrit ;      [Sinh]; [Deva]; etc. ;  [CLDR doesn't do 'etc': what is the exact list: all modern Indic scripts + Sinhala?]
[sah] ; Yakut ; [Cyrl] ;
[sat] ; Santali ;       [Deva]; [Beng]; [Orya]; [Olck]* ;
[sel] ; Selkup ;        [Cyrl] ;
[shn] ; Shan ;  [Mymr] ;
[smi] ; Sami ;  [Cyrl]; [Latn] ;        [Is Cyrl common?]
[smp] ; Samaritan ;     [Hebr] ;        [Removed Samaritan*]
[sn] ;  Shona ; [Latn] ;
[syl] ; Sylhetti ;      [Sylo]; [Beng] ;
[tab] ; Tabasaran ;     [Cyrl] ;
[tbw] ; Tagbanwa ;      [Latn];  ;      [Removed Tagbanwa]
[tcy] ; Tulu ;  [Knda] ;
[tl] ;  Tagalog ;       [Latn]; [Tglg] ;        [It appears that Tglg is not in modern use]
[tru] ; Turoyo ;        [Syrc] ;
[ttt] ; Tat ;   [Cyrl] ;
[tut] ; Altai (Altaic?) ;       [Cyrl] ;
[ty] ;  Tahitian ;      [Latn] ;
[udm] ; Udmurt ;        [Cyrl]; [Latn] ;
[ug] ;  Uighur ;        [Arab]; [Latn]; [Cyrl] ;        "[adds Latn, Cyrl; is the latter common?  Removed Uighur]"
[vi] ;  Vietnamese ;    [Latn]; Chu Nom ;       [Chu Nom would be Hani*]
[xal] ; Kalmyk ;        [Cyrl] ;
[xsr] ; Sherpa ;        [Deva] ;
[yrk] ; Nenets ;        [Cyrl] ;

# Missing Language Codes

??? ;   Aisor ; [Cyrl]
??? ;   Assyrian (modern) ;     [Syrc]
??? ;   Bahasa ;        [Latn]
??? ;   Balear ;        [Latn]
??? ;   Balkar ;        [Cyrl]
??? ;   Bugis ; [Bugi]
??? ;   Buryat ;        [Cyrl]
??? ;   Cham ;  [Cham]*
??? ;   Chhattisgarhi ; [Deva]
??? ;   Chukchi ;       [Cyrl]
??? ;   Dungan ;        [Cyrl]
??? ;   Edo ;   [Latn]
??? ;   Garshuni ;      [Syrc]
??? ;   Gascon ;        [Latn]
??? ;   Judezmo ;       [Hebr]
??? ;   Kankan ;        [Deva]
??? ;   Khakass ;       [Cyrl]
??? ;   Koryak ;        [Cyrl]
??? ;   Lapp ;  [Latn]
??? ;   Mordvin ;       [Cyrl]
??? ;   Naga ;  [Latn]; [Beng]
??? ;   Riang ; [Beng]
??? ;   Swadaya ;       [Syrc]
??? ;   Tamazight ;     [Tfng], [Latn]
??? ;   Tsalagi ;       (see Cherokee)
??? ;   Tuva ;  [Cyrl]
??? ;   Udekhe ;        [Cyrl]
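
For anyone who wants to process the listing above mechanically, here is a
minimal Python sketch. It is not a CLDR tool. It assumes each entry sits on
one line, that fields are separated by semicolons, and that only bracketed
four-letter codes such as [Cyrl] count as scripts. The names Entry and
parse_entry are hypothetical, and the sketch records the trailing asterisk
without interpreting it.

```python
import re
from typing import List, NamedTuple, Optional

# Bracketed four-letter script code, optionally followed by an asterisk.
SCRIPT_RE = re.compile(r"\[([A-Z][a-z]{3})\](\*?)")
# Bracketed two- or three-letter language code at the start of the first field.
CODE_RE = re.compile(r"\[([a-z]{2,3})\]")


class Entry(NamedTuple):
    code: Optional[str]   # None when the listing shows "???"
    name: str
    scripts: List[str]    # bracketed script codes, in listing order
    starred: List[str]    # subset of scripts that carried an asterisk


def parse_entry(line: str) -> Optional[Entry]:
    """Parse one listing line; return None for lines that are not entries."""
    fields = [f.strip() for f in line.split(";")]
    if len(fields) < 3:
        return None
    match = CODE_RE.match(fields[0])
    code = match.group(1) if match else None
    name = fields[1].strip('"')
    scripts: List[str] = []
    starred: List[str] = []
    # Free-text notes do not match SCRIPT_RE, so they are skipped here.
    # Unbracketed script names (e.g. a bare "Thai") are skipped as well.
    for field in fields[2:]:
        for script, star in SCRIPT_RE.findall(field):
            scripts.append(script)
            if star:
                starred.append(script)
    return Entry(code, name, scripts, starred)
```

Entries whose code comes back as None are the ones listed under "Missing
Language Codes". A "?" after a code, as in "[kac]?", is dropped by this
sketch and would need separate handling.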

On 9/27/06, Frank Ellermann <nobody at xyzzy.claranet.de> wrote:
>
> John Cowan wrote:
>
> > I'll post (some of) the current state of things shortly.
>
> Here's a list of 67 differences between LTRU and CLDR 1.4
>
> Lines where LTRU has no script and CLDR more than one are
> most probably sane, but anything else should be checked:
>
> lang aa  ltru      cldr Latn
> lang ak  ltru      cldr Latn
> lang az  ltru      cldr Arab Cyrl Latn
> lang bo  ltru      cldr Tibt
> lang cr  ltru      cldr Cans Latn
> lang ee  ltru      cldr Latn
> lang fy  ltru      cldr Latn
> lang gd  ltru      cldr Latn
> lang ha  ltru      cldr Arab Latn
> lang ho  ltru      cldr Latn
> lang ia  ltru      cldr Latn
> lang ig  ltru      cldr Latn
> lang iu  ltru      cldr Cans Cyrl Latn
> lang ja  ltru      cldr Hani Hira Kana
> lang ko  ltru      cldr Hang Hani
> lang ks  ltru      cldr Arab Deva
> lang ku  ltru      cldr Arab Cyrl Latn
> lang kw  ltru      cldr Latn
> lang ky  ltru      cldr Arab Cyrl
> lang mi  ltru      cldr Latn
> lang mn  ltru      cldr Cyrl Mong
> lang mo  ltru Latn cldr Cyrl Latn
> lang ms  ltru Latn cldr Arab Latn
> lang oc  ltru      cldr Latn
> lang os  ltru      cldr Latn
> lang pa  ltru Guru cldr Arab Guru
> lang rm  ltru      cldr Latn
> lang sa  ltru      cldr Deva
> lang sd  ltru      cldr Arab Deva
> lang se  ltru      cldr Latn
> lang sh  ltru      cldr Latn
> lang sr  ltru      cldr Cyrl Latn
> lang tg  ltru      cldr Arab Cyrl Latn
> lang tk  ltru      cldr Arab Cyrl Latn
> lang tr  ltru Latn cldr Arab Latn
> lang tt  ltru      cldr Cyrl
> lang ug  ltru      cldr Arab
> lang uz  ltru      cldr Arab Cyrl Latn
> lang yo  ltru      cldr Latn
> lang zh  ltru      cldr Bopo Hani Hans Hant
> lang bal ltru      cldr Arab Latn
> lang byn ltru      cldr Ethi
> lang cch ltru      cldr Latn
> lang chr ltru      cldr Cher Latn
> lang cop ltru      cldr Arab Copt
> lang fil ltru      cldr Latn
> lang fiu ltru      cldr Latn
> lang fur ltru      cldr Latn
> lang gaa ltru      cldr Latn
> lang gez ltru      cldr Ethi
> lang gsw ltru      cldr Latn
> lang haw ltru      cldr Latn
> lang kaj ltru      cldr Latn
> lang kam ltru      cldr Latn
> lang kcg ltru      cldr Latn
> lang kfo ltru      cldr Latn
> lang kpe ltru      cldr Latn
> lang sid ltru      cldr Latn
> lang sma ltru      cldr Latn
> lang smi ltru      cldr Latn
> lang smj ltru      cldr Latn
> lang smn ltru      cldr Latn
> lang sms ltru      cldr Latn
> lang syr ltru      cldr Syrc
> lang tet ltru      cldr Latn
> lang tig ltru      cldr Ethi
> lang wal ltru      cldr Ethi
>
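
The rule of thumb in the quoted message (a line is most probably sane when
LTRU has no script and CLDR has more than one, and anything else should be
checked) could be applied to the quoted lines with a sketch like the one
below. It assumes the exact "lang TAG ltru ... cldr ..." layout shown above.
The function names parse_diff_line and needs_check are hypothetical.

```python
from typing import List, Tuple


def parse_diff_line(line: str) -> Tuple[str, List[str], List[str]]:
    """Split one 'lang xx ltru ... cldr ...' line into (tag, ltru, cldr)."""
    # Drop any leading quote marker ("> ") before tokenizing.
    tokens = line.lstrip("> ").split()
    if (len(tokens) < 4 or tokens[0] != "lang"
            or tokens[2] != "ltru" or "cldr" not in tokens):
        raise ValueError(f"unexpected line format: {line!r}")
    split = tokens.index("cldr")
    return tokens[1], tokens[3:split], tokens[split + 1:]


def needs_check(ltru: List[str], cldr: List[str]) -> bool:
    """True unless LTRU lists no script and CLDR lists more than one."""
    return not (len(ltru) == 0 and len(cldr) > 1)


if __name__ == "__main__":
    samples = [
        "> lang az  ltru      cldr Arab Cyrl Latn",
        "> lang mo  ltru Latn cldr Cyrl Latn",
        "> lang aa  ltru      cldr Latn",
    ]
    for sample in samples:
        tag, ltru, cldr = parse_diff_line(sample)
        status = "check" if needs_check(ltru, cldr) else "probably sane"
        print(tag, status)
```

Under this reading, "az" falls in the probably-sane group, while "mo" and
"aa" are flagged for checking.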