[Suppress-Script] Initial list of 300 languages
mark.davis at icu-project.org
Mon Mar 13 21:55:37 CET 2006
Thanks to Erik van der Poel, there are some preliminary* results that
may shed some light on a couple of issues that people have raised.
Cherokee pages show about 5 times as many characters in Latin as in
Cherokee. However, the sample is very small, and web presence may be
hindered by lack of fonts.
Korean pages show about 230 times as many characters in Hangul as in
Han. The sample here is much larger and thus more reliable.
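For anyone who wants to try a similar count on their own samples, here is a rough sketch of how a characters-per-script ratio could be computed. It is not the method used for the results above. It assumes that the prefix of a character's Unicode name is a good enough stand-in for its script, which is only a heuristic and not the real Unicode Script property. The names SCRIPT_PREFIXES, script_counts and script_ratio are made up for illustration.

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from Unicode character-name prefixes to script labels.
# This is a heuristic, not the Unicode Script property.
SCRIPT_PREFIXES = {
    "LATIN": "Latin",
    "CHEROKEE": "Cherokee",
    "HANGUL": "Hangul",
    "CJK UNIFIED IDEOGRAPH": "Han",
}


def script_counts(text):
    """Count characters per script; characters matching no prefix are skipped."""
    counts = Counter()
    for ch in text:
        name = unicodedata.name(ch, "")  # "" for unnamed characters
        for prefix, script in SCRIPT_PREFIXES.items():
            if name.startswith(prefix):
                counts[script] += 1
                break
    return counts


def script_ratio(text, numerator, denominator):
    """Return count(numerator) / count(denominator), or None if undefined."""
    counts = script_counts(text)
    if counts[denominator] == 0:
        return None
    return counts[numerator] / counts[denominator]
```

Calling script_ratio(page_text, "Hangul", "Han") on the concatenated text of a set of pages would give a figure comparable in kind to the ones quoted above, subject to the same caveats about how the pages were selected.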
* There are definitely some caveats here. Because the tagging of
document language is so haphazard on the web, it has to be supplemented
by language detection.