[Suppress-Script] Initial list of 300 languages

Mark Davis mark.davis at icu-project.org
Mon Mar 13 21:55:37 CET 2006


Thanks to Erik van der Poel, there are some preliminary* results that 
may shed some light on a couple of issues that people have raised.

Cherokee pages show about 5 times as many characters in Latin as in 
Cherokee. However, the sample is very small, and web presence may be 
hindered by lack of fonts.

Korean pages show about 230 times as many characters in Hangul as in 
Han. The sample here is much larger and thus more reliable.
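
For anyone who wants to try a similar per-script count on their own 
sample, here is a minimal sketch in Python. It is not the method used 
to produce the figures above. It assumes that the prefix of a 
character's Unicode name is a good enough stand-in for its script, 
which is only a heuristic (Python's standard library does not expose 
the Script property). The names PREFIXES, script_of, script_counts and 
ratio are made up for this illustration.

```python
import unicodedata
from collections import Counter

# Assumption: map Unicode character-name prefixes to script labels.
# This is a rough heuristic, not the Unicode Script property.
PREFIXES = (
    ("CHEROKEE", "Cherokee"),
    ("HANGUL", "Hangul"),
    ("CJK UNIFIED IDEOGRAPH", "Han"),
    ("CJK COMPATIBILITY IDEOGRAPH", "Han"),
    ("LATIN", "Latin"),
)


def script_of(ch):
    """Return a script label for ch, or None if it is not in PREFIXES."""
    name = unicodedata.name(ch, "")
    for prefix, script in PREFIXES:
        if name.startswith(prefix):
            return script
    return None


def script_counts(text):
    """Count characters per script, skipping digits, spaces, punctuation."""
    counts = Counter()
    for ch in text:
        script = script_of(ch)
        if script is not None:
            counts[script] += 1
    return counts


def ratio(counts, a, b):
    """Return count(a) / count(b), or None if script b never occurs."""
    return counts[a] / counts[b] if counts[b] else None


if __name__ == "__main__":
    sample = "\ud55c\uad6d\uc5b4 \u6f22\u5b57 abc"  # Hangul, Han, Latin
    counts = script_counts(sample)
    print(dict(counts), ratio(counts, "Hangul", "Han"))
```

Summing such counts over all pages detected as one language and taking 
the ratio of two scripts gives a figure of the same kind as the ones 
quoted above.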

Mark

* There are definitely some caveats here. Because language tagging of 
documents on the web is so haphazard, the tags have to be supplemented 
by language detection.

