el-latn, ru-latn, and related possibilities
Caoimhin O Donnaile
caoimhin at smo.uhi.ac.uk
Fri Oct 7 14:04:36 CEST 2005
On Fri, 7 Oct 2005, John.Cowan wrote:
> Saying something is in the Xxxx script does not mean that it includes only
> Unicode characters whose script property is Xxxx. In particular,
> Kurdish in Cyrillic requires the Latin letters Q/q/W/w.
That's interesting. To combat phishers registering lookalike
"international" (i.e. non-ASCII) domain names (with Cyrillic 'r'
in place of Latin 'p', for example) a common expedient is to ban
mixed script IDNs. It seems that this expedient might give the
Kurds some problems.
> The Turkmen orthography uses things that aren't even Unicode letters,
> like the cent sign and the (British) pound sign, but that doesn't mean
> it isn't basically in Latin script.
And Breton uses an apostrophe, in the combination "c'h". If I
understand things correctly they'll just have to do without it
in IDNs.
(Sorry, I am getting off-topic)
Caoimhín
More information about the Ietf-languages
mailing list