Allowed characters (was: Re: Casefolding Sigma (was: Re:
IDNAbis Preprocessing Draft)
kenw at sybase.com
Wed Mar 26 20:56:06 CET 2008
> I really just wanted a list of
Just go to:
and scroll down and read the list.
That table is due for another imminent update, but it is
unlikely to affect anything except the few format
control characters (e.g. U+0600 ARABIC NUMBER SIGN),
which will go from CONTEXTO (neither "in" or "out",
but requires a context rule), to simply DISALLOWED.
> Which Arabic letters were in and which were out.
All of them are PVALID ("in"), except the few which have
decompositions (e.g. 0675..0678).
> Which Arabic diacritics were in and which were out.
All of them are PVALID ("in").
> Which punctuation and symbols in the Arabic block were in and which were out.
All of them are DISALLOWED ("out").
> Sorry if this is too complicated.
No, it's pretty easy. ;-)
> Michael Everson * http://www.evertype.com
More information about the Idna-update