Allowed characters (was: Re: Casefolding Sigma (was: Re:
IDNAbis Preprocessing Draft)
Kenneth Whistler
kenw at sybase.com
Wed Mar 26 20:56:06 CET 2008
Michael,
> I really just wanted a list of
Just go to:
http://www.ietf.org/internet-drafts/draft-faltstrom/idnabis-tables-05.txt
and scroll down and read the list.
That table is due for another imminent update, but it is
unlikely to affect anything except the few format
control characters (e.g. U+0600 ARABIC NUMBER SIGN),
which will go from CONTEXTO (neither "in" or "out",
but requires a context rule), to simply DISALLOWED.
>
> Which Arabic letters were in and which were out.
All of them are PVALID ("in"), except the few which have
decompositions (e.g. 0675..0678).
>
> Which Arabic diacritics were in and which were out.
All of them are PVALID ("in").
>
> Which punctuation and symbols in the Arabic block were in and which were out.
All of them are DISALLOWED ("out").
>
> Sorry if this is too complicated.
No, it's pretty easy. ;-)
--Ken
> --
> Michael Everson * http://www.evertype.com
More information about the Idna-update
mailing list