Allowed characters (was: Re: Casefolding Sigma (was: Re: IDNAbis Preprocessing Draft)

Kenneth Whistler kenw at sybase.com
Wed Mar 26 20:56:06 CET 2008


Michael,

> I really just wanted a list of

Just go to:

http://www.ietf.org/internet-drafts/draft-faltstrom/idnabis-tables-05.txt

and scroll down and read the list.

That table is due for another imminent update, but it is
unlikely to affect anything except the few format
control characters (e.g. U+0600 ARABIC NUMBER SIGN),
which will go from CONTEXTO (neither "in" or "out",
but requires a context rule), to simply DISALLOWED.

> 
> Which Arabic letters were in and which were out.

All of them are PVALID ("in"), except the few which have
decompositions (e.g. 0675..0678).

> 
> Which Arabic diacritics were in and which were out.

All of them are PVALID ("in").

> 
> Which punctuation and symbols in the Arabic block were in and which were out.

All of them are DISALLOWED ("out").

> 
> Sorry if this is too complicated.

No, it's pretty easy. ;-)

--Ken

> -- 
> Michael Everson * http://www.evertype.com



More information about the Idna-update mailing list