Allowed characters (was: Re: Casefolding Sigma (was: Re:
IDNAbis Preprocessing Draft)
everson at evertype.com
Wed Mar 26 21:09:18 CET 2008
At 12:56 -0700 2008-03-26, Kenneth Whistler wrote:
> > Which Arabic letters were in and which were out.
>All of them are PVALID ("in"), except the few which have
>decompositions (e.g. 0675..0678).
So DOTLESS BEH and DOTLESS QAF are in.
> > Which Arabic diacritics were in and which were out.
>All of them are PVALID ("in").
So all the Qur'anic diacritics are in.
> > Which punctuation and symbols in the Arabic block were in and
>which were out.
>All of them are DISALLOWED ("out").
>> Sorry if this is too complicated.
>No, it's pretty easy. ;-)
Michael Everson * http://www.evertype.com
More information about the Idna-update