Allowed characters (was: Re: Casefolding Sigma (was: Re:
IDNAbis Preprocessing Draft)
Patrik Fältström
patrik at frobbit.se
Sun Mar 30 13:50:01 CEST 2008
On 30 mar 2008, at 13.04, Michael Everson wrote:
> At 12:36 +0200 2008-03-30, Patrik Fältström wrote:
>
>>> I see that this character is allowed:
>>>
>>> FE73 ; PVALID # ARABIC TAIL FRAGMENT
>>>
>>> Why?
>>
>> Because it is general category Lo.
>>
>> FE73;ARABIC TAIL FRAGMENT;Lo;0;AL;;;;;N;;;;;
>
> But
>
> FE70..FE72 ; DISALLOWED # ARABIC FATHATAN ISOLATED FORM..ARABIC
> DAMMAT
>
> also have the general category Lo, and they are disallowed.
Correct, because they are not stable under casefolding/NFKC. (i.e.
matches both category A and B in the tables document).
Patrik
More information about the Idna-update
mailing list