Allowed characters (was: Re: Casefolding Sigma (was: Re: IDNAbis Preprocessing Draft)

Patrik Fältström patrik at frobbit.se
Sun Mar 30 13:50:01 CEST 2008


On 30 mar 2008, at 13.04, Michael Everson wrote:

> At 12:36 +0200 2008-03-30, Patrik Fältström wrote:
>
>>> I see that this character is allowed:
>>>
>>> FE73        ; PVALID      # ARABIC TAIL FRAGMENT
>>>
>>> Why?
>>
>> Because it is general category Lo.
>>
>> FE73;ARABIC TAIL FRAGMENT;Lo;0;AL;;;;;N;;;;;
>
> But
>
> FE70..FE72  ; DISALLOWED  # ARABIC FATHATAN ISOLATED FORM..ARABIC  
> DAMMAT
>
> also have the general category Lo, and they are disallowed.

Correct, because they are not stable under casefolding/NFKC. (i.e.  
matches both category A and B in the tables document).

    Patrik



More information about the Idna-update mailing list