Unassigned code points
Simon Josefsson
simon at josefsson.org
Sun May 29 20:32:38 CEST 2011
Patrik Fältström <patrik at frobbit.se> writes:
> On 29 maj 2011, at 19.37, Simon Josefsson wrote:
>
>> Looks fine except that I believe U+20B9 should be DISALLOWED rather than
>> UNASSIGNED. For consistency with RFC 5892 you may want to include
>> 10FFFF too (other noncharacters are included in the list).
>
> Yeah, 10FFFF is a bug in my software that I have never fixed...
>
> I will have a look at U+20B9.
>
> What rule is it failing on?
I quoted the output for the wrong code point, see proper output below.
It is the final else clause that decides the category.
/Simon
Exception(U+20b9) = UNKNOWN
BackwardCompatible(U+20b9) = UNKNOWN
General_Category-Cn(U+20b9) == FALSE
Noncharacter_Code_Point(U+20b9) == FALSE
Unassigned (U+20b9) == FALSE
LDH (U+20b9) == FALSE
JoinControl (U+20b9) == FALSE
toNFKC(U+20B9) = U+20B9
toCaseFold(toNFKC(U+20B9)) = U+20B9
toNFKC(toCaseFold(toNFKC(U+20B9))) = U+20B9
Unstable (U+20b9) == FALSE
IgnorableProperties (U+20b9) == FALSE
Block(U+20b9) = Currency Symbols
Combining Diacritical Marks for Symbols(U+20b9) = FALSE
Musical Symbols(U+20b9) = FALSE
Ancient Greek Musical Notation(U+20b9) = FALSE
IgnorableBlocks (U+20b9) == FALSE
OldHangulJamo (U+20b9) == FALSE
General_Category-Ll(U+20b9) == FALSE
General_Category-Lu(U+20b9) == FALSE
General_Category-Lo(U+20b9) == FALSE
General_Category-Nd(U+20b9) == FALSE
General_Category-Lm(U+20b9) == FALSE
General_Category-Mn(U+20b9) == FALSE
General_Category-Mc(U+20b9) == FALSE
LetterDigits (U+20b9) == FALSE
Property(U+20b9) == DISALLOWED
More information about the Idna-update
mailing list