Unassigned code points

Simon Josefsson simon at josefsson.org
Sun May 29 20:32:38 CEST 2011


Patrik Fältström <patrik at frobbit.se> writes:

> On 29 maj 2011, at 19.37, Simon Josefsson wrote:
>
>> Looks fine except that I believe U+20B9 should be DISALLOWED rather than
>> UNASSIGNED.  For consistency with RFC 5892 you may want to include
>> 10FFFF too (other noncharacters are included in the list).
>
> Yeah, 10FFFF is a bug in my software that I have never fixed...
>
> I will have a look at U+20B9.
>
> What rule is it failing on?

I quoted the output for the wrong code point, see proper output below.
It is the final else clause that decides the category.

/Simon

Exception(U+20b9) = UNKNOWN
BackwardCompatible(U+20b9) = UNKNOWN
	General_Category-Cn(U+20b9) == FALSE
	Noncharacter_Code_Point(U+20b9) == FALSE
Unassigned (U+20b9) == FALSE
LDH (U+20b9) == FALSE
JoinControl (U+20b9) == FALSE
	toNFKC(U+20B9) = U+20B9
	toCaseFold(toNFKC(U+20B9)) = U+20B9
	toNFKC(toCaseFold(toNFKC(U+20B9))) = U+20B9
Unstable (U+20b9) == FALSE
IgnorableProperties (U+20b9) == FALSE
	Block(U+20b9) = Currency Symbols
	Combining Diacritical Marks for Symbols(U+20b9) = FALSE
	Musical Symbols(U+20b9) = FALSE
	Ancient Greek Musical Notation(U+20b9) = FALSE
IgnorableBlocks (U+20b9) == FALSE
OldHangulJamo (U+20b9) == FALSE
	General_Category-Ll(U+20b9) == FALSE
	General_Category-Lu(U+20b9) == FALSE
	General_Category-Lo(U+20b9) == FALSE
	General_Category-Nd(U+20b9) == FALSE
	General_Category-Lm(U+20b9) == FALSE
	General_Category-Mn(U+20b9) == FALSE
	General_Category-Mc(U+20b9) == FALSE
LetterDigits (U+20b9) == FALSE
Property(U+20b9) == DISALLOWED


More information about the Idna-update mailing list