idnabis-tables-04 problem #1: Inconsistencies in category definitions
Patrik Fältström
patrik at frobbit.se
Mon Dec 22 15:28:17 CET 2008
On 6 dec 2008, at 00.52, Kenneth Whistler wrote:
> ==========================================================
>
> 2.1 LetterDigits (A)
>
> Current:
>
> A: generalCategory(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc}
>
> Suggested fix:
>
> A: General_Category(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc}
Fixed.
> This then entails deleting the following sentence in Section
> 2.1:
>
> "The generalCategory() operation returns the General Category
> for a particular Unicode code point."
Fixed.
> And replacing it with a generic statement as the last
> paragraph of Section 2, above Section 2.1:
>
> "In the following specification of categories, the operation
> which returns the value of a particular Unicode character
> property for a code point is designated by using the
> formal name of that property (from PropertyAliases.txt)
> followed by '(cp)'. For example, the value of the
> General_Category property for a code point is indicated
> by General_Category(cp)."
Fixed.
> ==========================================================
>
> 2.2 Unstable (B)
>
> Current:
>
> B: toNFKC(toCasefold(toNFKC(cp))) != cp
>
> Suggested fix:
>
> None needed. This is o.k. These are not property names, but
> functional operations defined elsewhere.
>
> However, in the third paragraph, for consistency:
>
> toCaseFold...
> -->
> The toCasefold() operation...
Fixed.
> ==========================================================
>
> 2.3 IgnorableProperties (C)
>
> Current:
>
> C: property(cp) is in {Default_Ignorable_Code_Point, White_Space,
> Noncharacter_Code_Point}
>
> Suggested:
>
> C: Default_Ignorable_Code_Point(cp) = True
> or
> White_Space(cp) = True
> or
> Noncharacter_Code_Point(cp) = True
Fixed.
> ==========================================================
>
> 2.4 IgnorableBlocks (D)
>
> Current:
>
> D: block(cp) in {Combining Diacritical Marks for Symbols,
> Musical Symbols, Ancient Greek Musical Notation}
>
> Suggested:
>
> D: Block(cp) is in {Combining Diacritical Marks for Symbols,
> Musical Symbols, Ancient Greek Musical Notation}
Fixed.
> ==========================================================
>
> 2.6 Exceptions (F)
>
> Current:
>
> F: cp in {002D, 00B7, 00DF, 02B9, 0375, 0483, 05F3, 05F4, 06FD,
> 06FE, 0F0B, 3005, 3007, 302E, 302F, 303B, 30FB}
>
> Suggested:
>
> F: cp is in {002D, 00B7, 00DF, 02B9, 0375, 0483, 05F3, 05F4, 06FD,
> 06FE, 0F0B, 3005, 3007, 302E, 302F, 303B, 30FB}
>
> Add "is" for consistency.
Fixed.
> ==========================================================
>
> 2.7 BackwardCompatible (G)
>
> Current:
>
> G: cp in {}
>
> Suggested:
>
> G: cp is in {}
>
> Add "is" for consistency.
Fixed.
> ==========================================================
>
> 2.8 JoinControl (H)
>
> Current:
>
> H: property(cp) is in {Join_Control}
>
> Suggested:
>
> H: Join_Control(cp) = True
Fixed.
> ==========================================================
>
> 2.9 OldHangulJamo (I)
>
> Current:
>
> I: HangulSyllableType(cp) is in {L, V, T}
>
> Suggested:
>
> I: Hangul_Syllable_Type(cp) is in {L, V, T}
Fixed.
> ==========================================================
>
> 2.10 Unassigned (J)
>
> Current:
>
> J: cp is in {Cn} and property(cp) is not in {Noncharacter_Code_Point}
>
> Suggested:
>
> J: General_Category(cp) is in {Cn}
> and
> Noncharacter_Code_Point(cp) = False
Fixed.
Patrik
More information about the Idna-update
mailing list