idnabis-tables-04 problem #1: Inconsistencies in category definitions

Patrik Fältström patrik at frobbit.se
Mon Dec 22 15:28:17 CET 2008


On 6 Dec 2008, at 00:52, Kenneth Whistler wrote:

> ==========================================================
>
> 2.1 LetterDigits (A)
>
> Current:
>
> A: generalCategory(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc}
>
> Suggested fix:
>
> A: General_Category(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc}

Fixed.
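
For readers who want to try rule A, here is a minimal sketch. It assumes Python's standard unicodedata module, whose category() function returns the General_Category value for the Unicode version bundled with the interpreter. The names is_letter_digit and LETTER_DIGIT_CATEGORIES are made up for illustration and do not appear in the draft.

```python
import unicodedata

# General_Category values listed in rule A of the draft.
LETTER_DIGIT_CATEGORIES = {"Ll", "Lu", "Lo", "Nd", "Lm", "Mn", "Mc"}


def is_letter_digit(cp: int) -> bool:
    """Return True if General_Category(cp) is in the rule A set.

    Hypothetical helper: the result depends on the Unicode version
    shipped with the running Python interpreter.
    """
    return unicodedata.category(chr(cp)) in LETTER_DIGIT_CATEGORIES
```

With this, is_letter_digit(0x0061) holds (Ll), while is_letter_digit(0x002D) does not (HYPHEN-MINUS is Pd), which is consistent with 002D being handled in the Exceptions rule instead.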

> This then entails deleting the following sentence in Section
> 2.1:
>
> "The generalCategory() operation returns the General Category
> for a particular Unicode code point."

Fixed.

> And replacing it with a generic statement as the last
> paragraph of Section 2, above Section 2.1:
>
> "In the following specification of categories, the operation
> which returns the value of a particular Unicode character
> property for a code point is designated by using the
> formal name of that property (from PropertyAliases.txt)
> followed by '(cp)'. For example, the value of the
> General_Category property for a code point is indicated
> by General_Category(cp)."

Fixed.

> ==========================================================
>
> 2.2 Unstable (B)
>
> Current:
>
> B: toNFKC(toCasefold(toNFKC(cp))) != cp
>
> Suggested fix:
>
> None needed. This is o.k. These are not property names, but
> functional operations defined elsewhere.
>
> However, in the third paragraph, for consistency:
>
> toCaseFold...
> -->
> The toCasefold() operation...

Fixed.
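
A rough sketch of rule B, for illustration only. It assumes that Python's str.casefold() (Unicode full case folding) is an acceptable stand-in for toCasefold(), and that unicodedata.normalize("NFKC", ...) stands in for toNFKC(). The function name is_unstable is hypothetical.

```python
import unicodedata


def is_unstable(cp: int) -> bool:
    """Return True if toNFKC(toCasefold(toNFKC(cp))) != cp.

    Assumption: str.casefold() approximates toCasefold(); results
    follow the Unicode version bundled with the interpreter.
    """
    ch = chr(cp)
    step1 = unicodedata.normalize("NFKC", ch)      # toNFKC(cp)
    step2 = step1.casefold()                       # toCasefold(...)
    step3 = unicodedata.normalize("NFKC", step2)   # toNFKC(...)
    return step3 != ch
```

Under these assumptions U+0041 (A) is unstable because it folds to U+0061, and U+00DF is unstable because it folds to "ss", while U+0061 maps to itself.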

> ==========================================================
>
> 2.3 IgnorableProperties (C)
>
> Current:
>
> C: property(cp) is in {Default_Ignorable_Code_Point, White_Space,
>                       Noncharacter_Code_Point}
>
> Suggested:
>
> C: Default_Ignorable_Code_Point(cp) = True
>   or
>   White_Space(cp) = True
>   or
>   Noncharacter_Code_Point(cp) = True

Fixed.

> ==========================================================
>
> 2.4 IgnorableBlocks (D)
>
> Current:
>
> D: block(cp) in {Combining Diacritical Marks for Symbols,
>                 Musical Symbols, Ancient Greek Musical Notation}
>
> Suggested:
>
> D: Block(cp) is in {Combining Diacritical Marks for Symbols,
>                    Musical Symbols, Ancient Greek Musical Notation}

Fixed.

> ==========================================================
>
> 2.6 Exceptions (F)
>
> Current:
>
> F: cp in {002D, 00B7, 00DF, 02B9, 0375, 0483, 05F3, 05F4, 06FD,
>          06FE, 0F0B, 3005, 3007, 302E, 302F, 303B, 30FB}
>
> Suggested:
>
> F: cp is in {002D, 00B7, 00DF, 02B9, 0375, 0483, 05F3, 05F4, 06FD,
>             06FE, 0F0B, 3005, 3007, 302E, 302F, 303B, 30FB}
>
> Add "is" for consistency.

Fixed.

> ==========================================================
>
> 2.7 BackwardCompatible (G)
>
> Current:
>
> G: cp in {}
>
> Suggested:
>
> G: cp is in {}
>
> Add "is" for consistency.

Fixed.

> ==========================================================
>
> 2.8 JoinControl (H)
>
> Current:
>
> H: property(cp) is in {Join_Control}
>
> Suggested:
>
> H: Join_Control(cp) = True

Fixed.
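
As a small illustration of the boolean form of rule H: Python's standard library does not expose Join_Control, so the sketch below hard-codes the two code points that carry the property, U+200C ZERO WIDTH NON-JOINER and U+200D ZERO WIDTH JOINER. The names JOIN_CONTROL and is_join_control are hypothetical.

```python
# Code points with Join_Control = True (ZWNJ and ZWJ), hard-coded
# because the standard library has no accessor for this property.
JOIN_CONTROL = frozenset({0x200C, 0x200D})


def is_join_control(cp: int) -> bool:
    """Return True if Join_Control(cp) = True (hypothetical helper)."""
    return cp in JOIN_CONTROL
```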

> ==========================================================
>
> 2.9 OldHangulJamo (I)
>
> Current:
>
> I: HangulSyllableType(cp) is in {L, V, T}
>
> Suggested:
>
> I: Hangul_Syllable_Type(cp) is in {L, V, T}

Fixed.

> ==========================================================
>
> 2.10 Unassigned (J)
>
> Current:
>
> J: cp is in {Cn} and property(cp) is not in {Noncharacter_Code_Point}
>
> Suggested:
>
> J: General_Category(cp) is in {Cn}
>   and
>   Noncharacter_Code_Point(cp) = False

Fixed.
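
Rule J can be sketched the same way. The standard library has no accessor for Noncharacter_Code_Point, so this sketch computes it from the fixed set of 66 noncharacters: U+FDD0..U+FDEF plus the last two code points of every plane. The helper names are hypothetical, and General_Category comes from the Unicode version bundled with the interpreter.

```python
import unicodedata


def is_noncharacter(cp: int) -> bool:
    """Noncharacter_Code_Point(cp): U+FDD0..U+FDEF and U+nFFFE/U+nFFFF."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def is_unassigned(cp: int) -> bool:
    """Rule J: General_Category(cp) is in {Cn}
    and Noncharacter_Code_Point(cp) = False."""
    return unicodedata.category(chr(cp)) == "Cn" and not is_noncharacter(cp)
```

The second condition matters because noncharacters also have General_Category Cn; without it, U+FFFF would be classified as Unassigned rather than falling under the IgnorableProperties rule.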

    Patrik


