A plethora of dots (Re: Comments on protocol-04)
Harald Tveit Alvestrand
harald at alvestrand.no
Tue Mar 4 00:25:03 CET 2008
Erik van der Poel skrev:
>> > Other examples of processing for localization that might be applied,
>> > if appropriate, at this point (but even further outside the scope of
>> > this specification) include interpreting the KANA MIDDLE DOT as
>>
>> Bad example. Since the Middle dot is allowed currently, it cannot be
>> treated as a separator.
>>
>
> A better example might be U+06D4 (ARABIC FULL STOP).
Careful - we have both MIDDLE DOT (U+00B7) and KATAKANA MIDDLE DOT
(U+30FB).
Both are on the "CONTEXT0" list in issues-07.
There's even a CANADIAN SYLLABICS FINAL MIDDLE DOT (U+1427), which is
not (it's PVALID, since Unicode has declared that it's a letter, not
punctuation).
Agree that the ARABIC FULL STOP is a better example, because:
1) there are already people arguing that it should be treated like a dot
2) it is not at all visually confusable with a dot - the Unicode book
makes it look quite similar to the hyphen (U+002D, HYPHEN-MINUS), though
somewhat more elegant.
Harald
More information about the Idna-update
mailing list