Katakana Middle Dot again (Was: tables-06b.txt: A.5, A.6, A.9)

Harald Tveit Alvestrand harald at alvestrand.no
Fri Aug 7 13:12:37 CEST 2009

Yoshiro YONEYA skrev:
> Dear Patrik,
>> False;
>> For All Characters:
>>     If Script(cp) .in. {Hiragana, Katakana, Han} Then True;
> Please include U+3005..U+3007 into the scripts set because they are also 
> Japanese character family.
If nothing has changed, they're already in; from Unicode 5.1 "Scripts.txt":

3005          ; Han # Lm       IDEOGRAPHIC ITERATION MARK
3007          ; Han # Nl       IDEOGRAPHIC NUMBER ZERO

3006 is "IDEOGRAPHIC CLOSING MARK", and has script "Common"; is it worth 
it to add yet another exception to the ruleset for allowing strings that 
use this letter and no other (Hiragana, Katakana, Han)?


More information about the Idna-update mailing list