Category changes with Unicode 6.3
"Martin J. Dürst"
duerst at it.aoyama.ac.jp
Wed Oct 16 04:34:07 CEST 2013
Excuse me if this has been checked and/or discussed already, but I just
downloaded the Unicode 6.3 version (officially published a few days ago)
of http://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt and
found several changes in character classification:
OLD
180E;MONGOLIAN VOWEL SEPARATOR;Zs;0;WS;;;;;N;;;;;
NEW
180E;MONGOLIAN VOWEL SEPARATOR;Cf;0;BN;;;;;N;;;;;
OLD
1A1B;BUGINESE VOWEL SIGN AE;Mc;0;L;;;;;N;;;;;
NEW
1A1B;BUGINESE VOWEL SIGN AE;Mn;0;NSM;;;;;N;;;;;
OLD
2308;LEFT CEILING;Sm;0;ON;;;;;Y;;;;;
2309;RIGHT CEILING;Sm;0;ON;;;;;Y;;;;;
230A;LEFT FLOOR;Sm;0;ON;;;;;Y;;;;;
230B;RIGHT FLOOR;Sm;0;ON;;;;;Y;;;;;
NEW
2308;LEFT CEILING;Ps;0;ON;;;;;Y;;;;;
2309;RIGHT CEILING;Pe;0;ON;;;;;Y;;;;;
230A;LEFT FLOOR;Ps;0;ON;;;;;Y;;;;;
230B;RIGHT FLOOR;Pe;0;ON;;;;;Y;;;;;
Can somebody check whether and how they affect IDNA 2008 and/or precis?
Again, if that has already been done, sorry for the noise.
Regards, Martin.
P.S.:
All the other changes in UnicodeData.txt:
Change in numerical value only:
OLD
12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;-1;N;;;;;
12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;-1;N;;;;;
NEW
12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;2;N;;;;;
12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;3;N;;;;;
New characters (my understanding is that these are taken care of
automatically):
061C;ARABIC LETTER MARK;Cf;0;AL;;;;;N;;;;;
2066;LEFT-TO-RIGHT ISOLATE;Cf;0;LRI;;;;;N;;;;;
2067;RIGHT-TO-LEFT ISOLATE;Cf;0;RLI;;;;;N;;;;;
2068;FIRST STRONG ISOLATE;Cf;0;FSI;;;;;N;;;;;
2069;POP DIRECTIONAL ISOLATE;Cf;0;PDI;;;;;N;;;;;
More information about the Idna-update
mailing list