Category changes with Unicode 6.3

"Martin J. Dürst" duerst at it.aoyama.ac.jp
Wed Oct 16 04:34:07 CEST 2013


Excuse me if this has been checked and/or discussed already, but I just 
downloaded the Unicode 6.3 version (officially published a few days ago) 
of http://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt and 
found several changes in character classification:

OLD
180E;MONGOLIAN VOWEL SEPARATOR;Zs;0;WS;;;;;N;;;;;
NEW
180E;MONGOLIAN VOWEL SEPARATOR;Cf;0;BN;;;;;N;;;;;

OLD
1A1B;BUGINESE VOWEL SIGN AE;Mc;0;L;;;;;N;;;;;
NEW
1A1B;BUGINESE VOWEL SIGN AE;Mn;0;NSM;;;;;N;;;;;

OLD
2308;LEFT CEILING;Sm;0;ON;;;;;Y;;;;;
2309;RIGHT CEILING;Sm;0;ON;;;;;Y;;;;;
230A;LEFT FLOOR;Sm;0;ON;;;;;Y;;;;;
230B;RIGHT FLOOR;Sm;0;ON;;;;;Y;;;;;
NEW
2308;LEFT CEILING;Ps;0;ON;;;;;Y;;;;;
2309;RIGHT CEILING;Pe;0;ON;;;;;Y;;;;;
230A;LEFT FLOOR;Ps;0;ON;;;;;Y;;;;;
230B;RIGHT FLOOR;Pe;0;ON;;;;;Y;;;;;

Can somebody check whether and how they affect IDNA 2008 and/or precis?

Again, if that has already been done, sorry for the noise.

Regards,   Martin.


P.S.:
All the other changes in UnicodeData.txt:

Change in numerical value only:

OLD
12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;-1;N;;;;;
12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;-1;N;;;;;
NEW
12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;2;N;;;;;
12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;3;N;;;;;


New characters (my understanding is that these are taken care of 
automatically):

061C;ARABIC LETTER MARK;Cf;0;AL;;;;;N;;;;;

2066;LEFT-TO-RIGHT ISOLATE;Cf;0;LRI;;;;;N;;;;;
2067;RIGHT-TO-LEFT ISOLATE;Cf;0;RLI;;;;;N;;;;;
2068;FIRST STRONG ISOLATE;Cf;0;FSI;;;;;N;;;;;
2069;POP DIRECTIONAL ISOLATE;Cf;0;PDI;;;;;N;;;;;


More information about the Idna-update mailing list