Jamo [RE: Consensus Call Tranche 8 (Character Adjustments)]

Michel SUIGNARD Michel at suignard.com
Fri Oct 17 05:57:00 CEST 2008


I would like to know where in ISO/IEC 10646 the type of sequence described in 3 is ‘allowed’ to represent such Hangul syllables. Because to the best of my knowledge it is not.
If it is not, the whole argument falls flat.
IN XML 1.1, the syllable itself GGA is already allowed in the same BaseChar production list: “[#xAC00-#xD7A3]”, so the Hangul syllable repertoire is already covered w/o adding that sequence explicitly, and the syllable is the NFC representation of the GGA syllable.

Michel Suignard
(project editor for 10646)

From: idna-update-bounces at alvestrand.no [mailto:idna-update-bounces at alvestrand.no] On Behalf Of k kim

 
- According to UCS (ISO/IEC 10646), each of the following three can represent 
Hangul syllable GGA:
1) UAC01 (GGA)
2) U1101 (GG), U1161 (A)
3) U1100 (G), U1100 (G), U1161 (A)
 - By NFC, 2) U1101 (GG), U1161 (A) will be changed to 1) UAC01 (GGA);
 - However, by NFC, 3) U1100 (G), U1100 (G), U1161 (A) will be changed to 
U1100 (G), UAC00 (GA), which is "different" from 1) UAC01.
 
 > The comparisons *do* work correctly, 
 
- ??? Isn't it considered comparison failure? (Am I missing something here?)
- As we saw, NFC/NFD does not work correctly even for modern Hangul,
(not to mention Old Hangul)!
------------
Note. For example, in XML 1.0 (fourth ed), 2) U1101 (GG), U1161 (A) is NOT allowed since #x1101 is not included; in contrast, 3) U1100 (G), U1100 (G), U1161 (A) IS allowed.
(source :http://www.w3.org/TR/2006/REC-xml-20060816/#NT-BaseChar)
.#x1100 | [#x1102-#x1103] | [#x1105-#x1107] | #x1109 | [#x110B-#x110C] | 
[#x110E-#x1112] | #x113C | #x113E | #x1140 | #x114C | #x114E | #x1150 | 
[#x1154-#x1155] | #x1159 | [#x115F-#x1161] | #x1163 | #x1165 | #x1167 | #x1169 |
[#x116D-#x116E] | [#x1172-#x1173] | #x1175 | #x119E | #x11A8 | #x11AB | 
[#x11AE-#x11AF] | [#x11B7-#x11B8] | #x11BA | [#x11BC-#x11C2] | #x11EB | #x11F0 | 
#x11F9 | ..
KIM, Kyongsok
* I have been a chair of Korea JTC1/SC2 (a committee on Coded Character Set) since 1993.
This committee represents Korea in ISO/IEC JTC1/SC2 which is in charge of UCS (ISO/IEC 10646).
  
* * *


-- 
KIM, Kyongsok
Dept. of Computer Science
Pusan National University
Busan, Rep. of KOREA


More information about the Idna-update mailing list