Jamo [RE: Consensus Call Tranche 8 (Character Adjustments)]

k kim nulo0000 at gmail.com
Fri Oct 17 02:20:04 CEST 2008


> Mark Davis <mark at macchiato.com> Wed, Oct 15, 2008 at 5:56 AM

> That is, each of the Hangul precomposed syllables decomposes into one or two

"One or two" is wrong; it should be "two or three". (Am I missing something here?)
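
The count can be checked mechanically. The following is only a minimal sketch, assuming Python's standard unicodedata module; the helper name nfd_jamo_count is made up for illustration.

```python
import unicodedata

def nfd_jamo_count(syllable):
    """Return the number of code points in the NFD form of one syllable."""
    return len(unicodedata.normalize("NFD", syllable))

# U+AC00 (GA) has no final consonant; U+D55C (HAN) has one.
print(nfd_jamo_count("\uac00"))  # 2
print(nfd_jamo_count("\ud55c"))  # 3

# Collect the distinct counts over the whole precomposed block U+AC00..U+D7A3.
counts = {nfd_jamo_count(chr(cp)) for cp in range(0xAC00, 0xD7A4)}
print(sorted(counts))  # [2, 3]
```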

> combining jamo under NFD, and under NFC that sequence of combining jamo
> composes back into that syllable. The comparisons *do* work correctly,
> since IDNA labels have to be in NFC.

- Well, I would have to disagree with you.
Let me explain why the above claim is not correct.

- According to UCS (ISO/IEC 10646), each of the following three sequences can
represent the Hangul syllable GGA:

1) U+AE4C (GGA)

2) U+1101 (GG), U+1161 (A)

3) U+1100 (G), U+1100 (G), U+1161 (A)

 - Under NFC, 2) U+1101 (GG), U+1161 (A) is changed to 1) U+AE4C (GGA);
 - However, under NFC, 3) U+1100 (G), U+1100 (G), U+1161 (A) is changed to
U+1100 (G), U+AC00 (GA), which is "different" from 1) U+AE4C.
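
The NFC behaviour described above can be reproduced with a short script. This is only a minimal sketch, assuming Python's standard unicodedata module; the helper name nfc_codepoints is made up for illustration. It takes the precomposed syllable for GG + A to be the code point given by the standard Hangul composition arithmetic, 0xAC00 + (1*21 + 0)*28 = 0xAE4C.

```python
import unicodedata

def nfc_codepoints(text):
    """Normalize to NFC and list the resulting code points as U+XXXX strings."""
    return ["U+%04X" % ord(ch) for ch in unicodedata.normalize("NFC", text)]

# Hangul composition arithmetic: SBase + (LIndex * VCount + VIndex) * TCount.
# U+1101 (GG) has LIndex 1, U+1161 (A) has VIndex 0, and there is no final.
GGA = 0xAC00 + (1 * 21 + 0) * 28
print("U+%04X" % GGA)  # U+AE4C

precomposed = chr(GGA)               # 1) precomposed syllable
double_jamo = "\u1101\u1161"         # 2) GG, A
single_jamo = "\u1100\u1100\u1161"   # 3) G, G, A

print(nfc_codepoints(precomposed))   # ['U+AE4C']
print(nfc_codepoints(double_jamo))   # ['U+AE4C']
print(nfc_codepoints(single_jamo))   # ['U+1100', 'U+AC00']
```

Sequences 1) and 2) come out identical, while 3) stays a leading G followed by the syllable GA.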

 > The comparisons *do* work correctly,

- ??? Isn't this a comparison failure? (Am I missing something here?)
- As we saw, NFC/NFD does not make these spellings compare equal even for
modern Hangul (not to mention Old Hangul)!

------------

Note: For example, in XML 1.0 (Fourth Edition), 2) U+1101 (GG), U+1161 (A) is
NOT allowed because #x1101 is not included in BaseChar; in contrast,
3) U+1100 (G), U+1100 (G), U+1161 (A) IS allowed.

(source: http://www.w3.org/TR/2006/REC-xml-20060816/#NT-BaseChar)

... #x1100 | [#x1102-#x1103] | [#x1105-#x1107] | #x1109 | [#x110B-#x110C] |
[#x110E-#x1112] | #x113C | #x113E | #x1140 | #x114C | #x114E | #x1150 |
[#x1154-#x1155] | #x1159 | [#x115F-#x1161] | #x1163 | #x1165 | #x1167 |
#x1169 | [#x116D-#x116E] | [#x1172-#x1173] | #x1175 | #x119E | #x11A8 |
#x11AB | [#x11AE-#x11AF] | [#x11B7-#x11B8] | #x11BA | [#x11BC-#x11C2] |
#x11EB | #x11F0 | #x11F9 | ...
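
The membership claim can be checked against the excerpt. This is only a minimal sketch: the ranges are copied from the quoted portion of BaseChar (the rest of the production is left out), the names are made up for illustration, and it tests nothing beyond membership in that excerpt.

```python
# Jamo portion of the BaseChar production, as (first, last) inclusive ranges.
JAMO_BASECHAR_RANGES = [
    (0x1100, 0x1100), (0x1102, 0x1103), (0x1105, 0x1107), (0x1109, 0x1109),
    (0x110B, 0x110C), (0x110E, 0x1112), (0x113C, 0x113C), (0x113E, 0x113E),
    (0x1140, 0x1140), (0x114C, 0x114C), (0x114E, 0x114E), (0x1150, 0x1150),
    (0x1154, 0x1155), (0x1159, 0x1159), (0x115F, 0x1161), (0x1163, 0x1163),
    (0x1165, 0x1165), (0x1167, 0x1167), (0x1169, 0x1169), (0x116D, 0x116E),
    (0x1172, 0x1173), (0x1175, 0x1175), (0x119E, 0x119E), (0x11A8, 0x11A8),
    (0x11AB, 0x11AB), (0x11AE, 0x11AF), (0x11B7, 0x11B8), (0x11BA, 0x11BA),
    (0x11BC, 0x11C2), (0x11EB, 0x11EB), (0x11F0, 0x11F0), (0x11F9, 0x11F9),
]

def in_excerpt(code_point):
    """True if the code point falls inside one of the excerpted ranges."""
    return any(lo <= code_point <= hi for lo, hi in JAMO_BASECHAR_RANGES)

def all_in_excerpt(text):
    """True if every character of the text is covered by the excerpt."""
    return all(in_excerpt(ord(ch)) for ch in text)

print(all_in_excerpt("\u1101\u1161"))        # False: #x1101 is missing
print(all_in_excerpt("\u1100\u1100\u1161"))  # True
```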

KIM, Kyongsok
* I have chaired Korea JTC1/SC2 (a committee on Coded Character Sets) since
1993. This committee represents Korea in ISO/IEC JTC1/SC2, which is in charge
of UCS (ISO/IEC 10646).



* * *


-- 
KIM, Kyongsok
Dept. of Computer Science
Pusan National University
Busan, Rep. of KOREA


More information about the Idna-update mailing list