Hangul jamo issues
Soobok Lee
lsb at lsb.org
Tue Jan 2 05:27:35 CET 2007
This is the issue list for Hangul Jamos:
Hangul Jamo ( in Range: 1100-11FF)
These should be available as input and allowed in labels to make
jamo-only sequences of labels and archaic hangul syllables of labels.
We already have registrations using this characters in IDN.com.
Hangul Compatibility Jamo ( in Range: 3130-318F)
These should be available as input and be mapped
into Hangul Jamo Range 1100-11FF by IDNAbis preprocessing stage
in applications.
Ordinary Korean users can type in only these Compatibility
Jamos and cannot type directly those in 1100-11FF (in Windows).
NFKC does this mapping( and composing), but NFC does not.
3164 === U+1160 : compatibility equivalence for hangul filler
3131 === U+1100 : compatibility equivalence for initial KI-EOK
and so on.
Need of jamo sequences in inputs:
KSC5601 has only standard 2350 hangul syllables, while its Window-specific
extension (CP949) has full set of 11172 hangul syllables.
Microsoft added those thousands of characters to serve korean users' needs,
especially from teenagers and scholars.
So, in linux x-terminal, for example, we cannot type directly these
extended syllables, but can type in only compat. jamo sequences.
And, some code conversion tools(cp949 -> ksc5601) may transform
extended hangul syllables into compat. jamo sequences.
If these compat. jamo sequences are mapped into jamo sequeces(u+11xx)
by preprocessing stage in IDNAbis,
NFC in IDNAbis would further combine these sequences into
composed hangul syllables.
Hangul Half-Width Jamo ( in Range: FFA0-FFDC)
Ordinary Korean users seldom type in these Jamos in Windows, AFAIK.
So the need of these characters in label inputs is questionable.
NFKC maps these characters into Hangul Jamo Range 1100-11FF.
But NFC does not.
FFA0 === 3164 === U+1160 : compatibility equivalence for hangul filler
FFA1 === 3131 === U+1100 : compatibility equivalence for initial KI-EOK
and so on.
U+3164, U+1160, U+FFA0 Hangul Filler:
U+3164, U+1160 are displayed as blank space
in Windows.
U+FFA0 Half-width Hangul Filler is displayed
as bold-faced middle dot in Windows.
Need cautions in displaying these characters.
Both initial consonant U+1100 and
its final consonant correspondent U+11A8
are displayed in the exactly same glyph and margin in Windows.
And so forth for other consonants.
Need cautions in registering and displaying these characters.
Soobok
More information about the Idna-update
mailing list