Hangul jamo issues

Soobok Lee lsb at lsb.org
Tue Jan 2 05:27:35 CET 2007


This is the issue list for Hangul Jamos:

Hangul Jamo ( in Range: 1100-11FF)
  These  should be available as input and allowed in labels to make
  jamo-only sequences of labels and archaic hangul syllables of labels.
  We already have registrations using this characters in IDN.com.
  
Hangul Compatibility Jamo ( in Range: 3130-318F)
  These  should be available as input and be mapped
  into Hangul Jamo Range 1100-11FF by IDNAbis preprocessing stage
   in applications.
  Ordinary Korean users can type in only these Compatibility
  Jamos and cannot type directly those in 1100-11FF (in Windows).
  NFKC does this mapping( and composing), but NFC does not.
  3164 === U+1160 : compatibility equivalence for hangul filler
  3131 === U+1100 : compatibility equivalence for initial KI-EOK
  and so on.

  Need of jamo sequences in inputs:
   KSC5601 has only standard 2350 hangul syllables, while its Window-specific
    extension (CP949) has full set of 11172 hangul syllables.
    Microsoft added those thousands of characters to serve korean users' needs,
    especially from teenagers and scholars.
   So, in linux x-terminal, for example, we cannot type directly these
    extended syllables, but can type in only compat. jamo sequences.
   And, some code conversion tools(cp949 -> ksc5601) may transform 
     extended hangul syllables into compat. jamo sequences.
   If these compat. jamo sequences are mapped into jamo sequeces(u+11xx)
    by preprocessing stage in IDNAbis,
    NFC in IDNAbis would further combine these sequences into 
    composed hangul syllables.
    

Hangul Half-Width Jamo ( in Range: FFA0-FFDC)
  Ordinary Korean users seldom type in these Jamos in Windows, AFAIK.
  So the need of these characters in label inputs is questionable.
  NFKC maps these characters into Hangul Jamo Range 1100-11FF.
  But NFC does not.
  FFA0 === 3164 === U+1160 : compatibility equivalence for hangul filler
  FFA1 === 3131 === U+1100 : compatibility equivalence for initial KI-EOK
  and so on.

U+3164, U+1160, U+FFA0 Hangul Filler:
 U+3164, U+1160 are displayed as blank space 
  in Windows.
 U+FFA0  Half-width Hangul Filler is displayed
  as bold-faced middle dot in Windows.
 Need cautions in displaying these characters.

Both initial consonant U+1100 and 
 its final consonant correspondent U+11A8  
 are displayed in the exactly same glyph and margin in Windows.
 And so forth for other consonants.
  Need cautions in registering and displaying these characters.

Soobok



More information about the Idna-update mailing list