Tables and contextual rule for Katakana middle dot
yone at jprs.co.jp
Mon Apr 6 12:12:58 CEST 2009
Japanese uses Hiragana, Katakana, Han, Alphabet letters (a-z), and
digit (0-9) for names. KATAKANA MIDDLEDOT is usually used with those
names, so the following kind of case is really exists and used:
Play<KATAKANA MIDDLEDOT>Station<KATAKANA MIDDLEDOT>4.jp
That is the reason why I said "Japanese context".
To be precise, Japanese scripts (for IDN) are consists from:
Hiragana, Katakana, Han, Alphabet, Digit,
IDEOGRAPHIC CLOSING MARK, IDEOGRAPHIC NUMBER ZERO,
KATAKANA MIDDLEDOT and IDEOGRAPHIC ITERATION MARK
Extracting Alphabet and Digit from the list is unacceptable.
I'll try to express this ambiguous situation more clearly.
Yoshiro YONEYA <yone at jprs.co.jp>
On Mon, 6 Apr 2009 10:47:20 +0200 Patrik Fältström <patrik at frobbit.se> wrote:
> On 6 apr 2009, at 09.36, Yoshiro YONEYA wrote:
> > Dear John-san and Patrik-san,
> > In reality, in Japanese context, KATAKANA MIDDLEDOT (U+30FB) is used
> > to compose names and to concatenate words, so it is used various
> > places.
> > And sometimes preceeding and/or succeeding character is alphabet or
> > digit
> > ([a-zA-Z0-9]). Furthermore, KATAKANA MIDDLEDOT is sometimes placed at
> > the beginning or the ending of names.
> > Therefore, it is very difficult to define rule set for KATAKANA
> > MIDDLEDOT.
> > What I can say is:
> > (KATAKANA MIDDLEDOT) MUST be used in Japanese context.
> Can you express this in more specific terms that can be included in
> the draft? Do you with "japanese" imply it has to be one of the
> japanese scripts (as John said) for example?
More information about the Idna-update