What may make sense, for this and the Katakana middle dot, is to just have rules like the following. That is, not test for adjacency or all characters, but require the label to have <i>at least</i> one character of the appropriate script. That gives the following<br>

<br>GREEK LOWER NUMERAL SIGN:<br><br>Rule Set:<br>     False;<br>     For All Characters:<br>        If Script(cp) .eq.  Greek Then True;<br>     End For;<br><br>KATAKANA MIDDLE DOT:<br><br>Rule Set:<br>     False;<br>     For All Characters:<br>


         If Script(cp) <span class="insert">.in. {Hiragana, Katakana, Han}</span> Then True;<br>


      End For;<br><br>We should also apply this to the Hebrew; the problem with looking at the character before is that if someone uses combining marks on the preceding letter (not common for Hebrew itself, but does occur in other orthographies), then it would be improperly disallowed. Yet it is not worth having a convoluted test for that. So we could apply the same mechanism to those two cases as well:<br>

<br>HEBREW PUNCTUATION GERESH:<br>HEBREW PUNCTUATION GERSHAYIM:<br><br>Rule Set:<br>     False;<br>     For All Characters:<br>


         If Script(cp) <span class="insert">.eq. Hebrew</span> Then True;<br>


      End For;<br><br clear="all">Mark<br>

<br><br><div class="gmail_quote">On Fri, Jul 24, 2009 at 07:35, Wil Tan <span dir="ltr">&lt;<a href="mailto:wil@cloudregistry.net">wil@cloudregistry.net</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

Hi all,<br>

<br>

Several of the contextual rules specify that the label must only<br>

contain a certain script (e.g. Greek, Cyrillic). However, I believe<br>

that in some cases, the use of [0-9] and Hyphen-minus, all of which<br>

are in the &quot;Zyyy&quot; script, is often permitted and makes sense. For<br>

example,<br>

<br>

Appendix A.5. GREEK LOWER NUMERAL SIGN (KERAIA)<br>

   Code point:<br>

      U+0375<br>

   Overview:<br>

      Greek script only.<br>

   Lookup:<br>

      False<br>

   Rule Set:<br>

      True;<br>

      For All Characters:<br>

         If Script(cp) .ne.  Greek Then False;<br>

      End For;<br>

<br>

I wonder if we are being too restrictive here. I note that the .gr<br>

registry allows 0-9 in their IDN policies.<br>

<br>

Perhaps we should change the rule to the following?<br>

<br>

  True;<br>

  For All Characters:<br>

    If Script(cp) .ne. Greek And cp Not .in. 002D,0030..0039 Then False;<br>

  End For;<br>

<br>

<br>

Similar treatment may also be warranted in A.6 Combining Cyrillic Titlo?<br>

<br>

=wil<br>

_______________________________________________<br>

Idna-update mailing list<br>

<a href="mailto:Idna-update@alvestrand.no">Idna-update@alvestrand.no</a><br>

<a href="http://www.alvestrand.no/mailman/listinfo/idna-update" target="_blank">http://www.alvestrand.no/mailman/listinfo/idna-update</a><br>

</blockquote></div><br>