I agree. I think most of the CONTEXTO items should be either PVALID or DISALLOWED. The only justification for a CONTEXT* rule is a character that we really must allow but that is either extremely confusable with syntax characters or generally invisible (joiners).<br>
<br clear="all">Mark<br>
<br><br><div class="gmail_quote">On Wed, Jul 15, 2009 at 17:29, Kenneth Whistler <span dir="ltr">&lt;<a href="mailto:kenw@sybase.com">kenw@sybase.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Patrik,<br>
<br>
Michael's point:<br>
<div class="im"><br>
> "Romanized Cyrillic" is Latin. Not Cyrillic. The modifier letter prime<br>
> is not used with Cyrillic.<br>
<br>
</div>and Mark Davis' point:<br>
<div class="im"><br>
> > On 5 nov 2008, at 03.15, Mark Davis wrote:<br>
> ><br>
> >> For example those<br>
> >> for MODIFIER LETTER PRIME (used in romanized Cyrillic as well as<br>
</div>> >> Greek) which can be a final character in words, ...<br>
<br>
is that U+02B9 is not used just as the normalized equivalent of<br>
U+0374 GREEK NUMERAL SIGN. It occurs commonly in Latin transliteration<br>
of Slavic languages -- including Old Church Slavonic, and even Russian.<br>
So it is used with the Latin script (but not with the Cyrillic).<br>
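<br>
The normalization relationship mentioned here can be checked with a short Python sketch. It relies only on the standard unicodedata module; the helper name normalized_forms and the sample label "den" + U+02B9 are illustrative assumptions, not anything taken from the drafts.<br>

```python
import unicodedata

GREEK_NUMERAL_SIGN = "\u0374"     # U+0374 GREEK NUMERAL SIGN
MODIFIER_LETTER_PRIME = "\u02b9"  # U+02B9 MODIFIER LETTER PRIME


def normalized_forms(ch):
    """Return the four Unicode normalization forms of ch (hypothetical helper)."""
    return {
        form: unicodedata.normalize(form, ch)
        for form in ("NFC", "NFD", "NFKC", "NFKD")
    }


# U+0374 has a canonical singleton decomposition to U+02B9, so every
# normalization form replaces it with the modifier letter prime.
forms = normalized_forms(GREEK_NUMERAL_SIGN)

# An illustrative Latin-script label ending in U+02B9, as in transliteration.
# Normalization leaves it unchanged.
latin_label = "den" + MODIFIER_LETTER_PRIME
```

The consequence is that after normalization a label cannot tell us whether the user typed the Greek numeral sign or the Latin-script modifier letter.<br>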
<div class="im"><br>
> > Is what you say that the existing rules:<br>
> ><br>
> >> Appendix A.6. MODIFIER LETTER PRIME<br>
> >> Code point:<br>
> >> U+02B9<br>
> >> Overview:<br>
> >> Permitted only in contexts in which GREEK LOWER NUMERAL SIGN,<br>
> >> U+0375, is permitted. GREEK NUMERAL SIGN, U+0374, and the Lower<br>
> >> Numeral Sign (U+0375) are indicators for numeric use of<br>
> >> letters in<br>
> >> older Greek writing systems. U+02B9 is relevant because<br>
> >> normalization maps U+0374 into it.<br>
> >> Lookup:<br>
> >> False<br>
> >> Rule Set:<br>
> >> True;<br>
> >> For All Characters:<br>
> >> If Script(cp) .ne. Greek Then False;<br>
> >> End For;<br>
> ><br>
> > ...should be changed so that script cyrillic, or '-' as adjacent<br>
> > character makes this ok?<br>
<br>
</div>Neither.<br>
<div class="im"><br>
> ><br>
> > Can you please provide a new rule that works?<br>
<br>
</div>Once again, this attribution of CONTEXTO to U+02B9 is accomplishing<br>
nothing but adding unnecessary complication to the rules.<br>
<br>
U+02B9 should simply be PVALID -- which will happen<br>
automatically if it is removed from the exception list in 2.6.<br>
And then Appendix A.6 should simply be removed instead of<br>
trying to define a complex rule that would make any sense<br>
for it as CONTEXTO.<br>
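<br>
To illustrate what the quoted A.6 rule set does in practice, here is a rough Python sketch. It is an approximation under stated assumptions: Python's standard library has no Script property, so the character name prefix "GREEK " stands in for Script(cp) .eq. Greek, and U+02B9 itself is skipped when scanning the label. The function name label_passes_a6 is hypothetical.<br>

```python
import unicodedata

MODIFIER_LETTER_PRIME = "\u02b9"  # U+02B9 MODIFIER LETTER PRIME


def is_greek(ch):
    # Assumption: approximate Script(cp) == Greek by the character name
    # prefix, because the standard library does not expose the Script property.
    return unicodedata.name(ch, "").startswith("GREEK ")


def label_passes_a6(label):
    """Approximate the quoted A.6 rule set (hypothetical helper).

    Starts from True and becomes False as soon as any character other
    than U+02B9 itself is not Greek.
    """
    for ch in label:
        if ch == MODIFIER_LETTER_PRIME:
            continue  # assumption: the code point under test is not checked
        if not is_greek(ch):
            return False
    return True


greek_label = "\u03b1" + MODIFIER_LETTER_PRIME  # alpha followed by U+02B9
latin_label = "den" + MODIFIER_LETTER_PRIME     # Latin transliteration style

greek_ok = label_passes_a6(greek_label)
latin_ok = label_passes_a6(latin_label)
```

Under these assumptions the Greek label passes and the Latin transliteration label is rejected, which is the case the rule fails to cover.<br>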
<br>
I realize that people think that the Greek numeral signs<br>
are "dangerous" for IDN because they look kind of like<br>
syntax characters. But this is no more true of them than of any number<br>
of other modifier letters which are quietly PVALID in the<br>
table. If the Greeks think that the two numeral signs<br>
make sense for IDNs, then we should just make U+0375<br>
PVALID in 2.6, leave U+02B9 PVALID by rule, and be done with<br>
it. No context rules needed. Simpler and cleaner.<br>
<br>
--Ken<br>
<div><div></div><div class="h5"><br>
</div></div></blockquote></div><br>