Contextual rule for MODIFIER LETTER PRIME?

Patrik Fältström patrik at frobbit.se
Thu Jul 16 09:58:42 CEST 2009


Can you list which code points you are talking about?

   paf

On 16 jul 2009, at 02.39, Mark Davis ⌛ wrote:

> I agree. I think most of the CONTEXTO items should either be PVALID or
> DISALLOWED. The only reason for a CONTEXT* is where we really have to
> have the character, but it is extremely confusable with syntax
> characters, or generally invisible (joiners).
>
> Mark
>
>
> On Wed, Jul 15, 2009 at 17:29, Kenneth Whistler <kenw at sybase.com> wrote:
>
>> Patrik,
>>
>> Michael's point:
>>
>>> "Romanized Cyrillic" is Latin. Not Cyrillic. The modifier letter
>>> prime is not used with Cyrillic.
>>
>> and Mark Davis' point:
>>
>>>> On 5 nov 2008, at 03.15, Mark Davis wrote:
>>>>
>>>>> For example those
>>>>> for MODIFIER LETTER PRIME (used in romanized Cyrillic as well as
>>>>> Greek) which can be a final character in words, ...
>>
>> is that U+02B9 is not used just as the normalized equivalent of
>> the U+0374 GREEK NUMERAL SIGN. It occurs commonly in Latin
>> transliteration of Slavic languages -- including Old Church Slavonic,
>> but even Russian, too. So it is used with the Latin script (but not
>> with the Cyrillic).
>>
>>>> Is what you say that the existing rules:
>>>>
>>>>> Appendix A.6.  MODIFIER LETTER PRIME
>>>>>  Code point:
>>>>>     U+02B9
>>>>>  Overview:
>>>>>     Permitted only in contexts in which GREEK LOWER NUMERAL SIGN,
>>>>>     U+0375, is permitted.  GREEK NUMERAL SIGN, U+0374, and the
>>>>>     Lower Numeral Sign (U+0375) are indicators for numeric use of
>>>>>     letters in older Greek writing systems.  U+02B9 is relevant
>>>>>     because normalization maps U+0374 into it.
>>>>>  Lookup:
>>>>>     False
>>>>>  Rule Set:
>>>>>     True;
>>>>>     For All Characters:
>>>>>        If Script(cp) .ne.  Greek Then False;
>>>>>     End For;
>>>>
>>>> ...should be changed so that script cyrillic, or '-' as adjacent
>>>> character makes this ok?
>>
>> Neither.
>>
>>>>
>>>> Can you please provide a new rule that works?
>>
>> Once again, this attribution of CONTEXTO to U+02B9 is accomplishing
>> nothing but adding unnecessary complication to the rules.
>>
>> U+02B9 should simply be PVALID -- which will happen
>> automatically if it is removed from the exception list in 2.6.
>> And then Appendix A.6 should simply be removed instead of
>> trying to define a complex rule that would make any sense
>> for it as CONTEXTO.
>>
>> I realize that people think that the Greek numeral signs
>> are "dangerous" for IDN because they look kind of like
>> syntax characters. But this is no more true than any number
>> of other modifier letters which are quietly PVALID in the
>> table. If the Greeks think that the two numeral signs
>> make sense for IDNs, then we should just make 0375
>> PVALID in 2.6, leave 02B9 PVALID by rule, and be done with
>> it. No context rules needed. Simpler and cleaner.
>>
>> --Ken
>>
>> _______________________________________________
>> Idna-update mailing list
>> Idna-update at alvestrand.no
>> http://www.alvestrand.no/mailman/listinfo/idna-update
>>
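
The Appendix A.6 rule set quoted above can be sketched roughly as follows,
to show why it rejects the Latin transliteration use that Ken describes.
This is only an illustration under stated assumptions: the function names
are hypothetical, the Python standard library has no Script property, so
Script(cp) is approximated by checking whether the character name starts
with "GREEK", and the loop is assumed to cover the other characters in the
label, not U+02B9 itself.

```python
import unicodedata

MODIFIER_LETTER_PRIME = "\u02B9"


def is_greek(ch: str) -> bool:
    # Assumption: rough stand-in for Script(cp) == Greek. The stdlib does not
    # expose the Unicode Script property, so the character name is used here.
    return unicodedata.name(ch, "").startswith("GREEK ")


def a6_rule_allows(label: str) -> bool:
    """One reading of the quoted A.6 rule: False if any other character
    in the label is not Greek (hypothetical helper, not from the draft)."""
    if MODIFIER_LETTER_PRIME not in label:
        return True  # the contextual rule only matters when U+02B9 occurs
    for ch in label:
        if ch == MODIFIER_LETTER_PRIME:
            continue  # assumption: the rule does not test U+02B9 itself
        if not is_greek(ch):
            return False
    return True


if __name__ == "__main__":
    print(a6_rule_allows("\u03B1\u03B2\u02B9"))  # Greek letters plus prime
    print(a6_rule_allows("mat\u02B9"))           # Latin transliteration plus prime
```

Under this reading a Greek label containing U+02B9 passes and a Latin
transliteration such as "mat" followed by U+02B9 fails. If U+02B9 were
simply PVALID, no such check would be needed.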
