Contextual rule for GERESH?
Patrik Fältström
patrik at frobbit.se
Thu Jul 16 09:55:22 CEST 2009
Thanks.
paf
On 15 jul 2009, at 19.14, Mark Davis ⌛ wrote:
> That is what I was saying: the test should be changed from:
>
> The script of the preceding character and the subsequent
> character, if any, MUST be Hebrew.
>
> to
>
> The script of the preceding character MUST be Hebrew.
>
> or, more accurately:
>
> There must be a preceding character, and its script MUST be
> Hebrew.
>
> Mark
>
>
> 2009/7/15 Matitiahu Allouche <matial at il.ibm.com>
>
>>
>> Patrik Fältström asked (speaking of the Rule Set for Geresh):
>> <quote>Should be changed so it is also ok if the character before
>> or after
>> is
>> a '-'?:<end of quote>
>>
>> I don't see a valid use case where Geresh could come after a '-'.
>>
>> Geresh is normally used in Hebrew as the last character of an
>> abbreviation.
>> The previous character should thus be a Hebrew letter.
>>
>> In the context of IDNA where 2 or more parts of speech may be
>> joined as one
>> token (label), I don't see a way to restrict what may come after
>> the Geresh.
>>
>> Shalom (Regards), Mati
>> Bidi Architect
>> Globalization Center Of Competency - Bidirectional Scripts
>> IBM Israel
>> Phone: +972 2 5888802 Fax: +972 2 5870333 Mobile:
>> +972 52
>> 2554160
>>
>>
>>
>> *Patrik Fältström <patrik at frobbit.se>*
>> Sent by: idna-update-bounces at alvestrand.no
>>
>> 15/07/2009 16:00
>> To
>> Mark Davis <mark at macchiato.com> cc
>> idna-update at alvestrand.no Subject
>> Contextual rule for GERESH?
>>
>>
>>
>>
>> On 5 nov 2008, at 03.15, Mark Davis wrote:
>>
>>> For example those for GERESH, which can be a final character in
>>> words, but the rules
>>> require to be followed by a Hebrew letter, when "-" should also be
>>> allowed
>>> (the GERSHAYIM rules on the other hand are ok, since it is only
>>> medial).
>>
>> Is what you are saying that the current rules:
>>
>>> Appendix A.8. HEBREW PUNCTUATION GERESH
>>> Code point:
>>> U+05F3
>>> Overview:
>>> The script of the preceding character and the subsequent
>>> character, if any, MUST be Hebrew.
>>> Lookup:
>>> False
>>> Rule Set:
>>> False;
>>> If Script(Before(cp)) .eq. Hebrew And
>>> LastChar .eq. cp Then True;
>>> If Script(Before(cp)) .eq. Hebrew And
>>> Script(After(cp)) .eq. Hebrew Then True;
>>>
>>
>> Should be changed so it is also ok if the character before or after
>> is
>> a '-'?
>>
>> Please provide new rules.
>>
>> paf
>>
>> _______________________________________________
>> Idna-update mailing list
>> Idna-update at alvestrand.no
>> http://www.alvestrand.no/mailman/listinfo/idna-update
>>
>>
>> _______________________________________________
>> Idna-update mailing list
>> Idna-update at alvestrand.no
>> http://www.alvestrand.no/mailman/listinfo/idna-update
>>
>>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: PGP.sig
Type: application/pgp-signature
Size: 186 bytes
Desc: This is a digitally signed message part
Url : http://www.alvestrand.no/pipermail/idna-update/attachments/20090716/5e4469eb/attachment.pgp
More information about the Idna-update
mailing list