ZWNJ contextual rule

Patrik Fältström patrik at frobbit.se
Wed Jul 15 14:54:32 CEST 2009


On 5 nov 2008, at 03.15, Mark Davis wrote:

> For example, the rules for ZWNJ which don't match actual usage,

Can you explain what "actual usage" is?

The rule in -05 are now:

Appendix A.2.  ZERO WIDTH NON-JOINER
    Code point:
       U+200C
    Overview:
       Between two characters from the same script only.  The script  
must
       be one in which the use of this character causes significant
       visual transformation of one or both of the adjacent characters.
    Lookup:
       True
    Rule Set:
       False;
       If Script(Before(cp)) .eq. ( Deva | Tamil | Arabic ) And
          Script(After(cp)) .eq. ( Deva | Tamil | Arabic ) And
          Script(Before(cp)) .eq.  Script(After(cp)) Then True;
       If Not
          RegExpMatch((Joining_Type:L)(Joining_Type:
          T)*\u200C(Joining_Type:T)*(Joining_Type:R))
          Then False;

    Patrik

-------------- next part --------------
A non-text attachment was scrubbed...
Name: PGP.sig
Type: application/pgp-signature
Size: 186 bytes
Desc: This is a digitally signed message part
Url : http://www.alvestrand.no/pipermail/idna-update/attachments/20090715/f08d1976/attachment.pgp 


More information about the Idna-update mailing list