U+200C rule

Patrik Fältström patrik at frobbit.se
Sun Mar 20 19:11:05 CET 2011


On 20 mar 2011, at 18.47, Simon Josefsson wrote:

> Patrik Fältström <patrik at frobbit.se> writes:
> 
>>> If not, what is the intended way to implemented RegExpMatch?
>> 
>> The expression try to say that you need around _each_ \u200C the following:
>> 
>>> One codepoint with either Joining_Type L or D
>>> 
>>> Zero or more codepoints with Joining_Type T
>>> 
>>> The \u200C
>>> 
>>> Zero or more codepoints with Joining_Type T
>>> 
>>> One codepoint with either Joining_Type R or D
>> 
>> The regexp does not take into account more than one \u200c in each string.
> 
> Thanks for clarification, I'll implement it this way and will add a
> couple of test vectors for it.

If you have suggestions on how to fix the rule, let me know.

   Patrik


More information about the Idna-update mailing list