U+200C rule

Simon Josefsson simon at josefsson.org
Sun Mar 20 18:47:55 CET 2011


Patrik Fältström <patrik at frobbit.se> writes:

>> If not, what is the intended way to implemented RegExpMatch?
>
> The expression try to say that you need around _each_ \u200C the following:
>
>> One codepoint with either Joining_Type L or D
>> 
>> Zero or more codepoints with Joining_Type T
>> 
>> The \u200C
>> 
>> Zero or more codepoints with Joining_Type T
>> 
>> One codepoint with either Joining_Type R or D
>
> The regexp does not take into account more than one \u200c in each string.

Thanks for clarification, I'll implement it this way and will add a
couple of test vectors for it.

/Simon


More information about the Idna-update mailing list