U+200C rule
Patrik Fältström
patrik at frobbit.se
Sun Mar 20 19:11:05 CET 2011
On 20 mar 2011, at 18.47, Simon Josefsson wrote:
> Patrik Fältström <patrik at frobbit.se> writes:
>
>>> If not, what is the intended way to implement RegExpMatch?
>>
>> The expression tries to say that around _each_ \u200C you need the following:
>>
>>> One codepoint with either Joining_Type L or D
>>>
>>> Zero or more codepoints with Joining_Type T
>>>
>>> The \u200C
>>>
>>> Zero or more codepoints with Joining_Type T
>>>
>>> One codepoint with either Joining_Type R or D
>>
>> The regexp as written does not take into account more than one \u200C in each string.
>
> Thanks for the clarification, I'll implement it this way and will add a
> couple of test vectors for it.
If you have suggestions on how to fix the rule, let me know.
Patrik
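
For illustration, the per-\u200C reading described in the quoted text above could be sketched in Python roughly as follows. This is only a sketch under stated assumptions, not a reference implementation: the function names and the three-entry Joining_Type table are hypothetical and exist only to make the example runnable. A real implementation would take Joining_Type from the Unicode Character Database, and this covers only the RegExpMatch part of the rule.

```python
from typing import Callable

ZWNJ = "\u200C"

# Toy Joining_Type table (assumption: illustration only). Real code would
# load the complete Joining_Type data from the Unicode Character Database.
_SAMPLE_JOINING_TYPE = {
    "\u0628": "D",  # ARABIC LETTER BEH, dual-joining
    "\u0627": "R",  # ARABIC LETTER ALEF, right-joining
    "\u064E": "T",  # ARABIC FATHA, transparent
}


def sample_joining_type(ch: str) -> str:
    """Hypothetical lookup: anything not in the toy table is treated as 'U'."""
    return _SAMPLE_JOINING_TYPE.get(ch, "U")


def zwnj_context_ok(
    label: str,
    joining_type: Callable[[str], str] = sample_joining_type,
) -> bool:
    """Check the joining context around each U+200C in the label.

    For every U+200C: skip any Joining_Type T code points to the left and
    require L or D there, then skip any T code points to the right and
    require R or D there. A label without U+200C passes trivially.
    """
    for i, ch in enumerate(label):
        if ch != ZWNJ:
            continue

        # Walk left over transparent code points.
        j = i - 1
        while j >= 0 and joining_type(label[j]) == "T":
            j -= 1
        if j < 0 or joining_type(label[j]) not in ("L", "D"):
            return False

        # Walk right over transparent code points.
        k = i + 1
        while k < len(label) and joining_type(label[k]) == "T":
            k += 1
        if k >= len(label) or joining_type(label[k]) not in ("R", "D"):
            return False

    return True
```

Because every U+200C is checked separately, a label with two of them is handled the same way as a label with one. A dual-joining code point between two U+200C can serve as the right context of the first and the left context of the second.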