comments on draft-ietf-idnabis-bidi

Harald Tveit Alvestrand harald at alvestrand.no
Wed Aug 5 14:05:16 CEST 2009


Erik van der Poel skrev:
> CS and ET certainly are two of the more noticeable differences between
> Mati's proposal and the expired draft, but I don't know whether they
> are so beneficial, since they are mostly (all?) punctuation and symbol
> characters that are not allowed anyway.
>
> Similarly, the rule that allows EN followed by ET at the end of a
> label may not be so beneficial due to the prohibition of punctuation
> and symbol characters.
All the ET and CS characters are in classes normally forbidden in 
IDNA2008 - at least in Unicode 5.1:

egrep ";(CS|ET);" UnicodeData.txt | egrep -v ';(Sc|Po|So|Sm|Zs);'

(no output)

But there's no reason for Bidi to disallow them for that reason - 
exceptions can occur in the future.

             Harald



More information about the Idna-update mailing list