comments on draft-ietf-idnabis-bidi
Harald Tveit Alvestrand
harald at alvestrand.no
Wed Aug 5 14:05:16 CEST 2009
Erik van der Poel skrev:
> CS and ET certainly are two of the more noticeable differences between
> Mati's proposal and the expired draft, but I don't know whether they
> are so beneficial, since they are mostly (all?) punctuation and symbol
> characters that are not allowed anyway.
> Similarly, the rule that allows EN followed by ET at the end of a
> label may not be so beneficial due to the prohibition of punctuation
> and symbol characters.
All the ET and CS characters are in classes normally forbidden in
IDNA2008 - at least in Unicode 5.1:
egrep ";(CS|ET);" UnicodeData.txt | egrep -v ';(Sc|Po|So|Sm|Zs);'
But there's no reason for Bidi to disallow them for that reason -
exceptions can occur in the future.
More information about the Idna-update