looking up domain names with unassigned code points

Frank Ellermann hmdmhdfmhdjmzdtjmzdtzktdkztdjz at gmail.com
Thu May 15 22:42:23 CEST 2008


Erik van der Poel wrote:

> For example, xn--en32g would produce U+110000, which is outside
> the range of valid code points. (The highest code point is U+10FFFF.)

> If an app receives such a punycode string, it should not attempt
> to display the corresponding Unicode (since it is invalid). I'm
> guessing that we can all agree on that. :-)

Definitely.  My UTF-32BE to UTF-8 encoder failed miserably for
0xFFFFFFFF, I forgot to check "negative" (non-) code points ;-)
So far for "assume valid input".

 Frank



More information about the Idna-update mailing list