prohibiting previously mapped and unmapped characters

Erik van der Poel erikv at google.com
Sat Dec 2 01:18:29 CET 2006


One correction: 0.0188% of all the URLs in the sample contained
character sequences in their domain names that were mapped to
something else in the IDNA and Nameprep processes, but not the
Punycode process. This includes the various versions of the dot (CJK,
full-width, etc), characters mapped to nothing and any sequences
affected by normalization and case-mapping, excluding ASCII
case-mapping.

Erik

On 12/1/06, Erik van der Poel <erikv at google.com> wrote:
> 0.0188% of the domain names
> are mapped to different strings by the IDNA process, from the links
> found in HTML to the domain names passed to DNS.


More information about the Idna-update mailing list