prohibiting previously mapped and unmapped characters
    Tina Dam 
    tina.dam at icann.org
       
    Wed Nov 29 21:31:09 CET 2006
    
    
  
I agree that getting some stats on the table would be a great idea...... 
> --On November 29, 2006 11:22 AM, Harald Alvestrand wrote:
> 
> > --On 29. november 2006 09:42 -0800 Erik van der Poel 
> > <erikv at google.com>
> > wrote:
> 
> > If it would help, I can take a look at Google's copies of web 
> > documents to see which characters are actually used there 
> and how many 
> > occurrences there are of each. Of course, such a sample would omit 
> > domain names used in email, but the web is quite an 
> important part of 
> > the Internet too.
> 
> I think such a listing (frequency count of characters 
> actually used in Punycoded domains that actually serve web 
> pages) would be very interesting.
> For the characters that *never* occur, it seems hard to argue 
> that a large community of present users would be hurt by 
> their omission.
-can we say: "As an initial starting point ....for the characters that
*never* occur, it seems hard to argue that a large community of present
users would be hurt by their omission..."  ?
If interesting then I can also pass a request to the gTLD regsitries and see
if they can provide some data about how many of the currently registered
IDNs would be unavailable under the new protocol limitations?
Tina
 
 
> While you're at it, perhaps you could get a count of how many 
> xn-- domains there are out there, as a percentage of the 
> total number of domains for which Google fetches web pages?
> 
> I *love* statistics :-)
> 
>                    Harald
> 
> _______________________________________________
> Idna-update mailing list
> Idna-update at alvestrand.no
> http://www.alvestrand.no/mailman/listinfo/idna-update
> 
    
    
More information about the Idna-update
mailing list