Try your Eszett

Vint Cerf vint at google.com
Sat Mar 7 15:43:56 CET 2009


your test page is password protected.

v


Vint Cerf
Google
1818 Library Street, Suite 400
Reston, VA 20190
202-370-5637
vint at google.com




On Mar 7, 2009, at 9:26 AM, Peter Dambier wrote:

> Sorry if I am a bit beside my shoes - or off topic.
>
> I have prepared a littel test for your browser
>
> http://www.das-loch-von-koelle.de/UTF/
>
> ISO-8859-15  	2681 bytes
> UTF-8 		2709 bytes
> UTF-16 		5372 bytes
> UTF-16-BE 	5386 bytes
> UTF-16-LE 	5386 bytes
> UTF-32-BE 	10722 bytes
> UTF-32-LE 	10722 bytes
>
> It is the same text in 7 codings.
>
> Just for fun you can see that very same example on
>
> http://www.hessen-braucht-sechs.de/UTF/
>
> but here our appache keeps telling you everything
> is UTF-8 although it is not.
>
> Not all browsers / operating systems will show all
> of them.
>
> From the tcp/ip point of view only UTF-32-BE makes
> sense because that is the internet byte order.
>
> On the other hand today we have 63.63...63.63 with
> a maximum of 255 characters per domainname. That
> would shrink to 15.15..15.15 with a max of 63.
>
> As DNS is not a tree but a flat file with practically
> all domains in ".com" those 63 character probably
> would be enough but 15 characters between the dots?
>
> UTF-16 is a mess but some browsers dont even get
> UTF-16-BE correctly which is again the natural byte
> order.
>
> UTF-8 and ISO-8859-15 did not work the way I expected
> so I had to code ISO-8859-1 really. See for yourself
> if UTF-8 works. It might depend on browser and os.
>
> Well the "upper case" sz and the "lower case" &euro
> simply do not exist. If you invent them it will kill
> your search algorithms.
>
> As for the french accents or their non existence in
> the "upper" case ... french writers (those comming
> from the french government) are a minority.
>
> Most french writers do come from Québec or from
> Canada at least, maybe from Belgium or Swizzerland.
>
> And even if they do come from Normandie or the
> Channel Islands, you will find a lot of old literature
> that does have accents on capital letters.
>
> Kind regards
> Peter
>
> -- 
> Peter and Karin Dambier
> Cesidian Root - Radice Cesidiana
> Rimbacher Strasse 16
> D-69509 Moerlenbach-Bonsweiher
> +49(6209)795-816 (Telekom)
> +49(6252)750-308 (VoIP: sipgate.de)
> mail: peter at peter-dambier.de
> http://www.peter-dambier.de/
> http://iason.site.voila.fr/
> https://sourceforge.net/projects/iason/
> ULA= fd80:4ce1:c66a::/48
> _______________________________________________
> Idna-update mailing list
> Idna-update at alvestrand.no
> http://www.alvestrand.no/mailman/listinfo/idna-update



More information about the Idna-update mailing list