Even for English, IDNA does not permit all valid words: "Joe's Bar" will not work, because of both the space and the apostrophe and "Joe'sBar" because of the apostrophe. IDNA2008 explicitly does not permit all sequences that would be valid words in all languages; nor could it do otherwise.<br>
<br>There are two compatibility problems: <br><ol><li>Existing web pages and other documents that contain ß and expect to go to location X and not Y.</li><li>There is still an existing body of billions of browsers that will take years to disappear (as Erik points out, something like 20% of browsers are still IE6).<br>
</li></ol>Let's suppose that IDNA2008 allows ß, and that the newer browsers use it (and not a compatibility scheme like <a href="http://unicode.org/reports/tr46/">http://unicode.org/reports/tr46/</a>). The only real purpose to allowing ß is so that you can distinguish from ss. But what happens when Herr Stosser gets <a href="http://stosser.at">stosser.at</a> and Herr Stoßer gets <a href="http://stosser.at">stoßer.at</a>?<br>
<br>Initially, 100% of all browsers will go to the ss form. Nobody will go to Stoßer's site; his email won't work, etc.: href="<a href="http://stosser.at">stoßer.at</a>" goes directly to "<a href="http://stosser.at">stosser.at</a>". Even on Stoßer's site, absolute intrapage links will go to the 'wrong' place. After a while, some newer browsers will take page href="<a href="http://stosser.at">stoßer.at</a>" and go to <a href="http://stosser.at">stoßer.at</a>, while all the other browsers will go to href="<a href="http://stosser.at">stosser.at</a>". The same goes for email. So access to all of those links (and mail, etc.) will be unreliable, and the subject of security and compatibility problems. As a result, practically, people would be unable to use href="<a href="http://stosser.at">stoßer.at</a>" in their web pages or in email until essentially all existing browsers were supplanted, which will be maybe 5 years down the line. And during that time, these will also bollux up all the search engines, since indexing assumes that links don't have ambiguous targets.<br>
<br>And that assumes that nobody does use a compatibility mechanism. Another, more likely, alternative based on our conversations with vendors is that people disregard that part of IDNA2008, and use a compatibility mechanism like <a href="http://unicode.org/reports/tr46/">http://unicode.org/reports/tr46/</a>; that allows the browsers, emailers, search engines and others to keep functioning correctly - there is no transition. The downside is that you can't register both <a href="http://stosser.at">stosser.at</a> and <a href="http://stosser.at">stoßer.at</a>. So of the thousand Herr Stoßers, instead of one of them getting that name, none of them do; it's like "Joe'sBar". And, of course, there is the disadvantage of having UTS46 have to exist in the first place.<br>
<br>The third alternative is the really awful one: the "let a thousand flowers bloom" scenario. In this scenario, there is no single compatibility mechanism like <a href="http://unicode.org/reports/tr46/">http://unicode.org/reports/tr46/</a>; instead, we get different variants from different vendors, and the situation is chaotic. That is the scenario that the Unicode consortium's members are really concerned with. While the ideal solution would be an IDNA2008 that maintained compatibility with IDNA2003, the second best solution is only one compatibility mechanism; not dozens.<br>
<br>Mark<br>
<br><br><div class="gmail_quote">On Mon, Nov 30, 2009 at 07:55, Georg Ochsner <span dir="ltr"><<a href="mailto:g.ochsner@revolistic.com">g.ochsner@revolistic.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
> -----Ursprüngliche Nachricht-----<br>
> Von: <a href="mailto:idna-update-bounces@alvestrand.no">idna-update-bounces@alvestrand.no</a> [mailto:<a href="mailto:idna-update-">idna-update-</a><br>
> <a href="mailto:bounces@alvestrand.no">bounces@alvestrand.no</a>] Im Auftrag von Alexander Mayrhofer<br>
> Gesendet: Montag, 30. November 2009 15:57<br>
<div class="im"><br>
> > I would though be more "on your side" if the number of domain<br>
> > names that contained ß where say 100 times higher than today<br>
> > in published documents. Because then people would be TOLD to<br>
> > type in something (ß) that mapped to something else (ss) that<br>
> > was registered. That, I claim, is not the case. At least not<br>
> > "heavily".<br>
><br>
> I understand that. And i'm saying that the potential of around 500<br>
> useful "ß" registrations (based on looking through our inventory of 900k<br>
> domains) is by far not worth the effort.<br>
<br>
</div>In several talks with people from the Austrian registry I've now heard this<br>
argument. But I think this decision should not be made depending on<br>
commercial factors. IDNA is in my eyes not a question of return on<br>
investment but about the native use of language in domain names around the<br>
globe. Once more, the Austrian registry can still refuse to have sharp s<br>
registered within their namespace, but maybe other registries pay more<br>
attention to the language aspect than to commercial calculations.<br>
<br>
Best regards<br>
<font color="#888888">Georg<br>
</font><div><div></div><div class="h5"><br>
_______________________________________________<br>
Idna-update mailing list<br>
<a href="mailto:Idna-update@alvestrand.no">Idna-update@alvestrand.no</a><br>
<a href="http://www.alvestrand.no/mailman/listinfo/idna-update" target="_blank">http://www.alvestrand.no/mailman/listinfo/idna-update</a><br>
</div></div></blockquote></div><br>