The paragraph in question originated in Harald&#39;s document:<br><br>&gt; Bidi-5.<br>&gt; &nbsp; &nbsp;One particular example of the last case is if a program chooses to<br>&gt; &nbsp; &nbsp;examine the last character (in network order) of a string in order to

<br>&gt; &nbsp; &nbsp;determine its directionality, rather than its first; if it finds an<br>&gt;<br>&gt; &nbsp; &nbsp;NSM character and tries to display the string as if it was a left-to-<br>&gt; &nbsp; &nbsp;right string, the resulting display may be interesting, but not

&gt; &nbsp; &nbsp;useful. I was speaking loosely of a URL when I should have said IRI. I was operating on the same level as Harald&#39;s original text, which is a Unicode character level, not a Punycode level. So replace what I said by IRI. Sorry for the confusion.

Harald&#39;s text must also have been referring to IRI as well, since NSMs don&#39;t occur in URLs. So much of what you wrote was directed at something that I didn&#39;t mean, and I&#39;ll skip over that. There are a few parts I&#39;ll comment on below.

<br><br><br><div class="gmail_quote">On Jan 13, 2008 11:20 AM, John C Klensin &lt;<a href="mailto:klensin@jck.com">klensin@jck.com</a>&gt; wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

Mark,<br><br></blockquote><div>...<br>&nbsp;<br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Certainly, as you, Martin, Erik, and others have pointed out in

<br>various ways, there are many places in which strings appear that<br>look like URLs and don&#39;t conform to URL rules. &nbsp; It may be<br>perfectly reasonable in some contexts to have a string that<br>looks like a URL but that contains non-ASCII characters. &nbsp;But,

unless it is an IRI in a context in which IRIs are permitted, one gets from such a string to a URL via exactly the sort of preprocessing that we&#39;ve been discussing as &quot;user agent&quot; functionality in the IDNAbis context.

</blockquote><div><br>I don&#39;t think it&#39;s as simple as calling it a &quot;UI&quot; context. Using the term &quot;preprocessing&quot; step (as you do below) is clearer. For more, see below.<br><br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

It is also possible that I misunderstand what you mean by<br>&quot;assume&quot;. &nbsp; Neither an implementation of IDNA2003 nor an<br>implementation of IDNA200X is conformant with the intent of<br>those specifications if it &quot;assumes&quot; any of these things and

<br>then goes off and behaves as if they are true. &nbsp;In both cases,<br>implementations are expected to test the strings they intend to<br>pass (or intend others to pass) to the DNS so that<br>non-conforming strings will fail. &nbsp;In IDNA2003, most of the

<br>testing is built into ToASCII and the operations surrounding it.<br>In IDNA200X, much of the testing is more explicit. &nbsp;But neither<br>assumes things that it doesn&#39;t verify.</blockquote><div><br>I think we may agree on this. Part of my confusion with Harald&#39;s original text presumed that we had an implementation that made a (false) presumption by assuming that IDNAs were necessarily IDNAbis -- so a change would cause a problem for some implementation. 

<br><br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Clearly, there is at least other issue. &nbsp;It arises for names<br>that are valid under IDNA200X but not obviously valid under

<br>IDNA2003. &nbsp;An IDNA2003 lookup implementation will reject some of<br>them as invalid (some or most of those that merely contain<br>codepoints that are unassigned in Unicode 3.2 but assigned in<br>later versions may slip through). &nbsp;In the long term, the only

<br>way to make all of the newly-available characters and strings<br>available to IDN-using applications is for implementations of<br>those applications to upgrade. &nbsp;That would be true of any update<br>to IDNA that moves beyond Unicode 

3.2, especially since<br>registration of strings that contain codepoints that are are<br>unassigned at registration time is, fairly obviously, the worst<br>of bad practices.</blockquote><div><br>I foresee an indefinitely long period in which many programs like browsers, emailers, etc would need to handle both IDNA2003 and IDNAbis.  

<br><br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Now I&#39;m being a little pedantic here, for which I apologize, but<br>I think the point is important. &nbsp; If any of the majority of the

<br>cases you list above, what the strings occur in is not a URL,<br>but something that must be transformed into a URL.</blockquote><div>&nbsp;</div><div>agreed<br>&nbsp;<br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

...</blockquote><div>&nbsp;</div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Now I&#39;m going to make two assumptions with which you may<br>

disagree. &nbsp;The first is that the IDNA200X model is sufficiently<br>different from the IDNA2003 one that few, if any, applications<br>are going to switch (or be able or inclined to switch) from<br>IDNA2003 to IDNA200X by a completely automatic process without

<br>anyone thinking about it or noticing. &nbsp;</blockquote><div><br>I would disagree a bit with the first one. Many programs (such as those in my company, Google) will need to handle both, for an indefinite period. I expect what we will probably do is to 

<br><ul><li>See if it works under IDNA2003. If so, fine</li><li>Otherwise see if it works under IDNAbis, if so, fine</li><li>See if the major browsers accept it anyway, if so, we&#39;ll need to take it anyway.</li></ul>Take a look at the following table, 

<font size="2">for example:<br><br></font><table id="table1" style="border-collapse: collapse;" border="1">

        <tbody><tr>

                <th align="left"><font size="2">&nbsp;</font></th>

                <th align="left"><font size="2">Link</font></th>

                <th align="left"><font size="2">Firefox</font></th>

                <th align="left"><font size="2">IE7</font></th>

        </tr>

        <tr>

                <td><font size="2">0</font></td>

                <td><font size="2">&lt;a href=&quot;<a href="http://b%C3%BCcher.de/">http://bücher.de</a>&quot;&gt;</font></td>

                <td><font size="2">works</font></td>

                <td><font size="2">works</font></td>

        </tr>

        <tr>

                <td><font size="2">1</font></td>

                <td><font size="2">&lt;a href=&quot;<a href="http://b%C3%BCcher.de/">http://Bücher.de</a>&quot;&gt;</font></td>

                <td><font size="2">works</font></td>

                <td><font size="2">works</font></td>

        </tr>

        <tr>

                <td><font size="2">2</font></td>

                <td><font size="2">&lt;a href=&quot;<a href="http://b%C3%BCcher.de/">http://xn--bcher-kva.de</a>&quot;&gt;</font></td>

                <td><font size="2">works</font></td>

                <td><font size="2">works</font></td>

        </tr>

        <tr>

                <td><font size="2">3</font></td>

                <td><font size="2">&lt;a href=&quot;<a href="http://b%c3%bccher.de/">http://B%C3%BCcher.de</a>&quot;&gt;</font></td>

                <td><font size="2">doesn&#39;t</font></td>

                <td><font size="2">doesn&#39;t</font></td>

        </tr>

</tbody></table>

<font size="2"><br></font><p><font size="2">Because Firefox and IE7 both accept (0), (1), and (2), I can&#39;t see any way around Google&#39;s handling them also. This is into the indefinite future, even though #0 and #1 are not in Punycode. And this is not a U

I issue; these are in the HTML page. That&#39;s why &quot;preprocessing&quot; is a better phrase than &quot;UI&quot;. The more of the web and net&#39;s infrastructure that accepts these variations, the more that other programs need to accommodate them, so that they interwork with one another.

</p>What I really don&#39;t want to see is an IDNAbis that fails to gain traction because of this (thinking back to XML 1.1, which failed to gain traction because of a really rather small incompatibility with XML 1.0).<br>

<p><br></p></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">...</blockquote><div>&nbsp;</div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

<br><br>The second assumption is that any implementation that now<br>depends upon, or offers to users, the input flexibilities of<br>IDNA2003 (some applications of IDNA2003 do not) would be stupid<br>to implement IDNA200X in a way that simply drops those

<br>flexibilities. &nbsp;Whether it should quietly retain them, or<br>produce more or less subtle warnings to users about the<br>conversions becomes a local design matter (and programs that<br>communicate with users obviously have choices that are not

<br>available to ones do not), it appears to me that we are already<br>heading in the direction of applications (and, if that approach<br>isn&#39;t stopped for other reasons, &quot;smart domain name servers&quot;)<br>making decisions about some things being safer than others and

<br>conditioning their actions on those decisions.</blockquote><div><br>I think we&#39;re in agreement here. <br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

<br><br>The rationale document doesn&#39;t cover that situation nearly well<br>enough at -05, but there is a new section and extensive text<br>about it in the working version of -06. &nbsp;I don&#39;t think anything<br>there will come as a surprise, since all of the issues have been

<br>discussed on this list and much of the text is derived from<br>discussions on the list.</blockquote><div><br>Good.<br>&nbsp;<br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

<div class="Ih2E3d"><br>&gt; What I&#39;m saying is that essentially all of the incompatible<br>&gt; differences between 2003 and the current bis are potential<br>&gt; problems for some implementation, and once we get done with

<br>&gt; bis, we will need to list them all. So just calling out #5 is<br>&gt; insufficient.<br><br></div>While our perspective on these &quot;incompatible differences&quot; is<br>quite different, I hope that the new text in issues-06 will

<br>address many of your concerns. &nbsp;</blockquote><div><br>Looking forward to it.<br>&nbsp;<br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

But it is also true that many of<br>those differences are differences in how and when IDNA is<br>applied that are simply not defined by the original protocol or<br>are differences that are important only if applicability<br>

principles or guidelines about the use of the original protocol were violated. &nbsp;If adjustments in those areas are impossible, then we are in very difficult waters indeed.</blockquote><div> Yes, I think we may need to be pragmatic about the changes that we introduce, because of the established conventions...

<br><br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><br><br>&gt;...<br><br>best,<br><font color="#888888"> &nbsp; &nbsp;john<br></font><div>

<div></div><div class="Wj3C7c"><br>_______________________________________________<br>Idna-update mailing list<br><a href="mailto:Idna-update@alvestrand.no">Idna-update@alvestrand.no</a><br><a href="http://www.alvestrand.no/mailman/listinfo/idna-update" target="_blank">

http://www.alvestrand.no/mailman/listinfo/idna-update</a><br></div></div></blockquote></div><br><br clear="all"><br>-- <br>Mark