I'm on vacation this week, but will respond when I get back to work.<br><br>I do know that in the test we are handling both the unescaped representation (å) and the escaped representation (%C3%A5), so we'll have to break the figures apart.
<br><br>Mark<br><br><div><span class="gmail_quote">On 4/4/07, <b class="gmail_sendername">Harald Alvestrand</b> <<a href="mailto:firstname.lastname@example.org">email@example.com</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Mark Davis wrote:<br>><br>><br>> On 3/15/07, *John C Klensin* <<a href="mailto:firstname.lastname@example.org">email@example.com</a><br>> <mailto:<a href="mailto:firstname.lastname@example.org">email@example.com</a>>> wrote:<br>>
<br>> [snip]<br>><br>> I'm trying to understand this experiment. Normally, an href<br>> that "uses IDNA" would have Punycode labels (A-labels) in its<br>> domain names.<br>><br>
><br>> I don't know the basis for saying that this would be the "normal"<br>> usage. There isn't anything in IDNA2003, unless I'm missing something,<br>> that requires or even suggests that it is not perfectly fine to have:
<br>><br>> <a href="<a href="http://ÖBB.at">http://ÖBB.at</a> <http://%C3%96BB.at>">Österreichishe Bundesbahn</a><br>That text is in the HTML spec.<br><br>All versions of HTML that claim that the stuff inside a "href=" is an
<br>URI (rather than an IRI) implicitly claim that the domain name is in<br>A-label form.<br>I'm not up to date on HTML updates, but RFC 3987 was published in<br>January 2005, so all versions older than that (including HTML
4.01)<br>referenced URIs.<br><br>After a little searching, I found that details on the recommended way of<br>handling those errors are in<br><<a href="http://www.w3.org/TR/html401/appendix/notes.html#non-ascii-chars">http://www.w3.org/TR/html401/appendix/notes.html#non-ascii-chars
</a>> - but<br>they're still errors.<br><br>Do you have stats on how many of the 831.000 cases you identified were<br>in A-label form rather than "possibly conformant U-label" form? That<br>would tell us something about how much standards are adhered to....
<br><br> Harald<br><br><br></blockquote></div><br><br clear="all"><br>-- <br>Mark