Dear colleagues,<br><br>

fascinating is the way our Unicode fellow Members are going to &quot;oppose the charter, disregard a consensus, reverse a 80% feed back, force the Chair to &quot;change his mind&quot; and most probably its consensus evaluation, imposing their interest to their commercial competition, and writing themseleves a paper legitimacy against the demand of their own users they most probably are  certain they can influence in the same manner&quot;.  <br>

<br>

Also fascinating to realize that all this small group of &quot;60 Hz&quot; people effort is void. The 60hzians&#39; strategy already failed on TATWEEL. This was due to the reasonable positions of Siavash Shahshahani and Roozbeh Pournader. Users&#39; interest and number and technical influence is moving to the <a href="mailto:workon@idna2010.org">workon@idna2010.org</a> subscribers diversity. <br>

<br>

Portzamparc<br><br>

NB. Microsoft&#39;s position regarding the French language has now been archived as an important pre-experimentation input <a href="http://iucg.org/wiki/Microsoft_IRT_the_French_language_orthotypography">http://iucg.org/wiki/Microsoft_IRT_the_French_language_orthotypography</a> - comments welcome.<br>

<br><br><div class="gmail_quote">2009/12/10 Kenneth Whistler <span dir="ltr">&lt;<a href="mailto:kenw@sybase.com">kenw@sybase.com</a>&gt;</span><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

Paul,<br>

<div class="im"><br>

&gt; &gt;Protocol, 5.2:<br>

&gt; &gt;<br>

&gt; &gt;5.2. Conversion to Unicode<br>

&gt; &gt;<br>

&gt; &gt;The string is converted from the local character set into<br>

&gt; &gt;Unicode, if it is not already in Unicode. Depending on local<br>

&gt; &gt;needs, this conversion may involve mapping some characters<br>

&gt; &gt;into other characters as well as coding conversions...<br>

&gt; &gt;The results MUST be a Unicode string in NFC form.<br>

&gt; &gt;<br>

&gt; &gt;<br>

&gt; &gt;Strings don&#39;t magically get to be &quot;in NFC form&quot;, without<br>

&gt; &gt;being mapped (via normalization algorithm) from whatever form<br>

&gt; &gt;they started out as, *into* NFC form.<br>

&gt;<br>

&gt; In this case, yes they do. That &quot;MUST&quot; is probably wrong; I believe<br>

&gt; that the statement is meant to say &quot;The results will be a<br>

&gt; Unicode string in NFC form&quot;.<br>

&gt;<br>

&gt; John, et. al.: is my understanding correct here?<br>

<br>

</div>I repeat: strings do not turn into NFC form by magic. They<br>

are mapped by the normalization algorithm. And that mapping<br>

involves, necessarily, PVALID --&gt; PVALID characters in<br>

this case.<br>

<br>

And &quot;the results&quot; will not be a &quot;Unicode string in NFC form&quot;<br>

unless it is mapped to that. (Except of course, by accident,<br>

if it happens to start out as NFC in the first place.)<br>

<br>

I think what you may be trying to get at is a requirement that<br>

the only valid *input* to the label processing is a string<br>

which is already a Unicode string in NFC form -- and that<br>

how it got to be that way and indeed whether it started out<br>

as a string in SJIS or 8859-7 or whatever, is beyond the<br>

protocol&#39;s scope of caring.<br>

<br>

But if that is the case, then why are we talking about trying<br>

to add into Section 5.2 a prohibition (a MUST NOT) against<br>

mapping PVALID characters? Because manifestly, for any<br>

actual implementation to meet the requirement of having<br>

Unicode strings in NFC format as valid input for the label<br>

processing, it MUST map strings (including PVALID<br>

characters) using the Unicode normalization algorithm.<br>

<br>

I understand that maybe you want to say that that isn&#39;t<br>

a requirement *in* the protocol -- it is simply a requirement<br>

on the well-formedness of valid input to be handled as<br>

labels. And maybe I don&#39;t understand how you distribute<br>

the MUSTs, SHOULDs and wills around the document to accomplish<br>

that.<br>

<br>

But my essential point here is that you cannot have you cake<br>

and eat it too -- trying to prohibit mapping of PVALID<br>

characters in Section 5.2 at the same time that you<br>

are requiring mapping of PVALID characters in Section 5.2 --<br>

however the exact protocol wordsmanship of that gets<br>

worked out.<br>

<br>

Maybe you can, for example, prohibit *case* mapping of PVALID<br>

characters in Section 5.2, while requiring canonical<br>

normalization mapping of PVALID characters to NFC. That<br>

at least would be a coherent position. But you cannot<br>

just prohibit mapping and require mapping of the same<br>

class of characters, without being more discriminating<br>

in what you mean.<br>

<br>

--Ken<br>

<div><div></div><div class="h5"><br>

<br>

_______________________________________________<br>

Idna-update mailing list<br>

<a href="mailto:Idna-update@alvestrand.no">Idna-update@alvestrand.no</a><br>

<a href="http://www.alvestrand.no/mailman/listinfo/idna-update" target="_blank">http://www.alvestrand.no/mailman/listinfo/idna-update</a><br>

</div></div></blockquote></div><br>