The macrolanguage mechanism might be perfectly fine for some applications. For internationalization, however, the thought of changing all use of &quot;zh&quot; to &quot;cmn&quot; is simply bizarre; it&#39;d be like getting all the people driving on the left side of the road (GB, AU, JP,...) to switch to the right. Ain&#39;t gonna happen. On the other hand, mapping &quot;cmn&quot; to &quot;zh&quot; for lookup is absolutely feasible, and maintains compatibility. And the instability in ISO 639-3, where any language could suddenly become a macrolanguage makes it even worse.<div>

<br></div><div>So for compatibility and stability reasons, in Unicode we map the predominant encompassed subtag to the macrolanguage. The &quot;no&quot; and &quot;sh&quot; predated the macrolanguage mechanism, and are handled differently -- also for compatibility. (Although frankly, in an ideal world we&#39;d also map &quot;nb&quot; to &quot;no&quot;, and &quot;sh&quot; to &quot;sr&quot;.)<div>

<br clear="all">Mark<br>

<br><br><div class="gmail_quote">On Fri, Jan 22, 2010 at 10:49, Leif Halvard Silli <span dir="ltr">&lt;<a href="mailto:xn--mlform-iua@xn--mlform-iua.no">xn--mlform-iua@xn--mlform-iua.no</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">

John Cowan, Fri, 22 Jan 2010 11:03:19 -0500:<br>

&gt; Mark Davis ☕ scripsit:<br>

<br>

( I placed back the cappuccino letter that John&#39;s mailer destroyed.)<br>

;-)<br>

<div class="im"><br>

&gt;&gt; The way we have it set up, it isn&#39;t for us to decide. But consumers<br>

&gt;&gt; of BCP47 (like CLDR) do have to, if ISO continues to change<br>

&gt;&gt; languages into macrolanguages<br>

<br>

</div>If you by that mean that CLDR should try to annulate the effect of<br>

macrolanguages, then I think rather the opposite is the way to go.<br>

Though, of course, every case should be treated independently, I guess.<br>

<div class="im"><br>

&gt;&gt;(will German be next?).<br>

&gt;<br>

&gt; Very improbable, given the description of &#39;de(u)&#39; in Ethnologue.<br>

&gt; Though Ethnologue is no more immune to error than ISO 639 [*], it does<br>

&gt; provide the denotation of the vast majority of 639-3 code elements, and<br>

&gt; <a href="http://www.ethnologue.com/show_language.asp?code=deu" target="_blank">http://www.ethnologue.com/show_language.asp?code=deu</a> is clearly about<br>

&gt; Standard German as spoken from Belgium to Kazakhstan, not any other kind<br>

&gt; of German.  It would be a massive and unnecessary widening of the term<br>

&gt; to turn it into a macrolanguage of any sort.<br>

<br>

</div>But even so, the CLDR issue can be looked at from another angle than<br>

the one Mark does:<br>

<br>

It has been pointed out that CLDR is derived from BCP47 and not the<br>

other way around. Currently the CLDR treats &#39;no&#39; as a synonym of &#39;nb&#39;<br>

(I would like to file a bug against this when I have power to follow it<br>

up ...)  However, just as CLDR this way overrules the effect of a<br>

macrolanguage classification,  it could also go the other way and<br>

effectively treat &#39;de&#39; as a macrolanguage/fallback code for  e.g.<br>

&#39;gsw&#39;. That to me seems as often a quite reasonable think to do. It<br>

would not not need to affect how language tagging should be performed<br>

however - it could remain &quot;forbidden&quot; to tag &quot;gsw&quot; content as &quot;de&quot;.<br>

--<br>

<font color="#888888">leif halvard silli<br>

</font><div><div></div><div class="h5">_______________________________________________<br>

Ietf-languages mailing list<br>

<a href="mailto:Ietf-languages@alvestrand.no">Ietf-languages@alvestrand.no</a><br>

<a href="http://www.alvestrand.no/mailman/listinfo/ietf-languages" target="_blank">http://www.alvestrand.no/mailman/listinfo/ietf-languages</a><br>

</div></div></blockquote></div><br></div></div>