<font face="times new roman,serif">* doesn't work, because it isn't a valid language tag.</font><div><font face="times new roman,serif"><br></font></div><div><font face="times new roman,serif">> </font><span style="font-family:arial,sans-serif;font-size:13px;background-color:rgb(255,255,255)">The </span><span style="font-family:arial,sans-serif;font-size:13px;background-color:rgb(255,255,255)">proposal here is for a subtag to encode the "elsewhere" condition: "if </span><span style="font-family:arial,sans-serif;font-size:13px;background-color:rgb(255,255,255)">there is no more specific language code, use 'zzz'".</span></div>

<div><font face="arial, sans-serif"><br></font></div><div><font face="arial, sans-serif">We use 'und' to signal, in a query, that the language is unspecified and should be filled in. It is not like 'und', in a query, is needed for any other purpose. The addition of 'zzz' would just mean that we'd map it to 'und' in all processing, so it would not be a useful addition.</font></div>

<div><font face="arial, sans-serif"><br clear="all"></font><font face="'times new roman', serif"><div style="background-color:transparent;margin-top:0px;margin-left:0px;margin-bottom:0px;margin-right:0px"><div><span><br>

<hr></span></div></div><div style="background-color:transparent;margin-top:0px;margin-left:0px;margin-bottom:0px;margin-right:0px"><a href="https://plus.google.com/114199149796022210033" target="_blank">Mark</a></div><div style="background-color:transparent;margin-top:0px;margin-left:0px;margin-bottom:0px;margin-right:0px">

<i><br></i></div><div style="background-color:transparent;margin-top:0px;margin-left:0px;margin-bottom:0px;margin-right:0px"><i>— Il meglio è l’inimico del bene —</i></div></font><div><div><font face="'times new roman', serif"><i><span style="font-style:normal"><i></i></span><i></i></i></font></div>

</div><br>

<br><br><div class="gmail_quote">On Wed, Jun 13, 2012 at 8:03 AM, Gordon P. Hemsley <span dir="ltr"><<a href="mailto:gphemsley@gmail.com" target="_blank">gphemsley@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

FWIW, I think it is important to be able to make the distinction<br>

between "we don't *know* what the language is" ('und' for<br>

"Undetermined") and "we don't *care* what the language is" (the<br>

proposed 'zzz' for "Any").<br>

<br>

In my understanding of the matter, both CLDR and Java (as you've<br>

described them) are using 'und' for the appropriate purpose (though<br>

I'm not sure I necessarily agree with the fallback choice—an<br>

orthogonal issue, in any case).<br>

<br>

As for Google, it seems to me like *querying* for "any language" would<br>

be better off using the asterisk rather than a particular subtag. The<br>

proposal here is for a subtag to encode the "elsewhere" condition: "if<br>

there is no more specific language code, use 'zzz'".<br>

<br>

So I support Peter's proposal. I think the usecase he mentions could<br>

actually be a common one in localized software development.<br>

<br>

Gordon<br>

<div class="HOEnZb"><div class="h5"><br>

On Wed, Jun 13, 2012 at 10:52 AM, Mark Davis ☕ <<a href="mailto:mark@macchiato.com">mark@macchiato.com</a>> wrote:<br>

> We use 'und' in CLDR when doing lookups, for example. The best patch for<br>

> "und-Cyrl" in the absence of other information is "ru-Cyrl-RU".<br>

><br>

> Java also uses 'und' in the BCP47 way, but also as a "replace bad input"<br>

> (like FFFD for Unicode).<br>

><br>

> <a href="http://download.java.net/jdk7/archive/b123/docs/api/java/util/Locale.html" target="_blank">http://download.java.net/jdk7/archive/b123/docs/api/java/util/Locale.html</a><br>

><br>

> At Google, we used to try to distinguish between these different senses of<br>

> "unknown" vs "any", but found that people too often just mixed them up, so<br>

> we ended up just settling on a single subtag. It just has slightly different<br>

> nuances when used as a query vs used as a result (or content tag). But<br>

> that's the case anyway for locale/language matching.<br>

><br>

> ________________________________<br>

> Mark<br>

><br>

> — Il meglio è l’inimico del bene —<br>

><br>

><br>

><br>

> On Wed, Jun 13, 2012 at 7:27 AM, Peter Constable <<a href="mailto:petercon@microsoft.com">petercon@microsoft.com</a>><br>

> wrote:<br>

>><br>

>> Thanks, Doug, for the reminder of that text, which is interesting.<br>

>><br>

>> Root, which is totally unqualified--I.e., 'neutral'-is different. In a<br>

>> matching mechanism that seeks the best match against a preference list, a<br>

>> neutral resource might be chosen in the absence of any other matching<br>

>> resource. This could be used to qualify a resource as a positive match for<br>

>> any entry in the preference list if there isn't a stronger match for that<br>

>> entry.<br>

>><br>

>> Mark, you mentioned using 'und' for some time. Has that been in private or<br>

>> public contexts? (We're looking at something that would be part of the<br>

>> Windows SDK.) And would you say the use was comparable to "root" (which I<br>

>> think is different)?<br>

>><br>

>> Peter<br>

>><br>

>> Sent from my Windows Phone<br>

>> ________________________________<br>

>> From: Doug Ewell<br>

>> Sent: 6/12/2012 5:15 PM<br>

>> To: <a href="mailto:ietf-languages@iana.org">ietf-languages@iana.org</a><br>

>> Subject: Re: lang ID for "*" (any language)<br>

>><br>

>> I tend to agree with Mark that 'und' is the best choice for this.<br>

>><br>

>> The passage in Section 4.1 seems to start off otherwise:<br>

>><br>

>> "The 'und' (Undetermined) primary language subtag identifies linguistic<br>

>> content whose language is not determined.  This subtag SHOULD NOT be<br>

>> used unless a language tag is required and language information is not<br>

>> available or cannot be determined.  Omitting the language tag (where<br>

>> permitted) is preferred."<br>

>><br>

>> but then goes on to give reasonable use cases:<br>

>><br>

>> "The 'und' subtag might be useful for protocols that require a language<br>

>> tag to be provided or where a primary language subtag is required (such<br>

>> as in "und-Latn").  The 'und' subtag MAY also be useful when matching<br>

>> language tags in certain situations."<br>

>><br>

>> On the list we've often talked about, for example, "und-Cyrl" to<br>

>> indicate text in the Cyrillic script. In a case like this, it might not<br>

>> be that the language cannot be determined, but that it doesn't matter.<br>

>><br>

>> I think CLDR uses 'root' for a purpose similar to this.<br>

>><br>

>> --<br>

>> Doug Ewell | Thornton, Colorado, USA<br>

>> <a href="http://www.ewellic.org" target="_blank">http://www.ewellic.org</a> | @DougEwell <br>

>><br>

>> _______________________________________________<br>

>> Ietf-languages mailing list<br>

>> <a href="mailto:Ietf-languages@alvestrand.no">Ietf-languages@alvestrand.no</a><br>

>> <a href="http://www.alvestrand.no/mailman/listinfo/ietf-languages" target="_blank">http://www.alvestrand.no/mailman/listinfo/ietf-languages</a><br>

>><br>

>><br>

>><br>

>><br>

>> _______________________________________________<br>

>> Ietf-languages mailing list<br>

>> <a href="mailto:Ietf-languages@alvestrand.no">Ietf-languages@alvestrand.no</a><br>

>> <a href="http://www.alvestrand.no/mailman/listinfo/ietf-languages" target="_blank">http://www.alvestrand.no/mailman/listinfo/ietf-languages</a><br>

>><br>

><br>

><br>

> _______________________________________________<br>

> Ietf-languages mailing list<br>

> <a href="mailto:Ietf-languages@alvestrand.no">Ietf-languages@alvestrand.no</a><br>

> <a href="http://www.alvestrand.no/mailman/listinfo/ietf-languages" target="_blank">http://www.alvestrand.no/mailman/listinfo/ietf-languages</a><br>

><br>

<br>

<br>

<br>

</div></div><span class="HOEnZb"><font color="#888888">--<br>

Gordon P. Hemsley<br>

<a href="mailto:me@gphemsley.org">me@gphemsley.org</a><br>

<a href="http://gphemsley.org/" target="_blank">http://gphemsley.org/</a> • <a href="http://gphemsley.org/blog/" target="_blank">http://gphemsley.org/blog/</a><br>

</font></span></blockquote></div><br></div>