<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns:p="urn:schemas-microsoft-com:office:powerpoint" xmlns:a="urn:schemas-microsoft-com:office:access" xmlns:dt="uuid:C2F41010-65B3-11d1-A29F-00AA00C14882" xmlns:s="uuid:BDC6E3F0-6DA3-11d1-A2A3-00AA00C14882" xmlns:rs="urn:schemas-microsoft-com:rowset" xmlns:z="#RowsetSchema" xmlns:b="urn:schemas-microsoft-com:office:publisher" xmlns:ss="urn:schemas-microsoft-com:office:spreadsheet" xmlns:c="urn:schemas-microsoft-com:office:component:spreadsheet" xmlns:oa="urn:schemas-microsoft-com:office:activation" xmlns:html="http://www.w3.org/TR/REC-html40" xmlns:q="http://schemas.xmlsoap.org/soap/envelope/" xmlns:D="DAV:" xmlns:x2="http://schemas.microsoft.com/office/excel/2003/xml" xmlns:ois="http://schemas.microsoft.com/sharepoint/soap/ois/" xmlns:dir="http://schemas.microsoft.com/sharepoint/soap/directory/" xmlns:ds="http://www.w3.org/2000/09/xmldsig#" xmlns:dsp="http://schemas.microsoft.com/sharepoint/dsp" xmlns:udc="http://schemas.microsoft.com/data/udc" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:sub="http://schemas.microsoft.com/sharepoint/soap/2002/1/alerts/" xmlns:ec="http://www.w3.org/2001/04/xmlenc#" xmlns:sp="http://schemas.microsoft.com/sharepoint/" xmlns:sps="http://schemas.microsoft.com/sharepoint/soap/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:udcxf="http://schemas.microsoft.com/data/udc/xmlfile" xmlns:wf="http://schemas.microsoft.com/sharepoint/soap/workflow/" xmlns:mver="http://schemas.openxmlformats.org/markup-compatibility/2006" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns:mrels="http://schemas.openxmlformats.org/package/2006/relationships" xmlns:ex12t="http://schemas.microsoft.com/exchange/services/2006/types" xmlns:ex12m="http://schemas.microsoft.com/exchange/services/2006/messages" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 12 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
        {font-family:SimSun;
        panose-1:2 1 6 0 3 1 1 1 1 1;}
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
        {font-family:Tahoma;
        panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
        {font-family:"\@SimSun";
        panose-1:2 1 6 0 3 1 1 1 1 1;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:12.0pt;
        font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
tt
        {mso-style-priority:99;
        font-family:"Courier New";}
span.EmailStyle18
        {mso-style-type:personal-reply;
        font-family:"Calibri","sans-serif";
        color:#1F497D;}
.MsoChpDefault
        {mso-style-type:export-only;}
@page Section1
        {size:8.5in 11.0in;
        margin:1.0in 1.0in 1.0in 1.0in;}
div.Section1
        {page:Section1;}
-->
</style>
<!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang=EN-US link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'>If “zxx” were “not applicable”, I would not have any reservation
about semantic overloading for the application scenarios I have in mind now. Funny,
I really have no recollection of you suggesting that at that time. (Sorry.)<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'><o:p> </o:p></span></p>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'><o:p> </o:p></span></p>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'>Peter<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'><o:p> </o:p></span></p>
<div style='border:none;border-left:solid blue 1.5pt;padding:0in 0in 0in 4.0pt'>
<div>
<div style='border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in'>
<p class=MsoNormal><b><span style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'>From:</span></b><span
style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'>
Karen_Broome@spe.sony.com [mailto:Karen_Broome@spe.sony.com] <br>
<b>Sent:</b> Friday, March 14, 2008 12:51 PM<br>
<b>To:</b> Peter Constable<br>
<b>Cc:</b> ietf-languages@iana.org<br>
<b>Subject:</b> RE: ID for language-invariant strings<o:p></o:p></span></p>
</div>
</div>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal><br>
<span style='font-size:10.0pt;font-family:"Arial","sans-serif"'>I can keep
restating the point I've made from the beginning. The semantic for
"zxx" should have been defined as "not applicable" which
was the use case presented at the time it was created. Since it was not
expressed in this way, now we need another tag, I think.</span> <br>
<br>
<span style='font-size:10.0pt;font-family:"Arial","sans-serif"'>Regards,</span>
<br>
<br>
<span style='font-size:10.0pt;font-family:"Arial","sans-serif"'>Karen Broome<br>
Metadata Systems Designer<br>
Sony Pictures Entertainment<br>
310.244.4384</span> <br>
<br>
<tt><span style='font-size:10.0pt'>ietf-languages-bounces@alvestrand.no wrote
on 03/14/2008 08:49:31 AM:</span></tt><span style='font-size:10.0pt;font-family:
"Courier New"'><br>
<br>
<tt>> > From: ietf-languages-bounces@alvestrand.no
[mailto:ietf-languages-</tt><br>
<tt>> > bounces@alvestrand.no] On Behalf Of Doug Ewell</tt><br>
<tt>> > Sent: Thursday, March 13, 2008 11:16 PM</tt><br>
<tt>> > To: ietf-languages@iana.org</tt><br>
<tt>> > Subject: Re: ID for language-invariant strings</tt><br>
<tt>> </tt><br>
<tt>> > ["zxx" is] a "less bad" fit than the other
choices:</tt><br>
<tt>> ></tt><br>
<tt>> > zxx - content is not linguistic in nature</tt><br>
<tt>> > und - content is in an undetermined language</tt><br>
<tt>> > mis - content is in an otherwise uncoded language</tt><br>
<tt>> > i-default - content is in a default, fallback language
intelligible to</tt><br>
<tt>> > anglophones</tt><br>
<tt>> ></tt><br>
<tt>> > I agree that inventing a new code element/subtag for this
situation</tt><br>
<tt>> > would be undesirable.</tt><br>
<tt>> </tt><br>
<tt>> If it's less bad, I still think it kind of bad.</tt><br>
<tt>> </tt><br>
<tt>> For instance, suppose I need to apply language tags to each of the </tt><br>
<tt>> data elements in the main ISO 639-3 code table. For data in columns </tt><br>
<tt>> like the 639-3 ID, clearly "zxx" applies: the alpha-3
identifiers </tt><br>
<tt>> have no linguistic content. But what about the reference names? </tt><br>
<tt>> "zxx" would be a decidedly bad choice for that column, IMO,
since </tt><br>
<tt>> every single data element is definitely linguistic in nature.</tt><br>
<tt>> </tt><br>
<tt>> I don't know why people are so adverse to new special-purpose code </tt><br>
<tt>> elements when there is a reasonable need. It's not like there are a </tt><br>
<tt>> lot of different special-case semantics that are needed in language-</tt><br>
<tt>> tagging application scenarios; I think the set is very small, </tt><br>
<tt>> perhaps even that this is the only important gap. I am *far* more </tt><br>
<tt>> concerned about overloading tags with distinct, orthogonal semantics</tt><br>
<tt>> for particular application scenarios ("und" means X in this </tt><br>
<tt>> application but Y in that application): *that* can lead to serious
trouble.</tt><br>
<tt>> </tt><br>
<tt>> As I think about this, I'm inclined to propose a new special-purpose</tt><br>
<tt>> ID "zrf" in ISO 639:</tt><br>
<tt>> </tt><br>
<tt>> ID: zxn</tt><br>
<tt>> Reference name: language-neutral content</tt><br>
<tt>> Comment: This ID is provided primarily for application scenarios</tt><br>
<tt>> in which a language identifier must
be declared for</tt><br>
<tt>> content that may be linguistic in
nature but that is</tt><br>
<tt>> used as a language-neutral
identifier to reference or</tt><br>
<tt>> index other information objects.</tt><br>
<tt>> </tt><br>
<tt>> Uses of this code element do not
make any declaration</tt><br>
<tt>> regarding the actual language of a
given data element</tt><br>
<tt>> or of whether a given data element
is, in fact,</tt><br>
<tt>> linguistic in nature.</tt><br>
<tt>> </tt><br>
<tt>> Note: for applications scenarios in
which an identifier</tt><br>
<tt>> string is unambiguously non-linguistic
in nature, "zxx"</tt><br>
<tt>> should be used rather than
"zxn".</tt><br>
<tt>> </tt><br>
<tt>> For example, in a database of coding
elements for</tt><br>
<tt>> cultural objects that includes for
each such object a</tt><br>
<tt>> code element such as an alpha-3
string (e.g., "abc")</tt><br>
<tt>> and a reference name (e.g.,
"PIANO", "GUQIN"), the</tt><br>
<tt>> language identifier applied to the
code element</tt><br>
<tt>> should be "zxx",but
"zxn" may be applied to the</tt><br>
<tt>> reference names.</tt><br>
<tt>> </tt><br>
<tt>> Applications may also use
"zxn" for content that is</tt><br>
<tt>> Linguistic in nature but that is
represented in a</tt><br>
<tt>> Language-neutral form. For example,
the concept 'ten'</tt><br>
<tt>> Is linguistic in nature but can be
expressed in the</tt><br>
<tt>> Language-neutral form
"10". Such use of "zxn" should</tt><br>
<tt>> be considered only for application
scenarios that</tt><br>
<tt>> have a particular need; this usage
is not recommended</tt><br>
<tt>> in general. For instance, if a
software application</tt><br>
<tt>> needs to segment the strings in a
document into items</tt><br>
<tt>> that get passed to various
language-specific processes</tt><br>
<tt>> and it must apply a language
identifier to language-</tt><br>
<tt>> neutral content such as numbers
represented as digits,</tt><br>
<tt>> then "zxn" may be used
within that application; but it</tt><br>
<tt>> is not expected that content authors
would apply "zxn"</tt><br>
<tt>> to numbers in their documents in
general.</tt><br>
<tt>> </tt><br>
<tt>> </tt><br>
<tt>> </tt><br>
<tt>> Peter</tt><br>
<tt>> _______________________________________________</tt><br>
<tt>> Ietf-languages mailing list</tt><br>
<tt>> Ietf-languages@alvestrand.no</tt><br>
<tt>> http://www.alvestrand.no/mailman/listinfo/ietf-languages</tt></span><o:p></o:p></p>
</div>
</div>
</body>
</html>