<br><font size=2 face="sans-serif">The &quot;zxx&quot; tag started with

my query into how I should classify the &quot;audio content&quot; of a

silent film in a system designed to serve non-silent films where a language

code is required. Peter suggested &quot;zxx = no linguistic content&quot;

and registered it. </font>

<br>

<br><font size=2 face="sans-serif">I felt that it might be better to use

the industry terminology &quot;silent&quot; and employ a free tag in the

&quot;Q&quot; space of ISO 639-2. While there was &quot;no linguistic content&quot;

on that audio channel, there was certainly a plot that could be determined

from watching the film even if the title cards were removed (a &quot;title

card&quot; is an interstitial used to display the text in a silent film).

To describe our wonderful heritage of silent films as having no linguistic

content just seemed a bit cruel. I was willing to go with &quot;not applicable&quot;

but could not recommend the use of &quot;zxx = no linguistic content&quot;

for this purpose.</font>

<br>

<br><font size=2 face="sans-serif">When it was later suggested that &quot;zxx&quot;

should be used to mark up code fragments appearing in a tutorial written

in English, I was even more opposed to the &quot;non-linguistic&quot; semantic.

I wasn't the only one who complained that code -- especially in the context

of a technical tutorial -- is primarily meant to be read by humans, not

machines. An assistive device such as a Braille screenreader would &nbsp;want

to represent that text as language, not skip over it because it's non-linguistic

in nature. Binary junk data is the only thing I can think of that is truly

non-linguistic.</font>

<br>

<br><font size=2 face="sans-serif">Any chance we could broaden the semantic

of the &quot;zxx&quot; tag? I still think we did the wrong thing here and

the &quot;non-applicable&quot; tag is more appropriate for all the use

cases mentioned.</font>

<br>

<br><font size=2 face="sans-serif">&nbsp; &nbsp; &nbsp; &nbsp; http://lists.w3.org/Archives/Public/www-international/2007AprJun/0187.html

-- one previous post on the topic</font>

<br>

<br><font size=2 face="sans-serif">Side note: I find the IETF archives

very hard to search or I could have produced a better example. Am I missing

a search interface somewhere? (Reply offlist.)</font>

<br>

<br><font size=2 face="sans-serif">Regards,</font>

<br>

<br><font size=2 face="sans-serif">Karen Broome</font>

<br>

<br><tt><font size=2>Peter Constable &lt;petercon@microsoft.com&gt; wrote

on 03/14/2008 01:37:30 PM:<br>

<br>

&gt; If “zxx” were “not applicable”, I would not have any reservation

<br>

&gt; about semantic overloading for the application scenarios I have in

<br>

&gt; mind now. Funny, I really have no recollection of you suggesting <br>

&gt; that at that time. (Sorry.)</font></tt>

<br><tt><font size=2>&gt; &nbsp;</font></tt>

<br><tt><font size=2>&gt; &nbsp;</font></tt>

<br><tt><font size=2>&gt; Peter</font></tt>

<br><tt><font size=2>&gt; &nbsp;</font></tt>

<br><tt><font size=2>&gt; From: Karen_Broome@spe.sony.com [mailto:Karen_Broome@spe.sony.com]

<br>

&gt; Sent: Friday, March 14, 2008 12:51 PM<br>

&gt; To: Peter Constable<br>

&gt; Cc: ietf-languages@iana.org<br>

&gt; Subject: RE: ID for language-invariant strings</font></tt>

<br><tt><font size=2>&gt; &nbsp;</font></tt>

<br><tt><font size=2>&gt; <br>

&gt; I can keep restating the point I've made from the beginning. The <br>

&gt; semantic for &quot;zxx&quot; should have been defined as &quot;not

applicable&quot; <br>

&gt; which was the use case presented at the time it was created. Since

<br>

&gt; it was not expressed in this way, now we need another tag, I think.

<br>

&gt; <br>

&gt; Regards, <br>

&gt; <br>

&gt; Karen Broome<br>

&gt; Metadata Systems Designer<br>

&gt; Sony Pictures Entertainment<br>

&gt; 310.244.4384 <br>

&gt; <br>

&gt; ietf-languages-bounces@alvestrand.no wrote on 03/14/2008 08:49:31

AM:<br>

&gt; <br>

&gt; &gt; &gt; From: ietf-languages-bounces@alvestrand.no [mailto:ietf-languages-<br>

&gt; &gt; &gt; bounces@alvestrand.no] On Behalf Of Doug Ewell<br>

&gt; &gt; &gt; Sent: Thursday, March 13, 2008 11:16 PM<br>

&gt; &gt; &gt; To: ietf-languages@iana.org<br>

&gt; &gt; &gt; Subject: Re: ID for language-invariant strings<br>

&gt; &gt; <br>

&gt; &gt; &gt; [&quot;zxx&quot; is] a &quot;less bad&quot; fit than the

other choices:<br>

&gt; &gt; &gt;<br>

&gt; &gt; &gt; zxx - content is not linguistic in nature<br>

&gt; &gt; &gt; und - content is in an undetermined language<br>

&gt; &gt; &gt; mis - content is in an otherwise uncoded language<br>

&gt; &gt; &gt; i-default - content is in a default, fallback language intelligible

to<br>

&gt; &gt; &gt; anglophones<br>

&gt; &gt; &gt;<br>

&gt; &gt; &gt; I agree that inventing a new code element/subtag for this

situation<br>

&gt; &gt; &gt; would be undesirable.<br>

&gt; &gt; <br>

&gt; &gt; If it's less bad, I still think it kind of bad.<br>

&gt; &gt; <br>

&gt; &gt; For instance, suppose I need to apply language tags to each of

the <br>

&gt; &gt; data elements in the main ISO 639-3 code table. For data in columns

<br>

&gt; &gt; like the 639-3 ID, clearly &quot;zxx&quot; applies: the alpha-3

identifiers <br>

&gt; &gt; have no linguistic content. But what about the reference names?

<br>

&gt; &gt; &quot;zxx&quot; would be a decidedly bad choice for that column,

IMO, since <br>

&gt; &gt; every single data element is definitely linguistic in nature.<br>

&gt; &gt; <br>

&gt; &gt; I don't know why people are so adverse to new special-purpose

code <br>

&gt; &gt; elements when there is a reasonable need. It's not like there

are a <br>

&gt; &gt; lot of different special-case semantics that are needed in language-<br>

&gt; &gt; tagging application scenarios; I think the set is very small,

<br>

&gt; &gt; perhaps even that this is the only important gap. I am *far*

more <br>

&gt; &gt; concerned about overloading tags with distinct, orthogonal semantics<br>

&gt; &gt; for particular application scenarios (&quot;und&quot; means X

in this <br>

&gt; &gt; application but Y in that application): *that* can lead to serious

trouble.<br>

&gt; &gt; <br>

&gt; &gt; As I think about this, I'm inclined to propose a new special-purpose<br>

&gt; &gt; ID &quot;zrf&quot; in ISO 639:<br>

&gt; &gt; <br>

&gt; &gt; ID: zxn<br>

&gt; &gt; Reference name: language-neutral content<br>

&gt; &gt; Comment: This ID is provided primarily for application scenarios<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;in which a language identifier

must be declared for<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;content that may be linguistic

in nature but that is<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;used as a language-neutral

identifier to reference or<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;index other information objects.<br>

&gt; &gt; <br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Uses of this code element do

not make any declaration<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;regarding the actual language

of a given data element<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;or of whether a given data

element is, in fact,<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;linguistic in nature.<br>

&gt; &gt; <br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Note: for applications scenarios

in which an identifier<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;string is unambiguously non-linguistic

in nature, &quot;zxx&quot;<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;should be used rather than

&quot;zxn&quot;.<br>

&gt; &gt; <br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;For example, in a database

of coding elements for<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;cultural objects that includes

for each such object a<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;code element such as an alpha-3

string (e.g., &quot;abc&quot;)<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;and a reference name (e.g.,

&quot;PIANO&quot;, &quot;GUQIN&quot;), the<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;language identifier applied

to the code element<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;should be &quot;zxx&quot;,but

&quot;zxn&quot; may be applied to the<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;reference names.<br>

&gt; &gt; <br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Applications may also use &quot;zxn&quot;

for content that is<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Linguistic in nature but that

is represented in a<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Language-neutral form. For

example, the concept 'ten'<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Is linguistic in nature but

can be expressed in the<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Language-neutral form &quot;10&quot;.

Such use of &quot;zxn&quot; should<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;be considered only for application

scenarios that<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;have a particular need; this

usage is not recommended<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;in general. For instance, if

a software application<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;needs to segment the strings

in a document into items<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;that get passed to various

language-specific processes<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;and it must apply a language

identifier to language-<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;neutral content such as numbers

represented as digits,<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;then &quot;zxn&quot; may be

used within that application; but it<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;is not expected that content

authors would apply &quot;zxn&quot;<br>

&gt; &gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;to numbers in their documents

in general.<br>

&gt; &gt; <br>

&gt; &gt; <br>

&gt; &gt; <br>

&gt; &gt; Peter<br>

&gt; &gt; _______________________________________________<br>

&gt; &gt; Ietf-languages mailing list<br>

&gt; &gt; Ietf-languages@alvestrand.no<br>

&gt; &gt; http://www.alvestrand.no/mailman/listinfo/ietf-languages</font></tt>