Return-Path: Received: from murder ([unix socket]) by eikenes.alvestrand.no (Cyrus v2.2.8-Mandrake-RPM-2.2.8-4.2.101mdk) with LMTPA; Sun, 10 Apr 2005 04:58:20 +0200 X-Sieve: CMU Sieve 2.2 Received: from localhost (localhost.localdomain [127.0.0.1]) by eikenes.alvestrand.no (Postfix) with ESMTP id 7016661AFB for ; Sun, 10 Apr 2005 04:58:20 +0200 (CEST) Received: from eikenes.alvestrand.no ([127.0.0.1]) by localhost (eikenes.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 13160-06 for ; Sun, 10 Apr 2005 04:58:14 +0200 (CEST) Received: from megatron.ietf.org (megatron.ietf.org [132.151.6.71]) by eikenes.alvestrand.no (Postfix) with ESMTP id A9C8661AF5 for ; Sun, 10 Apr 2005 04:58:13 +0200 (CEST) Received: from localhost.localdomain ([127.0.0.1] helo=megatron.ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1DKSdK-0005vS-K1; Sat, 09 Apr 2005 22:57:10 -0400 Received: from odin.ietf.org ([132.151.1.176] helo=ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1DKSdH-0005vJ-GM for ltru@megatron.ietf.org; Sat, 09 Apr 2005 22:57:07 -0400 Received: from ietf-mx.ietf.org (ietf-mx.ietf.org [132.151.6.1]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id WAA25562 for ; Sat, 9 Apr 2005 22:57:05 -0400 (EDT) Received: from [63.247.76.195] (helo=montage.altserver.com) by ietf-mx.ietf.org with esmtp (Exim 4.33) id 1DKSmN-0003Y3-3I for ltru@ietf.org; Sat, 09 Apr 2005 23:06:31 -0400 Received: from lns-p19-8-idf-82-249-30-81.adsl.proxad.net ([82.249.30.81] helo=jfc.afrac.org) by montage.altserver.com with esmtpa (Exim 4.44) id 1DKSdF-0002eO-K5; Sat, 09 Apr 2005 19:57:06 -0700 Message-Id: <6.1.2.0.2.20050410044608.047d1a70@mail.jefsey.com> X-Sender: jefsey+jefsey.com@mail.jefsey.com X-Mailer: QUALCOMM Windows Eudora Version 6.1.2.0 Date: Sun, 10 Apr 2005 04:55:31 +0200 To: ned.freed@mrochek.com From: "JFC (Jefsey) Morfin" Subject: Re: [Ltru] Compatibility with existing use (LDAP) In-Reply-To: <01LMWG2NCBKW00005R@mauve.mrochek.com> References: <1987416CA83AC7499AC772F92E2DBF78037C9AFC@LONSMSXM02.emea.ime.reuters.com> <001601c53d4e$6a5ab4c0$7f1afea9@oemcomputer> <01LMWG2NCBKW00005R@mauve.mrochek.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - montage.altserver.com X-AntiAbuse: Original Domain - ietf.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - jefsey.com X-Scan-Signature: 8de5f93cb2b4e3bee75302e9eacc33db Cc: ltru Working Group X-BeenThere: ltru@lists.ietf.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Language Tag Registry Update working group discussion list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: ltru-bounces@lists.ietf.org Errors-To: ltru-bounces@lists.ietf.org X-Virus-Scanned: amavisd-new at alvestrand.no Ned, you are most probably right about LDAP and Addison about XML. But this is not the case for _new_ services like OPES (open pluggable edge services: intelligence on the edge of the network, able to change the langtag of an XML page at will before it receives it) or my own long delayed ONES (open network extended services: intelligence (virtually in an end to end architecture) within the network we see coming back. This not true about the architectural solutions to support multilingualism. Even SMTP can be dramatically affected through the charter of the WG-OPES on SMTP support (even if we are quite delayed by the same problem as this WG: to understand how to support at the same time the past, the present and the future). jfc On 03:44 10/04/2005, ned.freed@mrochek.com said: > > As a technical contributor, the longer this thread goes on > > the more consideration I think we should give to alternatives like > > http://www.ietf.org/internet-drafts/draft-lilly-content-script-01.txt > > > If we were to treat the identification of the language(variant), > > the script, and the orthography of a text as three distinct > > attributes, rather than trying to structure them into a single string > > called a "language tag" would it ultimately make things simpler > > or more complicated? > > > I find this approach intuitively appealing in the case where there are > > multiple orthographic alternatives in the same script for a > > given language, yet it has no impact at all on texts in languages > > where identification of the script or orthography in use is of > > no particular interest. Even transcriptions and transliterations can > > be handled nicely into this kind of approach, since they just become > > orthographic alternatives. > > > It would, however, require a re-thinking of how we've handled > > German and Chinese. > >No doubt there's some appeal to this idea, but I don't think it's practical. > >The single biggest implementation problem I see right off is that language >tags >are used in lots of places, at least one of which is not amenable to the >addition of separate script field. I'm referring, of course, to XML. I believe >adding an xml:script field to go along with xml:lang could only be done with a >major revision to the XML base specification. And even if such a change could >be made, getting it deployed would be a real challenge. > >Other cases, like email, would be able to add a content-script: field much >more >easily, of course. Even so, a vast amount of code would have to be changed to >actually make use of this new field. > >LDAP is another interesting case. Language tags in LDAP are actually a >specific >case of a more general mechanism. But AFAIK little use is made of attribute >tags for purposes other than language tagging. This calls into question how >general the mechanism actually implemented by various clients and servers >actually is. > >And even if you were to get the field defined in every place where there's a >field for language tags, you still have to actually make use of the field. >Having two separate fields certainly makes it easy to recognize script >tags and >prevents them from causing problems in the language tag space, but you lose >information relating language to scripts. For example, suppose you have: > > content-language: az, ru, en > content-script: latn, cryl > >It stands to reason the Russian is written in cyrillic and the English in >latin, but what script is the Azerbaijani text written in? > >Of course you could require a 1:1 correspondence between the entries in >the two >fields, but that would then require a way to indicate when the script is >unknown. And tags would have to be repeated since a single script can apply to >multiple languages or vice versa. Ick. This is _seriously_ ugly. And even if >you ignore the ugly, the repetition of tags in language fields is certain to >cause problems to existing software. > >So, much as I would like to eliminate the interop problems the current >proposal for adding script codes causes, I really don't think this is >a viable way to do it. > > Ned > >P.S. My linguistic skills are rudimentary at best, but my recollection is >that >Azerbaijani is a language that can be written in either cyrillic or latin >script. If that's not the case please substitute a language where this is >actually true in the above example. > >_______________________________________________ >Ltru mailing list >Ltru@lists.ietf.org >https://www1.ietf.org/mailman/listinfo/ltru _______________________________________________ Ltru mailing list Ltru@lists.ietf.org https://www1.ietf.org/mailman/listinfo/ltru