Return-Path: Received: from murder ([unix socket]) by eikenes.alvestrand.no (Cyrus v2.2.8-Mandrake-RPM-2.2.8-4.2.101mdk) with LMTPA; Tue, 17 May 2005 17:14:13 +0200 X-Sieve: CMU Sieve 2.2 Received: from localhost (localhost.localdomain [127.0.0.1]) by eikenes.alvestrand.no (Postfix) with ESMTP id C761061B53 for ; Tue, 17 May 2005 17:14:13 +0200 (CEST) Received: from eikenes.alvestrand.no ([127.0.0.1]) by localhost (eikenes.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 06836-10 for ; Tue, 17 May 2005 17:14:11 +0200 (CEST) X-Greylist: domain auto-whitelisted by SQLgrey-1.4.8 Received: from unicode.org (unicode.org [69.13.187.164]) by eikenes.alvestrand.no (Postfix) with ESMTP id C162061AF1 for ; Tue, 17 May 2005 17:14:10 +0200 (CEST) Received: from sarasvati.unicode.org (localhost [127.0.0.1]) by unicode.org (8.12.11/8.12.11) with ESMTP id j4HFDMQk024509; Tue, 17 May 2005 10:13:22 -0500 Received: with ECARTIS (v1.0.0; list unicode); Tue, 17 May 2005 10:13:22 -0500 (CDT) Received: from montage.altserver.com (montage.altserver.com [63.247.74.122]) by unicode.org (8.12.11/8.12.11) with ESMTP id j4HAlMbl014443 for ; Tue, 17 May 2005 05:47:22 -0500 Received: from lns-p19-2-idf-82-251-144-38.adsl.proxad.net ([82.251.144.38] helo=jfc.afrac.org) by montage.altserver.com with esmtpa (Exim 4.44) id 1DXzbd-0000xf-1U for unicode@unicode.org; Tue, 17 May 2005 03:47:21 -0700 Message-Id: <6.2.1.2.2.20050517111015.03687580@mail.jefsey.com> X-Mailer: QUALCOMM Windows Eudora Version 6.2.1.2 Date: Tue, 17 May 2005 11:50:29 +0200 To: unicode@unicode.org From: "JFC (Jefsey) Morfin" Subject: Re: what is Latn? In-Reply-To: <002001c55a55$0cfd1600$0701a8c0@bruchner> References: <20050514192842.4095518337D9@postie2.hosting365.ie> <4286B740.10806@doves.demon.co.uk> <6d99d1fd050516110566aa0455@mail.gmail.com> <002001c55a55$0cfd1600$0701a8c0@bruchner> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - montage.altserver.com X-AntiAbuse: Original Domain - unicode.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - jefsey.com X-Source: X-Source-Args: X-Source-Dir: X-archive-position: 19971 X-Approved-By: root@unicode.org X-ecartis-version: Ecartis v1.0.0 Sender: unicode-bounce@unicode.org Errors-To: unicode-bounce@unicode.org X-original-sender: jefsey@jefsey.com Precedence: bulk List-help: List-unsubscribe: List-software: Ecartis version 1.0.0 List-ID: X-List-ID: X-list: unicode X-Virus-Scanned: amavisd-new at alvestrand.no At 22:22 16/05/2005, Philippe Verdy wrote: >If something is wrong here, there's something not documented in ISO15924 >(the list of characters that are considered part of the script). Thank you for this. This is the very point I need to "debug" for several months. A human being can more or less understand "Latn". A computer needs a charset. In calling me names, some may add to the humanity's patrimony but will not address the problem. My understanding is that the proposed solution - but it should be worded out and clarified on several points - by some (I feel there is also strong opposition) is to say: 1. we have languages documented in ISO 639 2. we have political influence on languages, commercial contractual issues and user associations support - to quote a few - on a country basis and ISO 3166 3. we now have a script list with ISO 15924, never mind if it is much precise or not, it is a good level iteration 4. let stabilise an ISO 15924 - ISO 639 - ISO 3166 formal relation, as a common tag, and let use it consistently every where, including defining locales. 5. this way when we quote the tag, we quote the locale. When we have the locale we have all the information you call for (provided the way to to draw letters, but also icones, etc. is included in the locale). At that time a "script" will become a synonym for a "dynamic locale" (what your computer should have loaded as a locale to accomplish the task ahead). If this is the Unicode final idea, it looks a very good idea on the paper. But first it should be spelled out so everyone supporting more or less the idea has the same point of view in here, at ISO, at the IETF etc.. Then debates, on what is missing or not, should not be on specific situations (this language, vs. this script) but on the locales XML/ASN.1 supporting scheme. In the meanwhile transition should be documented because the work is important and will take years while other parallel efforts (TBX, CRC, IDN, etc.) may add to it and existing daily usages can be destabilised. There is then - but this is not the place - issues of methodology and of consistency with usage applications systems, and the user acceptance. Probably in many different areas. Also a problem coming from the character oriented Unicode basis is to insure a continuity with sounds, icons, gestures and procedures. Another problem where Unicode/CLDR can be in a good position to contribute is normalised languages or whatever the name being used for the human/machine compatible/referenced languages or constrained for a purpose (like Basic English) jfc