Return-Path: Received: from eikenes.alvestrand.no ([unix socket]) by eikenes.alvestrand.no (Cyrus v2.1.11-Mandrake-RPM-2.1.11-1mdk) with LMTP; Thu, 17 Feb 2005 03:28:49 +0100 X-Sieve: CMU Sieve 2.2 Return-Path: Received: from localhost (localhost.localdomain [127.0.0.1]) by eikenes.alvestrand.no (Postfix) with ESMTP id 519B361BE0 for ; Thu, 17 Feb 2005 03:28:49 +0100 (CET) Received: from eikenes.alvestrand.no ([127.0.0.1]) by localhost (eikenes.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 10721-08 for ; Thu, 17 Feb 2005 03:28:46 +0100 (CET) Received: from psg.com (psg.com [147.28.0.62]) by eikenes.alvestrand.no (Postfix) with ESMTP id 3CDD961C14 for ; Thu, 17 Feb 2005 03:28:46 +0100 (CET) Received: from majordom by psg.com with local (Exim 4.44 (FreeBSD)) id 1D1bOF-0005SZ-0I for idn-data@psg.com; Thu, 17 Feb 2005 02:27:39 +0000 Received: from [63.247.74.122] (helo=montage.altserver.com) by psg.com with esmtp (Exim 4.44 (FreeBSD)) id 1D1bOC-0005SK-Lc for idn@ops.ietf.org; Thu, 17 Feb 2005 02:27:36 +0000 Received: from lns-p19-4-idf-82-65-252-32.adsl.proxad.net ([82.65.252.32] helo=jfc.afrac.org) by montage.altserver.com with esmtpa (Exim 4.44) id 1D1bO2-00059Q-2s; Wed, 16 Feb 2005 18:27:26 -0800 Message-Id: <6.1.2.0.2.20050217025055.0cb71eb0@mail.jefsey.com> X-Sender: jefsey+jefsey.com@mail.jefsey.com X-Mailer: QUALCOMM Windows Eudora Version 6.1.2.0 Date: Thu, 17 Feb 2005 03:16:11 +0100 To: "Michel Suignard" , "Martin v. LXwis" From: "JFC (Jefsey) Morfin" Subject: RE: [idn] homograph attacks Cc: "Erik van der Poel" , "William Tan" , "Kane, Pat" , , , "tedd" , In-Reply-To: References: Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - montage.altserver.com X-AntiAbuse: Original Domain - ops.ietf.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - jefsey.com X-Source: X-Source-Args: X-Source-Dir: Sender: owner-idn@ops.ietf.org Precedence: bulk X-Virus-Scanned: by amavisd-new at alvestrand.no X-Amavis-Alert: BAD HEADER Non-encoded 8-bit data (char F6 hex) in message header 'To' To: ...crosoft.com>,\n\t"Martin v. L\366wis" Martin, >I don't think we are so far off. My concern is that many people are >abusing the term 'language' for these tables. Dear Michel, you are absolutely right. The problem is the content of the tag and the term. - IANA TLD Tables are described by language/script/ccTLD from ISO 3166 alpha-2 - languages are defined from ISO 639 - RFC 3066 appointed ietf-languages@alvestrand.no advisory mailing list Members have introduced a "langtag" which also concatenates the same content: language (ISO 639) - Script ISO 15924 - Country (ISO 3166 - therefore missing some ccTLDs). The idea is to document the real language being used with some more accuracy. The term that ISO 639 corresponds to "langues" in French and the proposed langtag to "langage" in French, and what they really need to permit automated interintelligibility would be "parler" in French. This might lead to complex and unnecessary debates. This is why we have adopted the following wording: - langtag for a tag including lingual oriented information. We do not speak of language but of tag. Whatever the use. - lang3tag is internationalization with language/script/localization (localization should be far more complete than just ISO 3066) - lang4tag adds the authoritative reference for the used form of that language (multilingualization) - lang5tag adds the style to document the vernacular environment of usage. >I am just saying that creating exclusive subset of Latin characters in >European context is not necessarily a bad idea but will result in future >problems because they will always discover that few characters are missing >from the subset. > >It is reasonably easy for .de to establish a table as they did and again >it is ok. It is much more challenging for a worlwide TLD such as .com to >establish registration rules. Typically script is a much better selector >than language to establish those tables and associated rules. This has been extensively discussed by the authors of the proposed "RFC 3066 bis" Draft. I disagreed enough with only small but key details, to be credible I think when I say that I fully agree with their need of ISO 639 to define the language (7260 in the current revision), but also the script (you are right this is necessary, but this is not enough for many reasons you described in part) and localization which may change some accepted usages. They identified that the political boarders were the leading descriptor here due to legal obligations, common education, etc. Scripts defined by ISO are also disputed. Better to have an authoritative decision of the TLD Manager as a part of his namespace rules. After all what we want is "normality", so there are cultural, legal, IPR related matters to consider. I understand that M$ has decided to wait and see the way the market reacts. This time I approve M$. jfc