Assignment of private use language subtags
petercon at microsoft.com
Tue Jul 1 13:16:34 CEST 2014
The über point for Gordon Hemsley is that anybody is welcome to use private-use identifiers, but run the risk that their use of a private-use identifier may conflict with someone else's use. Caveat emptor!
From: ietf-languages-bounces at alvestrand.no [mailto:ietf-languages-bounces at alvestrand.no] On Behalf Of Anthony Aristar
Sent: Friday, May 9, 2014 12:43 PM
To: ietf-languages at alvestrand.no
Subject: Re: Assignment of private use language subtags
As the person who ran MultiTree until last year when I retired, let me make some comments on these codes:
1. The fact that MultiTree uses a local use code should not be taken as evidence that the MultiTree project believes that the language exists.
The system it uses requires that every language entered into its system have a unique 3-character code, and since the project is designed to collect all hypotheses proposed about language relationships, even those which are wrong, it sometimes assigns a code to a language that does not and has never existed.
2. Some of the codes listed on the Wikipedia page are out of date even on the MultiTree site, and are either not in use any more, or are going to be removed shortly.
3. The local use codes collected on the Wikipedia page are only a small subset of the codes MultiTree uses. The local use space was found to be nowhere extensive enough for all the languages which had to be coded, so most of the codes it now assigns are alphanumeric. So, for example, Puruha was assigned the code qq3, and Old Portuguese the code 079. It might be a good thing to get an updated list of all these codes at some point.
On 5/8/2014 10:01 AM, Doug Ewell wrote:
> Gordon P. Hemsley <me at gphemsley dot org> wrote:
>> I was just informed about the existence of this mapping of private
>> use language subtags to languages, maintained by LINGUIST List and
>> used by
> Thanks for the link. I hadn't seen it before, and still can't figure
> out how to get this list directly from Linguist List or MultiTree.
>> Does anyone have any opinions as to the importance of this list, and
>> how much others should be concerned about clashing with these
>> assignments on the wire? (I know there are similar lists maintained
>> for the Unicode private use area that have some importance for
>> unencoded script users.)
>> How safe is it to ignore these assignments?
> I agree with the comparison to the ConScript Unicode Registry
> (http://www.evertype.com/standards/csur/) and similar Unicode PUA
> allocations. BCP 47 users might do well to honor or avoid (as
> appropriate) these private assignments -- particularly if they gain
> traction outside of Linguist List -- but they are under no obligation
> to do so, and BCP 47 should not formalize this arrangement.
> The quality of the list is debatable, and discussion of that is left
> for another message (maybe). That said, since it appears in a
> Wikipedia page (maintained almost exclusively by one person), it would
> be a welcome improvement if Wikipedia authors and editors were to move
> toward using code elements from this list, compared to making up
> non-private code elements as they go along. The Wikipedia page doesn't
> specify this as a goal, though.
> Doug Ewell | Thornton, CO, USA
> http://ewellic.org | @DougEwell
> Ietf-languages mailing list
> Ietf-languages at alvestrand.no
Ietf-languages mailing list
Ietf-languages at alvestrand.no
More information about the Ietf-languages