Language Subtag Registration

Michael Everson everson at evertype.com
Thu Oct 29 06:24:12 CET 2015


On 28 Oct 2015, at 22:45, Felix Sasaki <fsasaki at w3.org> wrote:
> 
>> Perhaps this can be finessed. Either we say “Wikipedia Simple Language Version” with en as the prefix adding fr or de or ru later, or to keep “Wikipedia Simple English” and add “Wikipedia Simple French” etc later at need. 
> 
> The issue is that the notion of simple language may differ severely among different wikipedia language version.

What, in morphology, vocabulary, and syntax? Sure: they’d be forms of distinct languages. Hence the prefix. 

> So if the purpose of the extension is to cover wikipedia simple english this should be made explicit in the subtag itself and not in the prefix. And there may then be a need later to create other subtags for other wikipedia language versions.

The prefix proposed is en, since as yet there are no fr, de, or ru Simple Wikipedias, though these have been discussed. The subtag proposed is wpsimple because for any Simple Wikipedia there will be house-style guidelines which define the content. 

Less precise than Basic English, for example. But nevertheless, defined and implemented. 

> My point is that each community behind a selected wikipedia language version will likely say: we want our own language (subtag) identifier. 

I don’t see why you make this assumption. If an eventual Simple French Wikipedia were implemented, the Language Committee would simply tell them “Your prefix will be fr-wpsimple.” There would be no need for such a community to apply for a subtag. 

> The generalization of simple language to cover several language versions is problematic, like the generalization of sign languages was problematic (and is now an approach of the past).

Scouse differs from standard English by having a set of lexical and phonological differences. Simple English differs from standard English by being defined and implemented according to certain defined strictures.  Both should have en- and both a subtag. 

The sign language generalization is handy for librarians attempting to catalogue a class of items. That’s a different thing from what we’re doing with data, true.

Michael


More information about the Ietf-languages mailing list