Mismatch between ISO 639-2 and ISO 639-3
ISO639-3
ISO639-3 at sil.org
Wed Feb 17 23:33:08 CET 2010
Tomorrow morning, I expect. I have made the updated files, but our webmaster apparently has left for the day.
-Joan
Joan Spanne
SIL International
ISO 639-3 Registration Authority
iso639-3 at sil.org
From: mark.edward.davis at gmail.com [mailto:mark.edward.davis at gmail.com] On Behalf Of Mark Davis ?
Sent: Wednesday, February 17, 2010 4:27 PM
To: ISO639-3
Cc: ietf-languages at iana.org; iso639-2 at loc.gov
Subject: Re: Mismatch between ISO 639-2 and ISO 639-3
Thanks for the very speedy reply. Do you have a sense of when http://www.sil.org/iso639-3/iso-639-3_xxxxxxxx.tab will be updated?
Mark
On Wed, Feb 17, 2010 at 14:10, ISO639-3 <ISO639-3 at sil.org> wrote:
Mark,
[scr] and [scc] have been deprecated (I think that is the proper term) now in Part 2. I thought I had removed them from the published tables.
There is now no distinct Part 2B identifier in these two cases. There is now only the one identifier for each.
Thank you for bringing it to light.
Joan Spanne
SIL International
ISO 639-3 Registration Authority
iso639-3 at sil.org
From: ietf-languages-bounces at alvestrand.no [mailto:ietf-languages-bounces at alvestrand.no] On Behalf Of Mark Davis ?
Sent: Wednesday, February 17, 2010 3:31 PM
To: ietf-languages at iana.org; ISO639-3 at sil.org; iso639-2 at loc.gov
Subject: Mismatch between ISO 639-2 and ISO 639-3
There is a discrepancy between ISO 639-2 and ISO 639-3 for certain three-letter codes. In particular, the bibliographic codes for Croatian and Serbian are hrv and srp respectively in ISO 639-2, whereas ISO 639-3 has them as scr and scc respectively. (There is also a mismatch in the '3-letter code' for "sh", but that is less important.)
My question is: which one should we believe: ISO 639-2 or ISO 639-3?
Background
For ISO 639-3, we have the following:
http://www.sil.org/iso639-3/iso-639-3_20100202.tab
Id
Part2B
Part2T
Part1
Scope
Language_Type
Ref_Name
Comment
hrv
scr
hrv
hr
I
L
Croatian
srp
scc
srp
sr
I
L
Serbian
hbs
sh
M
L
Serbo-Croatian
Code element for 639-1 has been deprecated
On the other hand, we find the following in
http://www.loc.gov/standards/iso639-2/php/code_list.php
which matches http://www.loc.gov/standards/iso639-2/ISO-639-2_utf-8.txt
O 639-2 Code
ISO 639-1 Code
English name of Language
French name of Language
...
hrv
hr
Croatian
croate
...
srp
sr
Serbian
serbe
...
Differences between B and T are indicated in the first column, where they exist, such as:
wel (B)
cym (T)
cy
Welsh
gallois
If there are no such differences, then B and T are the same. So this says that the B forms are hrv and srp respectively, whereas ISO 639-3 has them as scr and scc respectively.
For ISO 639-2, "sh" is in a different file (there doesn't seem to be a machine-readable form).
http://www.loc.gov/standards/iso639-2/php/code_changes.php
[-sh]
(none)
Serbo-Croatian
serbo-croate
2000-02-18
Dep
This code was deprecated...
In this case, the 3-letter code is indicated as none, while ISO 639-3 has it as hbs. This is not as problematic, because 'sh' has a funny status in 639-3.
Mark
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.alvestrand.no/pipermail/ietf-languages/attachments/20100217/487a960c/attachment-0001.htm
More information about the Ietf-languages
mailing list