A proposed solution for descriptions (was: Re: ISO 639 - New item approved - N'Ko)

John Cowan cowan at ccil.org
Sun Jun 11 21:31:27 CEST 2006

Mark Crispin scripsit:

> No &#x sequences should ever appear in an ASCII-only form; nor, for that 
> matter, should ISO 8859-1 codepoints.  

Unfortunately, Ltru was trapped between a rock and a hard place.  It had
source materials using at least a Windows-1252 character repertoire that
had to be reduced to publication in the form of an Internet-Draft, where
only ASCII is acceptable.  Some transformation of the source material
was unavoidable.

The choice taken was to introduce SGML escape sequences into the plain
text of the I-D (and the consequent IANA publication), documented as such.
Now we are considering adding escape-free alternatives, since it is
obvious that escape sequences (as opposed to the Unicode characters they
represent) will not be found by anything but a completely specialized
search engine.

(In particular, Google cannot cope; a search for "natisone dialect nadiza
dialect Provençal" succeeds, but using "Provencal" or "Provençal"
instead fails.)

Doubly unfortunately, this undertaking has opened a free-for-all wherein
all real or imaginary errors (and I take no position on which is which)
of the source material must be corrected on the spot.  We shouldn't be
here and should leave this place at once.

At the end of the Metatarsal Age, the dinosaurs     John Cowan
abruptly vanished. The theory that a single         cowan at ccil.org
catastrophic event may have been responsible        http://www.ccil.org/~cowan
has been strengthened by the recent discovery of
a worldwide layer of whipped cream marking the
Creosote-Tutelary boundary.             --Science Made Stupid

More information about the Ietf-languages mailing list