Last call for ISO 15924-based updates
petercon at microsoft.com
Fri Mar 13 17:35:05 CET 2009
The text element a with acute has two fully-equivalent representations in ISO/IEC 10646, just as John indicated. In terms of encoded characters, one representation has one encoded character while the other representation uses two encoded characters, but they represent one and the same text element.
I don't see what bearing this has on John's explanation of Zinh, however: the combining acute, U+0301, can be combined with characters of various scripts, and so its script property is inherited.
From: ietf-languages-bounces at alvestrand.no [mailto:ietf-languages-bounces at alvestrand.no] On Behalf Of Keld Jørn Simonsen
Sent: Friday, March 13, 2009 7:26 AM
To: Doug Ewell
Cc: ietf-languages at iana.org
Subject: Re: Last call for ISO 15924-based updates
On Thu, Mar 12, 2009 at 06:46:24PM -0600, Doug Ewell wrote:
> John Cowan <cowan at ccil dot org> wrote:
> > The whole point of the Zinh code is to signal that the diacritic
> > changes its script depending on the diacriticized letter. The acute
> > accent, for example, has no script of its own; it is understood as a
> > Latin accent when placed on a Latin letter, but as a Greek accent when
> > placed on a Greek letter.
> What Gérard may or may not be aware of, and what powers this entire
> explanation, is that in Unicode, a diacriticized letter may be
> represented as two encoded characters, one for the base letter and one
> for the diacritic. For example, "a with acute" may be encoded as
> U+00E1, or it may be encoded as U+0061 plus U+0301. In the second case,
> the detached acute accent U+0301 would have the "inherited script"
W should rather use the ISO standards here, and we do in IETF.
In ISO 10646 the character "a with acute" can only be represented
in one way. namely as U00E1. The other string you are citing is two
characters in ISO 10646 (and not the "a with acute" character).
Ietf-languages mailing list
Ietf-languages at alvestrand.no
More information about the Ietf-languages