<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html; charset=UTF-8" http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
Patrik,<br>
<br>
I see that Mark responded on this thread, but didn't actually answer
the question.<br>
<br>
For IDNA 2008 purposes, the relevant point to look at is Section B
of RFC 5892,<br>
not Section A.<br>
<br>
All twelve of these characters are superscript or subscript
characters which have<br>
compatibility decompositions to single letters. Because of this,
they are all<br>
"unstable" by the criterion in Section B. As a result they are all
DISALLOWED<br>
in IDNA 2008 (of whatever vintage) and will stay that way, because
of the<br>
Unicode normalization stability guarantees.<br>
<br>
Changing their General Category values from gc=Ll to gc=Lm has no
impact<br>
whatsoever on the bottom line of whether these twelve characters are<br>
allowed in IDN's. (They aren't.)<br>
<br>
--Ken<br>
<br>
On 4/4/2011 7:43 AM, Mark Davis ☕ wrote:
<blockquote
cite="mid:BANLkTi=J8WE0gUCWRiE=cz0ZSC9S2Gz8cQ@mail.gmail.com"
type="cite"><font face="georgia,serif">That was one of the
considerations in the discussion; the effect on identifiers
(IDNA and others).</font>
<div><font face="georgia,serif"><br clear="all">
</font><font face="georgia, serif">Mark<br>
<i></i></font></div>
</blockquote>
<br>
<blockquote
cite="mid:BANLkTi=J8WE0gUCWRiE=cz0ZSC9S2Gz8cQ@mail.gmail.com"
type="cite">
<div>2011/4/4 Patrik Fältström <span dir="ltr"><<a
moz-do-not-send="true" href="mailto:patrik@frobbit.se">patrik@frobbit.se</a>></span><br>
<div class="gmail_quote">
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt
0.8ex; border-left: 1px solid rgb(204, 204, 204);
padding-left: 1ex;">
I also would like to get a firm response from Unicode people
as well, BUT, by just quickly looking at the change, I can
only see the change gc=Ll to gc=Lm be something that have to
do with IDNA2008.<br>
<br>
And as rule A of IDNA2008 is the following:<br>
<br>
A: General_Category(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc}<br>
<br>
...i.e. both Ll and Lm are accepted, this change should NOT
have any impact on IDNA2008.<br>
<br>
So I am not as worried as I was when I first saw that Gc was
proposed to be changed for twelve(!) characters!!!<br>
<font color="#888888"><br>
Patrik<br>
</font>
<div>
<div class="h5"><br>
</div>
</div>
</blockquote>
</div>
</div>
</blockquote>
<br>
</body>
</html>