No subject


Sun Dec 6 01:12:07 CET 2009


Those include -- and in a far from exhaustive list - the following
characters:

Basic Latin - *ASCII punctuation and symbols*:
9U+0021<character.jsp?a=3D0021> ( ! )
EXCLAMATION MARK
U+0026 <character.jsp?a=3D0026> ( & ) AMPERSAND
U+002A <character.jsp?a=3D002A> ( * ) ASTERISK
U+002C <character.jsp?a=3D002C> ( , ) COMMA
U+002E <character.jsp?a=3D002E> ( . ) FULL STOP
U+002F <character.jsp?a=3D002F> ( / ) SOLIDUS
U+003A <character.jsp?a=3D003A> ( : ) COLON
U+003F <character.jsp?a=3D003F> ( ? ) QUESTION MARK
General Punctuation - *Dashes*: 1U+2014 <character.jsp?a=3D2014> ( =E2=80=
=94 ) EM DASH
General Punctuation - *General punctuation*: 5U+2018
<character.jsp?a=3D2018> ( =E2=80=98 )
LEFT SINGLE QUOTATION MARK
U+2019 <character.jsp?a=3D2019> ( =E2=80=99 ) RIGHT SINGLE QUOTATION MARK
U+201C <character.jsp?a=3D201C> ( =E2=80=9C ) LEFT DOUBLE QUOTATION MARK
U+201D <character.jsp?a=3D201D> ( =E2=80=9D ) RIGHT DOUBLE QUOTATION MARK
U+2022 <character.jsp?a=3D2022> ( =E2=80=A2 ) BULLET

This is just an indication of the kinds of things that people would want in
IDNs but can't be there. (Clearly the ASCII can't be.)
Mark


On Mon, Dec 7, 2009 at 12:11, Shawn Steele <Shawn.Steele at microsoft.com>wrot=
e:

> > You wouldn't know, so don't start from upper case sigma. :)
>
> Ah, that's the trick.  AFAICT various marketing departments don't
> particularly care what DNS does, they are just used to certain forms of
> names, so the name they wanted plastered on the side of a bus might be
> all-caps.  (Although if I were a CamelCased company I don't think that'd =
be
> my first choice).
>
> So, for companies that might want to do this, it seems they might just
> register both forms.  Since the actual "correct" form is clear, I don't
> think it matters if lookup disallowed a distinction between the two.
>
> - Shawn
>
>

--001636e1e9f3bd7bac047a2a7431
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Interestingly, I have it on good authority that in terms of trademarks, esz=
ett and ss are treated the same -- that is, in trademark lingo, a phrase th=
at only differed by exchanging these characters from a trademark would infr=
inge that trademark.<div>
<br></div><div>As a side note, there are many, many English trademarks that=
 cannot be expressed literally as IDNs. Just a small sampling:<div><br></di=
v><div>Arby=E2=80=99s</div><div>Baker=E2=80=99s Choice</div><div>Ben &amp; =
Jerry=E2=80=99s</div>
<div>Lands=E2=80=99=C2=A0End</div><div>Spray =E2=80=99n Wash</div><div><spa=
n class=3D"Apple-style-span" style=3D"font-family: Arial; color: rgb(26, 26=
, 26); line-height: 17px; ">Uncle Ben=E2=80=99s</span></div><div><div><div>=
Wendy=E2=80=99s</div><div>...</div><div>
<br></div><div>From the=C2=A0INTERNATIONAL TRADEMARK ASSOCIATION=E2=80=99S =
=C2=A0TRADEMARK CHECKLIST</div><div><br></div><div>Those include -- and in =
a far from exhaustive list - the following characters:</div><div><br></div>=
<div><span class=3D"Apple-style-span" style=3D"font-family: Times; "><form =
name=3D"myform">
<h3 style=3D"margin-top: 0.5em; margin-bottom: 0.5em; background-color: rgb=
(238, 238, 238); "><span class=3D"Apple-style-span" style=3D"font-size: sma=
ll;">Basic Latin -=C2=A0</span><i><span class=3D"Apple-style-span" style=3D=
"font-size: small;">ASCII punctuation and symbols</span></i><span class=3D"=
Apple-style-span" style=3D"font-size: small;">: 9</span></h3>
<code><a target=3D"c" href=3D"character.jsp?a=3D0021">U+0021</a></code>=C2=
=A0(=C2=A0!=C2=A0) EXCLAMATION MARK<br><code><a target=3D"c" href=3D"charac=
ter.jsp?a=3D0026">U+0026</a></code>=C2=A0(=C2=A0&amp;=C2=A0) AMPERSAND<font=
 class=3D"Apple-style-span" face=3D"monospace"><br>
</font><code><a target=3D"c" href=3D"character.jsp?a=3D002A">U+002A</a></co=
de>=C2=A0(=C2=A0*=C2=A0) ASTERISK<br><code><a target=3D"c" href=3D"characte=
r.jsp?a=3D002C">U+002C</a></code>=C2=A0(=C2=A0,=C2=A0) COMMA<br><code><a ta=
rget=3D"c" href=3D"character.jsp?a=3D002E">U+002E</a></code>=C2=A0(=C2=A0.=
=C2=A0) FULL STOP<br>
<code><a target=3D"c" href=3D"character.jsp?a=3D002F">U+002F</a></code>=C2=
=A0(=C2=A0/=C2=A0) SOLIDUS<br><code><a target=3D"c" href=3D"character.jsp?a=
=3D003A">U+003A</a></code>=C2=A0(=C2=A0:=C2=A0) COLON<br><code><a target=3D=
"c" href=3D"character.jsp?a=3D003F">U+003F</a></code>=C2=A0(=C2=A0?=C2=A0) =
QUESTION MARK<br>
<h3 style=3D"margin-top: 0.5em; margin-bottom: 0.5em; background-color: rgb=
(238, 238, 238); "><span class=3D"Apple-style-span" style=3D"font-size: sma=
ll;">General Punctuation -=C2=A0</span><i><span class=3D"Apple-style-span" =
style=3D"font-size: small;">Dashes</span></i><span class=3D"Apple-style-spa=
n" style=3D"font-size: small;">: 1</span></h3>
<code><a target=3D"c" href=3D"character.jsp?a=3D2014">U+2014</a></code>=C2=
=A0(=C2=A0=E2=80=94=C2=A0) EM DASH<br><h3 style=3D"margin-top: 0.5em; margi=
n-bottom: 0.5em; background-color: rgb(238, 238, 238); "><span class=3D"App=
le-style-span" style=3D"font-size: small;">General Punctuation -=C2=A0</spa=
n><i><span class=3D"Apple-style-span" style=3D"font-size: small;">General p=
unctuation</span></i><span class=3D"Apple-style-span" style=3D"font-size: s=
mall;">: 5</span></h3>
<code><a target=3D"c" href=3D"character.jsp?a=3D2018">U+2018</a></code>=C2=
=A0(=C2=A0=E2=80=98=C2=A0) LEFT SINGLE QUOTATION MARK<br><code><a target=3D=
"c" href=3D"character.jsp?a=3D2019">U+2019</a></code>=C2=A0(=C2=A0=E2=80=99=
=C2=A0) RIGHT SINGLE QUOTATION MARK<br><code><a target=3D"c" href=3D"charac=
ter.jsp?a=3D201C">U+201C</a></code>=C2=A0(=C2=A0=E2=80=9C=C2=A0) LEFT DOUBL=
E QUOTATION MARK<br>
<code><a target=3D"c" href=3D"character.jsp?a=3D201D">U+201D</a></code>=C2=
=A0(=C2=A0=E2=80=9D=C2=A0) RIGHT DOUBLE QUOTATION MARK<br><code><a target=
=3D"c" href=3D"character.jsp?a=3D2022">U+2022</a></code>=C2=A0(=C2=A0=E2=80=
=A2=C2=A0) BULLET<br><font class=3D"Apple-style-span" face=3D"arial"><br>
This is just an indication of the kinds of things that people would want in=
 IDNs but can&#39;t be there. (Clearly the ASCII can&#39;t be.)</font></for=
m></span></div></div><div>Mark<br>
<br><br><div class=3D"gmail_quote">On Mon, Dec 7, 2009 at 12:11, Shawn Stee=
le <span dir=3D"ltr">&lt;<a href=3D"mailto:Shawn.Steele at microsoft.com">Shaw=
n.Steele at microsoft.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_=
quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1=
ex;">
<div class=3D"im">&gt; You wouldn&#39;t know, so don&#39;t start from upper=
 case sigma. :)<br>
<br>
</div>Ah, that&#39;s the trick. =C2=A0AFAICT various marketing departments =
don&#39;t particularly care what DNS does, they are just used to certain fo=
rms of names, so the name they wanted plastered on the side of a bus might =
be all-caps. =C2=A0(Although if I were a CamelCased company I don&#39;t thi=
nk that&#39;d be my first choice).<br>

<br>
So, for companies that might want to do this, it seems they might just regi=
ster both forms. =C2=A0Since the actual &quot;correct&quot; form is clear, =
I don&#39;t think it matters if lookup disallowed a distinction between the=
 two.<br>

<font color=3D"#888888"><br>
- Shawn<br>
<br>
</font></blockquote></div><br></div></div></div>

--001636e1e9f3bd7bac047a2a7431--


More information about the Idna-update mailing list