Comments on draft-ietf-idnabis-defs-10

Vint Cerf vint at google.com
Mon Aug 31 08:16:47 CEST 2009


Mark,

I tried converting uppercased Unicode into Punycode and got only  
lowercased results.

Maybe I used a bad implementation of Punycode?
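
One way to check an implementation is to run the examples quoted below through it. The following is only a sketch: it assumes Python's standard-library "punycode" codec, which implements raw Punycode with no xn-- prefix and no case mapping, and the helper names to_punycode and from_punycode are hypothetical. It is not the implementation used in my test above. If a tool returns only lowercased output, it may be applying a case-mapping step before the Punycode step, though that is a guess.

```python
# Sketch only: uses Python's built-in "punycode" codec (raw Punycode,
# no xn-- prefix, no case mapping). The helper names are hypothetical.

def to_punycode(text: str) -> str:
    """Encode a Unicode string to raw Punycode, returned as an ASCII str."""
    return text.encode("punycode").decode("ascii")


def from_punycode(encoded: str) -> str:
    """Decode a raw Punycode ASCII string back to Unicode."""
    return encoded.encode("ascii").decode("punycode")


if __name__ == "__main__":
    # Round-trip the labels quoted in this thread; ASCII letters keep their case.
    for label in ("öbb", "ÖBB", "I♥NY"):
        encoded = to_punycode(label)
        print(label, "=>", encoded, "=>", from_punycode(encoded))

    # Decoding an uppercased encoded form keeps the uppercase ASCII letters.
    print(from_punycode("BCHER-KVA"))
```

With this codec the ASCII letters are copied through unchanged in both directions, which matches the behaviour described in the quoted messages.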

v

On Aug 30, 2009, at 8:13 PM, Mark Davis ⌛ wrote:

> Yes, the best way to think of Punycode is as a reversible encoding  
> of any Unicode string, where the encoding only uses bytes that  
> correspond to certain ASCII characters. Any case mapping, case or  
> character restrictions, or prefixing (xn--) is external to Punycode  
> proper.
>
> Here are some examples.
>
> //Text => [Punycode] => Text
>
> öbb => [bb-eka] => öbb
> ÖBB => [BB-nha] => ÖBB
> I♥NY => [INY-zx5a] => I♥NY
>
> Mark
>
>
> 2009/8/30 James Mitchell <james.mitchell at ausregistry.com.au>
> Punycode encoding copies all ASCII characters verbatim; all U-labels  
> containing at least one ASCII letter will exhibit this  
> behaviour.
>
> For example, xn--bcher-kva decodes to bücher, whereas XN--BCHER-KVA  
> decodes to BüCHER.
>
> James
>
> > -----Original Message-----
> > From: idna-update-bounces at alvestrand.no [mailto:idna-update-
> > bounces at alvestrand.no] On Behalf Of Vint Cerf
> > Sent: Monday, 31 August 2009 6:28 AM
> > To: Patrik Fältström
> > Cc: Wil Tan; Andrew Sullivan; idna-update at alvestrand.no; John C Klensin
> > Subject: Re: Comments on draft-ietf-idnabis-defs-10
> >
> > You should not be able to reproduce this if I have understood the
> > encoding/decoding algorithm. I thought it treated upper and lower case
> > ASCII as equal value for purposes of encoding and decoding. This was
> > to allow upper/lower case ASCII to be case insensitive in the
> > Punycode algorithm, I believe.
> >
> > v
> >
> > On Aug 30, 2009, at 3:54 PM, Patrik Fältström wrote:
> >
> > > On 30 aug 2009, at 20.22, Wil Tan wrote:
> > >
> > >> However, if certain characters in an A-label have
> > >> been uppercased, the Punycode decoding algorithm (due to its
> > >> mixed-case annotation feature) may produce an invalid U-label
> > >> because the ASCII characters will be in capital letter form.
> > >
> > > Can you give examples of when this happens? I cannot see this
> > > result in the Punycode algorithm. I also just did some tests with
> > > mixed ASCII, Latin and Chinese characters, and cannot reproduce it.
> > >
> > >   Patrik
> > >
> > > _______________________________________________
> > > Idna-update mailing list
> > > Idna-update at alvestrand.no
> > > http://www.alvestrand.no/mailman/listinfo/idna-update
