IDNA2008 test vectors

Thu Jan 27 03:12:25 CET 2011

Mark

*— Il meglio è l’inimico del bene —*

On Tue, Jan 25, 2011 at 14:37, Simon Josefsson <simon at josefsson.org> wrote:

> Mark Davis ☕ <mark at macchiato.com> writes:
>
> > For the next version of UTS46, the UTC is introducing additional fields in
> > its test files that indicate whether the strings are valid in IDNA2008, so
> > you could take a look at those (currently in draft state).
> >
> > http://www.unicode.org/review/#pri177
> >
> > The draft test file is at
> > http://www.unicode.org/Public/idna/6.0.1/IdnaTest.txt
> >
> > You would skip the lines starting with T, since those are only relevant for
> > transitional implementations. If you use the files and find any issues, let
> > me know and I can funnel the feedback back to the UTC.
>
> Thank you, I'm now going through these against my implementation.
> However, shouldn't I also ignore the toUnicode column for all B tests?
>

The B lines are valid for both T and N, so you should include them.

>
> That is, for an IDNA2008 implementation with no mapping support, the
> useful test vectors are the toASCII columns of all lines, and all
>

You would ignore any line with an error in the toUnicode column, or that has
T in the Type column.

With no mapping, you wouldn't test the Source column value. What you'd do is
check that the toASCII and toUnicode values map back and forth to each other.
(However, you would _use_ the value in the Source column if either were
empty.)
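
As a rough illustration of the selection rule above, here is a minimal Python
sketch. It rests on assumptions that are not stated in this thread: that each
data line has the form "type; source; toUnicode; toASCII", that "#" starts a
comment, and that an error is written as a bracketed code such as [A4_2]. The
function name select_vectors is made up for the example; check the header of
the draft file for the actual format before relying on it.

```python
def select_vectors(lines):
    """Yield (to_unicode, to_ascii) pairs for a no-mapping IDNA2008 test run.

    Assumed (hypothetical) line format: type; source; toUnicode; toASCII
    with '#' starting a comment and errors written as '[...]'.
    """
    for raw in lines:
        # Drop trailing comments and surrounding whitespace (assumes '#'
        # never appears unescaped inside a data field).
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        fields = [f.strip() for f in line.split(";")]
        if len(fields) < 4:
            continue
        kind, source, to_unicode, to_ascii = fields[:4]
        # T lines are only relevant for transitional implementations.
        if kind == "T":
            continue
        # Ignore any line with an error in the toUnicode column.
        if to_unicode.startswith("["):
            continue
        # An empty column falls back to the Source value.
        to_unicode = to_unicode or source
        to_ascii = to_ascii or source
        # A toASCII error, if any, is passed through for the caller to handle.
        yield to_unicode, to_ascii
```

Each pair it yields can then be checked in both directions: the toUnicode
value should convert to the toASCII value, and vice versa.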

> columns of the N lines.
>
> /Simon