Data on confusables

Gervase Markham gerv at mozilla.org
Tue Jul 28 12:06:19 CEST 2009


On 27/07/09 21:26, Mark Davis ⌛ wrote:
> I don't have a count of domain names. The figures I gave do part of what
> you are asking for:
>
> A. characters allowed by IDNA2008 that are confusable with /at least
> one/ other character allowed by IDNA2008
>
> B.  characters allowed by IDNA200*3* that are confusable with /at least/
> one other character allowed by IDNA200*3* (/and/ not in A)

Forgive me, but I'm having trouble relating these two definitions to the 
numbers in your original post. Could you tell me the values of A and B?

> I'm showing no additional characters in that group; that is, any
> PVALID2008 character with a confusable in PVALID2003 also has a
> confusable in PVALID2008. (The number of other characters that each
> could be confused with does grow, but that doesn't change whether or not
> they can be spoofed.)

OK, that's interesting. It does reinforce the point that registry policy 
still has a large part to play; what we've done is made it easier for 
registries to formulate that policy because they have to consider fewer 
characters.

Gerv


More information about the Idna-update mailing list