Browser IDN display policy: opinions sought
Mark Davis ☕
mark at macchiato.com
Sat Dec 10 21:58:41 CET 2011
There is a set of basic levels for mixing scripts set up in
recommend reading the material before that section, however, for
A more comprehensive approach would be to use some of the mixed-script and
whole-script confusable data as described in
http://www.unicode.org/reports/tr39/proposed.html. Note that that data will
grow over time; in particular, I'd expect the Indic confusables to be
fleshed out further. I'd expect it to incorporate some of the work ongoing
in ICANN for identifying confusables, as well.
There is a table of recommended scripts for identifiers in
I wouldn't recommend mixtures of those those with others scripts (like
Latin), or allowing whole-script confusables of other scripts that match
those. That helps to keep out many of the characters that are extremely
confusable with Latin.
I'm pointing to the proposed-update versions of these documents; comments
are welcome on the text. The next meeting of the UTC to consider feedback
will be in February.
*— Il meglio è l’inimico del bene —*
2011/12/10 Paul Hoffman <phoffman at imc.org>
> On Dec 10, 2011, at 10:20 AM, Michel Suignard wrote:
> >> On 10 dec 2011, at 18:26, Paul Hoffman wrote:
> >>> D: Unicode if the label is a single script that is displayable by the
> >>> browser, Punycode otherwise.
> >> +1
> >> With the exceptions for combinations of various scripts and script
> > In fact, mixed scripts are fine and desired in many situations.
> > Think of Romaji in Japan which cannot be confused with Kana and Kanji.
> So in that case it is perfectly OK to have a white list of scripts that can
> be mixed with Latin (which is the typical case).
> Sure, if you think there is a single entity who can make the whitelist of
> scripts that can be combined in a single label.
> I believe that there is not. I would like to be wrong.
> --Paul Hoffman
> Idna-update mailing list
> Idna-update at alvestrand.no
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Idna-update