[Unicode Announcement] New Public Review Issue #147: Proposed Deprecation of U+0673 ARABIC LETTER ALEF WITH WAVY HAMZA BELOW

Erik van der Poel erikv at google.com
Mon Jun 1 23:25:50 CEST 2009


2009/6/1 John C Klensin <klensin at jck.com>:
>
>
> --On Monday, June 01, 2009 12:52 -0700 Erik van der Poel
> <erikv at google.com> wrote:
>
>> Removed ietf at ietf.org
>>
>> I did not find any occurrences in a large sample of IDNs on
>> the Web.
>
> Out of curiosity, how many IDNs did you find that used
>
>        --Arabic characters at all?
>
>        --Arabic characters that could reasonably be seen as
>        "decorated" (e.g., combining a base character with a
>        combining character, whether expressed that way in
>        Unicode or not)?
>
> If the answer to either question is in the range of "few", I'm
> not sure that any inference from the inability to find that one
> particular character construction tells us very much.

These numbers are percentages, and I've included the regular ASCII dot
U+002E (which occurs in 100% of domain names in this sample from the
Web) and Latin small e U+0065:

00002E 100.00000%
000065 66.53629%
00060C 0.00000%
00061B 0.00000%
00061F 0.00000%
000621 0.00004%
000622 0.00001%
000623 0.00009%
000624 0.00000%
000625 0.00001%
000626 0.00001%
000627 0.00150%
000628 0.00118%
000629 0.00010%
00062A 0.00025%
00062B 0.00001%
00062C 0.00007%
00062D 0.00008%
00062E 0.00005%
00062F 0.00014%
000630 0.00001%
000631 0.00025%
000632 0.00005%
000633 0.00116%
000634 0.00026%
000635 0.00005%
000636 0.00004%
000637 0.00003%
000638 0.00001%
000639 0.00027%
00063A 0.00002%
000640 0.00000%
000641 0.00012%
000642 0.00006%
000643 0.00012%
000644 0.00133%
000645 0.00128%
000646 0.00026%
000647 0.00007%
000648 0.00107%
000649 0.00004%
00064A 0.00035%
00064B 0.00000%
00064C 0.00000%
00064D 0.00000%
00064E 0.00000%
00064F 0.00000%
000650 0.00000%
000651 0.00000%
000652 0.00000%
000660 0.00000%
000661 0.00000%
000664 0.00000%
000665 0.00000%
000666 0.00000%
000667 0.00000%
000668 0.00000%
000669 0.00000%
00066C 0.00000%
00066D 0.00000%
000679 0.00000%
00067B 0.00000%
00067E 0.00001%
000686 0.00000%
000688 0.00000%
000698 0.00000%
0006A9 0.00002%
0006AF 0.00002%
0006BE 0.00000%
0006C1 0.00000%
0006C8 0.00000%
0006CC 0.00005%
0006D4 0.00000%
0006E9 0.00000%
0006F0 0.00000%
0006F5 0.00000%
0006F6 0.00000%
0006F7 0.00000%
0006F8 0.00000%
0006F9 0.00000%
0006FB 0.00000%
0006FE 0.00000%


More information about the Idna-update mailing list