Allowed characters (was: Re: Casefolding Sigma (was: Re: IDNAbis Preprocessing Draft))

Mark Davis mark.davis at icu-project.org
Wed Mar 26 20:46:09 CET 2008


It takes only a small change to restrict the comparison to characters in the Arabic blocks.

Change A to:
[
 [:^idna=disallowed:]
 -[:^block=/(?i)Arabic/:]
]

Change B to:
[[:L:][:Mn:][:Mc:][:Nd:]
-[:^isCaseFolded:]
-[:NFKC_QC=N:]
-[:di:]
-[[:block=Combining_Diacritical_Marks_for_Symbols:]
  [:block=Musical_Symbols:]
  [:block=Ancient_Greek_Musical_Notation:]
 ]
-[:^block=/(?i)Arabic/:]
]
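
For anyone who wants to check individual code points without the online tool, here is a rough Python sketch that approximates the B set above. It is only an approximation under stated assumptions: the block ranges are hard-coded for the four Arabic blocks of Unicode 5.1, the function names are hypothetical, str.casefold() and an NFKC round-trip stand in for the isCaseFolded and NFKC_QC properties, [:di:] is not modeled at all, and the results depend on the Unicode version shipped with your Python rather than the one the tool uses. The three excluded symbol and music blocks are omitted because they lie outside the Arabic blocks anyway.

```python
import unicodedata

# Assumption: the blocks whose names contain "Arabic" as of Unicode 5.1.
# Later Unicode versions add further Arabic blocks not listed here.
ARABIC_BLOCKS = [
    (0x0600, 0x06FF),  # Arabic
    (0x0750, 0x077F),  # Arabic Supplement
    (0xFB50, 0xFDFF),  # Arabic Presentation Forms-A
    (0xFE70, 0xFEFF),  # Arabic Presentation Forms-B
]


def in_arabic_block(cp):
    """True if the code point falls in one of the hard-coded Arabic blocks."""
    return any(lo <= cp <= hi for lo, hi in ARABIC_BLOCKS)


def approx_in_b(cp):
    """Approximate membership in set B (hypothetical helper, not the tool's logic)."""
    ch = chr(cp)
    cat = unicodedata.category(ch)
    # [:L:][:Mn:][:Mc:][:Nd:]
    if not (cat.startswith("L") or cat in ("Mn", "Mc", "Nd")):
        return False
    # Stand-in for -[:^isCaseFolded:]: drop anything that case folding changes.
    if ch.casefold() != ch:
        return False
    # Stand-in for -[:NFKC_QC=N:]: drop anything that NFKC changes.
    if unicodedata.normalize("NFKC", ch) != ch:
        return False
    # -[:^block=/(?i)Arabic/:] keeps only code points in Arabic blocks.
    return in_arabic_block(cp)


if __name__ == "__main__":
    for cp in (0x0628, 0x064E, 0x0660, 0x060C, 0xFE8F):
        name = unicodedata.name(chr(cp), "<unnamed>")
        print(f"U+{cp:04X} {name}: {approx_in_b(cp)}")
```

With this sketch a plain letter, a combining mark, and a digit from the Arabic block pass, while punctuation and presentation forms do not.
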

Click on "Only A" and you'll get the list of characters that used to be valid
in Arabic blocks but would no longer be. Here's the URL to make it simple
for you:

http://unicode.org/cldr/utility/list-unicodeset.jsp?a=[[%20[:^idna=disallowed:]%20-[:^block=/(?i)Arabic/:]]-[[:L:][:Mn:][:Mc:][:Nd:]-[:^isCaseFolded:]-[:NFKC_QC=N:]-[:di:]-[[:block=Combining_Diacritical_Marks_for_Symbols:]%20%20[:block=Musical_Symbols:]%20%20[:block=Ancient_Greek_Musical_Notation:]%20]-[:^block=/(?i)Arabic/:]]]


Similarly, the characters "In both A and B" are at:

http://unicode.org/cldr/utility/list-unicodeset.jsp?a=[[%20[:^idna=disallowed:]%20-[:^block=/(?i)Arabic/:]]%26[[:L:][:Mn:][:Mc:][:Nd:]-[:^isCaseFolded:]-[:NFKC_QC=N:]-[:di:]-[[:block=Combining_Diacritical_Marks_for_Symbols:]%20%20[:block=Musical_Symbols:]%20%20[:block=Ancient_Greek_Musical_Notation:]%20]-[:^block=/(?i)Arabic/:]]]
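
Both URLs are just the two sets joined by an operator and passed as the "a" parameter: "-" gives "Only A" (A minus B) and "&" (encoded as %26) gives "In both A and B". A minimal Python sketch of how such URLs could be put together follows. The helper name list_url and the constant names are hypothetical, and quote() with safe="" escapes more characters than the hand-written URLs above, so the output looks different but decodes to the same pattern. Whether the tool is still served at this address is not something the sketch checks.

```python
from urllib.parse import quote, unquote

# Base address and parameter name taken from the URLs quoted above.
BASE = "http://unicode.org/cldr/utility/list-unicodeset.jsp"

# Set A and set B as given in this message, with optional spaces removed.
SET_A = "[[:^idna=disallowed:]-[:^block=/(?i)Arabic/:]]"
SET_B = (
    "[[:L:][:Mn:][:Mc:][:Nd:]"
    "-[:^isCaseFolded:]-[:NFKC_QC=N:]-[:di:]"
    "-[[:block=Combining_Diacritical_Marks_for_Symbols:]"
    "[:block=Musical_Symbols:]"
    "[:block=Ancient_Greek_Musical_Notation:]]"
    "-[:^block=/(?i)Arabic/:]]"
)

# "-" is set difference, "&" is set intersection in UnicodeSet syntax.
ONLY_A = f"[{SET_A}-{SET_B}]"
BOTH = f"[{SET_A}&{SET_B}]"


def list_url(pattern):
    """Build a list-unicodeset URL for a pattern (hypothetical helper)."""
    # safe="" percent-encodes every reserved character, including "&",
    # which would otherwise be read as a query-string separator.
    return BASE + "?a=" + quote(pattern, safe="")


if __name__ == "__main__":
    print(list_url(ONLY_A))
    print(list_url(BOTH))
```

Escaping the "&" matters: left raw, it would split the query string and the tool would see only the first half of the pattern.
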

Mark

On Wed, Mar 26, 2008 at 12:23 PM, Michael Everson <everson at evertype.com>
wrote:

> I really just wanted a list of
>
> Which Arabic letters were in and which were out.
>
> Which Arabic diacritics were in and which were out.
>
> Which punctuation and symbols in the Arabic block were in and which were
> out.
>
> Sorry if this is too complicated.
> --
> Michael Everson * http://www.evertype.com
> _______________________________________________
> Idna-update mailing list
> Idna-update at alvestrand.no
> http://www.alvestrand.no/mailman/listinfo/idna-update
>


