Mixing scripts (Re: Unicode versions (Re: Criteria for exceptional characters))

Wed Dec 20 00:38:20 CET 2006

> >>> Is there a list of the Unicode codepoints known to be used in each of
> >>> the ISO 15924 script codes?
> >>
> >>
> >> The closest you are going to get to an repertoire partitioning
> >> of Unicode into scripts is Scripts.txt, the very file we have
> >> been talking about and using for the development of the
> >> inclusions file.
> 
> I was not asking for a partitioning.

O.k., but your were asking for "a list of the Unicode codepoints
known to be used in each of the ISO 15924 script codes."

I've explained at length why no such list exists. Correcting
the notion of what ISO 15924 script codes actually are, it
might then be possible to start a project to collect
information on which characters get used with each
writing system that might end up cataloged or otherwise
be identified with each of the ISO 15924 script codes,
but it is unlikely that such an endeavor would terminate
in less than a decade. And the end result you would get
would be so full of overlapping determinations, annotations, footnotes,
and caveats, that it is easy to foresee that it would be
pretty useless for anything along the lines of what
we are attempting to accomplish for IDNAbis.

Just my opinion, though. Perhaps somebody else can see a
useful (and quick) way to come up with a definitive answer
that would be helpful.

--Ken