Defs: General

Mark Davis mark at macchiato.com
Thu Nov 20 03:07:15 CET 2008


Defs
Other than the A-Label and U-Label issue already covered:
------------------------------

A code point is an integer value associated with a character in a coded
character set.

Unicode [Unicode51] is a coded character set containing about 100,000
characters as of the current version.

=>

A code point is an integer value in the codespace of a coded character set.
In Unicode, these are integers from 0 to 0x10FFFF.

Unicode [Unicode51] is a coded character set with about 100,000 characters
assigned to code points as of version 5.1.


Rationale: A code point is not necessarily associated with a character. In
Unicode, for example, the vast majority are not, since they have not yet been
assigned a character. I also added the note about the range in Unicode, since
its code points are the most referenced in these documents.
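
As a rough illustration of the distinction between the codespace and the
assigned characters, here is a minimal Python sketch. The helper names
(is_code_point, is_unassigned) are hypothetical and not taken from any draft.
It assumes that general category "Cn" is an acceptable stand-in for "no
character assigned"; that category also includes the 66 noncharacters, which
are permanently reserved rather than merely not yet assigned. The counts
depend on the Unicode version bundled with the Python build, which will be
later than 5.1.

```python
import unicodedata

MAX_CODE_POINT = 0x10FFFF  # upper end of the Unicode codespace


def is_code_point(value: int) -> bool:
    """True if value lies in the Unicode codespace, 0 to 0x10FFFF."""
    return 0 <= value <= MAX_CODE_POINT


def is_unassigned(cp: int) -> bool:
    """True if cp has general category Cn (unassigned or noncharacter).

    Assumption: Cn is used here as an approximation of "no character
    assigned to this code point".
    """
    return unicodedata.category(chr(cp)) == "Cn"


total = MAX_CODE_POINT + 1
unassigned = sum(1 for cp in range(total) if is_unassigned(cp))

print(f"Unicode {unicodedata.unidata_version}: "
      f"{unassigned} of {total} code points are category Cn")
```

Every integer in the range is a code point whether or not a character has been
assigned to it, which is the reason for the proposed rewording.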
------------------------------

these specifications leave the problem of transcoding between the


[action: someplace define the term "transcoding", or better yet, just
use the term "converting"]


------------------------------

Mark