Adam, one question..... do you think that the canonicalization function you're positing can be described in an Unicode-version-independent way? harald