Draft on IDN Tables in XML
roessler at does-not-exist.org
Thu Mar 1 21:21:26 CET 2012
(Speaking in a purely personal capacity...)
A few quick comments on the XML format level upon skimming this (mostly nit-picking):
- you don't seem to identify the namespace URI in the normative text, it's hidden in the RNG schema
- you don't use the namespace in your examples.
- the example in 3.4 doesn't follow the specification for the description element: You either need a CDATA construct, or you need to entity-escape the pointy brackets.
- in the specification for the description element, don't talk of "CDATA" as an element. It isn't, but rather a particular way to mark up textual data:
So: The content model for <description> is text; if content includes markup, then CDATA is a convenient method to write that out.
- MIME-type -> media type
- To the unique identifier point in your issues list -- agree that it might be useful to give the table a unique identifier. I probably wouldn't choose an ID attribute, but would include a "this version" and "latest version" URI in the <meta> section.
- I may be missing something, but what's the extension model for the @when attribute on <var>? Are certain names there reserved for IANA, and others up for grabs? Want to use URIs for these names, so everybody can coin their own without stepping on others' toes? Something else?
Hope these are useful,
Thomas Roessler <roessler at does-not-exist.org> (@roessler)
On 2012-03-01, at 20:15 +0100, Kim Davies wrote:
> I have posted a first draft regarding a format that could be used for representing IDN Tables in XML to the I-D Repository:
> After discussion with a number of folks that felt this would be good work to undertake, I've put together a first cut which is not comprehensive, but I think goes some way toward a potential format.
> Unless there is interest in this being a more formal activity, my assumption is to aim to publish the final result independently as an Informational RFC. However, the mechanism of publication is secondary to coming up with something useful that would benefit TLD registries and other implementors. A list of design goals, from the document, is as follows:
> • MUST be in a format that can be implemented in a reasonably straightforward manner in software;
> • The format SHOULD be able to be checked for formatting errors, such that common mistakes can be caught;
> • An IDN Table MUST be able to express the set of valid code points that are allowed for registration under a specific zone administrator's policies;
> • MUST be able to express computed alternatives to a given domain name based on a one-to-one, or one-to-many relationship. These computed alternatives are commonly known as "IDN variants";
> • IDN Variants SHOULD be able to be tagged with specific categories, such that the categories can be used to support registry policy (such as whether to list the computed variant in the zone, or to merely block it from registration);
> • IDN Variants MUST be able to stipulated based on contextual information. For example, specific variants may only be applicable when they follow another specific code point, or when the code point is displayed in a specific presentation form;
> • The data contained within the table MUST be unambiguous, such that independent implementations that utilise the contents will arrive at the same results;
> • IDN Tables SHOULD be suitable for comparison and re-use, such that one could easily compare the contents of two or more to see the differences, to merge them, and so on.
> • As many existing IDN Tables are practicable SHOULD be able to be migrated to the new format with all applicable logic retained.
> It is explicitly NOT the goal of this format to:
> • Stipulate what code points should be listed in an IDN Table by a zone administrator. What registration policies are used for a particular zone is outside the scope of this memo.
> • Stipulate what a consumer of an IDN Table must do when they determine a particular domain is valid or invalid; or arrive at a set of computed IDN variants. IDN Tables are only used to describe rules for computing code points, but does not prescribe how registries and other parties utilise them.
> I'd appreciate any feedback.
> Idna-update mailing list
> Idna-update at alvestrand.no
More information about the Idna-update