[Fwd: I-D ACTION:draft-hall-mime-app-mbox-03.txt]
Bruce Lilly
blilly at erols.com
Sun Jan 30 17:24:13 CET 2005
On Wed January 26 2005 14:57, Eric A. Hall wrote:
>
> The ADs for the applications area are going to move this into last-call
> again, but we all want to run it back up the flagpole one more time.
>
> This version departs from earlier releases by specifying a "format"
> parameter to the application/mbox media-type, by defining a "default"
> value for the parameter,
Several things are missing related to that:
1. A registration procedure for registering new format value keywords
(that could be, and probably should be, a separate document).
2. An IANA Considerations section related to establishment of a
format value keyword registry (containing the "default" entry),
and maintenance of that registry in conjunction with the
registration procedure.
3. Location of the format value keyword registry (so that implementors
can find the registry). That should be coordinated with IANA.
4. Syntax rules and ABNF for the format keyword values, unless
"anything goes".
5. Semantic rules for format value keywords, e.g. are they
case-insensitive.
6. Provision, if any, for private-use or experimental format value
keywords (e.g. reservation of keywords beginning with"x-" for
such purposes).
[...]
> The "default" format uses a sequence of 822 messages, with the exception
> that line-endings are LF instead of CR/LF (this only applies to the
> canonical database, and doesn't affect the transfer protocol or anything
> else).
[...]
> Another thing that is specified here is that separator lines (at the
> least) must be encoded to prevent local collisions, when an mbox
> attachment is saved into an existing local folder (messages can become
> irreversible mingled if some kind of escaping is not performed).
Since the format differs from canonical message format, and as there
appears to be provision for encoding parts of the media type (using
an unspecified encoding algorithm), it appears that several items are
missing regarding such encoding:
1. encoding algorithm(s) and corresponding decoding algorithm(s)
2. how the particular encoding algorithm used by the originator is
specified with the media type so that it can be reversed by the
recipient.
3. interaction between any transfer encoding (RFC 2045) which may
be present in messages and the encoding algorithms above
4. if it is possible to have the entire media type encoded or
only portions ("at the least") encoded, how the recipient can
determine which is the case, and how to identify which portions
are encoded so that appropriate decoding -- of those portions
only -- can be performed w/o mangling unencoded portions, even
if those unencoded portions contain content which has octet
sequences resembling encoded portions. [I suspect that partial
encoding won't work, and that the entire media type would have
to be encoded/decoded as a unit.]
5. Interaction of encoding mechanisms and modifications that may
occur during transport (message/partial fragmentation, addition
of spurious whitespace, removal of trailing whitespace, etc.).
6. Since the media type format contains lone LF octets, it is
unsuitable for transfer w/o transfer encoding (RFC 2822 section
2.3); it is therefore possible that:
a) a message within an mbox may have had RFC 2045 transfer
encoding applied to a body MIME-part, with a corresponding
Content-Transfer-Encoding field
b) CRLF sequences delimiting lines may have been changed to LF
c) some encoding may be applied to all or portions of the media
type for the purpose of escaping "separator lines"
d) transfer encoding may have to be applied to the media type
for transfer, as it would otherwise contain non-conforming
octet sequences (LF not immediately preceded by CR (RFC 2822
sect. 2.3)
In order to recover content end-to-end, it is necessary to
specify the order of the various transformations and the
corresponding decoding sequence, to prevent undesirable
interaction between encoding/decoding operations that would
alter message content.
In the registration temple, the magic number could be indicated as
0x46726F6D20 ("From ").
More information about the Ietf-types
mailing list