Usenet Format Working Group                            K. Murchison, Ed.
Internet-Draft                                Carnegie Mellon University
Obsoletes: 1036 (if approved)                                 C. Lindsey
Expires: November 23, 2006                      University of Manchester
                                                                 D. Kohn
                                                           FlyDash, Inc.
                                                            May 22, 2006


                         Netnews Article Format
                      draft-ietf-usefor-usefor-08

Status of this Memo

   By submitting this Internet-Draft, each author represents that any
   applicable patent or other IPR claims of which he or she is aware
   have been or will be disclosed, and any of which he or she becomes
   aware will be disclosed, in accordance with Section 6 of BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as Internet-
   Drafts.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/ietf/1id-abstracts.txt.

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html.

   This Internet-Draft will expire on November 23, 2006.

Copyright Notice

   Copyright (C) The Internet Society (2006).

Abstract

   This document specifies the syntax of Netnews articles in the context
   of the "Internet Message Format" (RFC 2822) and "Multipurpose
   Internet Mail Extensions (MIME)" (RFC 2045).  This document
   supersedes RFC 1036, updating it to reflect current practice and
   incorporating incremental changes specified in other documents.



Murchison, et al.       Expires November 23, 2006               [Page 1]


Internet-Draft           Netnews Article Format                 May 2006


Changes since draft-ietf-usefor-usefor-07

   o  Reworked ABNF for Path header field keywords (ticket #1047).

   o  Removed definition of "sender" in Injection-Info header field
      (ticket #1159).

   o  Reworded description of "posting-account" in Injection-Info header
      field (ticket #1159).

   o  Discussed the use of [FWS] in Newsgroups, Followup-To, and
      Distribution header fields (ticket #1178).

   o  Further clarification of *WSP vs. [FWS] in Section 2.2 (ticket
      #1179).

   o  Miscellaneous editorial changes from Dan Kohn.


Changes since draft-ietf-usefor-usefor-06

   o  Reworked ABNF for Path header field delimiters and components
      (ticket #1047).

   o  Imported ABNF elements from reference docs (ticket #1155).

   o  Added IANA registration for header fields per RFC 3864 (ticket
      #1156).

   o  New ABNF for Control header field (ticket #1157).

   o  Better ABNF for <host-value> (ticket #1158).

   o  Added new definitions and advice on sender vs. posting-account in
      Injection-Info header field (ticket #1159).

   o  New ABNF for Archive and Injection-Info header fields, including
      note regarding CFWS and <parameter> (ticket #1177).

   o  Replaced leading/trailing [FWS] with *WSP in header field ANBF
      (ticket #1179).

   o  Using RFC 4234 as a reference instead of RFC 2234.

   o  Removed reference to RFC 0977 -- using only
      draft-ietf-nntpext-base.





Murchison, et al.       Expires November 23, 2006               [Page 2]


Internet-Draft           Netnews Article Format                 May 2006


   o  Added note about Expires being defined by RFC 2156.

   o  Miscellaneous editorial changes.


Changes since draft-ietf-usefor-usefor-05

   o  Finalized ABNF for <msg-id> (ticket #1003).

   o  Further cleanup of description for Newsgroups header field (ticket
      #1021).

   o  Fixed ABNF for <orig-date> (ticket #1028).

   o  Began adding text for differences from RFC 1036 (ticket #1032).

   o  Added text regarding hard-to-parse MIME boundaries (ticket #1046).

   o  Reworked/redefined Path header field delimiters and components
      (ticket #1047).

   o  Added more differences from RFC 2822 (ticket #1052).

   o  Added text for relationship to RFC 2822 (ticket #1053).

   o  Listed header fields which don't permit CFWS (ticket #1079).

   o  Applied text from ticket #1080 (Injection-Info MIME params).

   o  Added more text Approved header field semantics (ticket #1082).

   o  Added text regarding Injection-Date being mandatory/optional
      (ticket #1088).

   o  Reworked definitions of "agents", etc (ticket #1102).

   o  Removed RFC 2821 as a normative reference.

   o  Miscellaneous editorial changes.


Changes since draft-ietf-usefor-usefor-04

   o  Updated to newer references (ticket #1002).

   o  Cleaned up ABNF for <msg-id> (ticket #1003).





Murchison, et al.       Expires November 23, 2006               [Page 3]


Internet-Draft           Netnews Article Format                 May 2006


   o  Deprecated X- headers (ticket #1004).

   o  Clarified relationship between followups and References header
      field (ticket #1008).

   o  Cleaned up ABNF and description for Newsgroups header field
      (ticket #1021).

   o  Tweaked language regarding the use of <obs-phrase> (ticket #1022).

   o  Tweaked language regarding GMT timezone and Date header field
      (ticket #1028).

   o  Added language warning against folding Newsgroups header field
      (ticket #1042).

   o  Use RFC 2822 term "header field" instead of "header" (ticket
      #1043).

   o  Starting adding differences from RFC 2822 (ticket #1052).

   o  Miscellaneous editorial changes.


Changes since draft-ietf-usefor-usefor-03

   o  Reworked ABNFs of several headers.

   o  Used Charles' ABNF for <msg-id>.

   o  Disallowed comments in Supersedes header.

   o  Disallowed comments in Xref header.

   o  Disallowed comments in Message-Id header.

   o  CFWS between msg-ids in References header is not optional.

   o  Compatibility changes based on comments from Charles.

   o  Miscellaneous editorial changes.


Changes since draft-ietf-usefor-usefor-02

   o  Changed to RFC 3978 boilerplate (xml2rfc v1.29)





Murchison, et al.       Expires November 23, 2006               [Page 4]


Internet-Draft           Netnews Article Format                 May 2006


   o  Changed "network news" to "Netnews" throughout.

   o  Prohibit NO-WS-CTL in msg-id.

   o  Complaints-To header is now an Injection-Info parameter.

   o  Added descriptions of Injection-Info parameters.

   o  Removed "filename" parameter from Archive header.

   o  Added CFWS to User-Agent header.

   o  Miscellaneous editorial changes.


Changes since draft-ietf-usefor-usefor-01

   o  Removed half-hearted discussion of internal format and 8-bit clean
      transport.

   o  Added definitions of "proto-article", "posting agent", "followup",
      "followup-agent", "user-agent", and "injecting agent".

   o  Removed discussion of message/partial MIME messages.

   o  Noted that the header contents in every line MUST NOT be empty.

   o  Merged MIME sections.

   o  Only allow "UT" and "GMT" in Date header; disallow all other <obs-
      zone>.

   o  Used Charles' ABNF for <msg-id> and <unstructured>.

   o  Removed restrictions on length and start character for Newsgroups.

   o  More verbose description of Path header.

   o  Disallowed comments in Control header.

   o  Specified that <control-command> is a verb optionally followed by
      whitespace-separated arguments.

   o  Noted that Supersedes header is different from [Son-of-1036].

   o  More exact ABNF for Archive and Injection-Info parameters.





Murchison, et al.       Expires November 23, 2006               [Page 5]


Internet-Draft           Netnews Article Format                 May 2006


   o  Discussed meaning of "yes", "no" in Archive header.

   o  Added "Obsolete Headers" section.

   o  Miscellaneous editorial changes


Changes since draft-ietf-usefor-article-13

   o  The Mail-Copies-To, Posted-And-Mailed headers have been moved to
      other documents.

   o  Dropped MIME parameters, as there is no WG consensus (per Chair).

   o  More exact ABNF for Archive and Injection-Info parameters.

   o  Complaints-To header is now an Injection-Info parameter.


Issues to be addressed

   o  Path header field delimiters and components ABNF (ticket #1047).

   o  Whitespace in Path header field (ticket #1178).  Editor isn't
      clear on what the issue actually is.


























Murchison, et al.       Expires November 23, 2006               [Page 6]


Internet-Draft           Netnews Article Format                 May 2006


Table of Contents

   1.  Introduction . . . . . . . . . . . . . . . . . . . . . . . . .  9
     1.1.  Basic Concepts . . . . . . . . . . . . . . . . . . . . . .  9
     1.2.  Scope  . . . . . . . . . . . . . . . . . . . . . . . . . .  9
     1.3.  Requirements Notation  . . . . . . . . . . . . . . . . . .  9
     1.4.  Syntax Notation  . . . . . . . . . . . . . . . . . . . . .  9
     1.5.  Definitions  . . . . . . . . . . . . . . . . . . . . . . . 10
     1.6.  Structure of This Document . . . . . . . . . . . . . . . . 11
   2.  Format . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
     2.1.  Base . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
     2.2.  Header Fields  . . . . . . . . . . . . . . . . . . . . . . 13
     2.3.  MIME Conformance . . . . . . . . . . . . . . . . . . . . . 15
   3.  News Header Fields . . . . . . . . . . . . . . . . . . . . . . 16
     3.1.  Mandatory Header Fields  . . . . . . . . . . . . . . . . . 16
       3.1.1.  From . . . . . . . . . . . . . . . . . . . . . . . . . 17
       3.1.2.  Date . . . . . . . . . . . . . . . . . . . . . . . . . 17
       3.1.3.  Message-ID . . . . . . . . . . . . . . . . . . . . . . 17
       3.1.4.  Subject  . . . . . . . . . . . . . . . . . . . . . . . 19
       3.1.5.  Newsgroups . . . . . . . . . . . . . . . . . . . . . . 20
       3.1.6.  Path . . . . . . . . . . . . . . . . . . . . . . . . . 21
     3.2.  Optional Header Fields . . . . . . . . . . . . . . . . . . 23
       3.2.1.  Injection-Date . . . . . . . . . . . . . . . . . . . . 23
       3.2.2.  References . . . . . . . . . . . . . . . . . . . . . . 24
       3.2.3.  Followup-To  . . . . . . . . . . . . . . . . . . . . . 24
       3.2.4.  Expires  . . . . . . . . . . . . . . . . . . . . . . . 25
       3.2.5.  Control  . . . . . . . . . . . . . . . . . . . . . . . 25
       3.2.6.  Supersedes . . . . . . . . . . . . . . . . . . . . . . 26
       3.2.7.  Distribution . . . . . . . . . . . . . . . . . . . . . 26
       3.2.8.  Summary  . . . . . . . . . . . . . . . . . . . . . . . 27
       3.2.9.  Approved . . . . . . . . . . . . . . . . . . . . . . . 27
       3.2.10. Organization . . . . . . . . . . . . . . . . . . . . . 27
       3.2.11. Xref . . . . . . . . . . . . . . . . . . . . . . . . . 27
       3.2.12. Archive  . . . . . . . . . . . . . . . . . . . . . . . 28
       3.2.13. User-Agent . . . . . . . . . . . . . . . . . . . . . . 28
       3.2.14. Injection-Info . . . . . . . . . . . . . . . . . . . . 29
     3.3.  Obsolete Header Fields . . . . . . . . . . . . . . . . . . 31
       3.3.1.  Lines  . . . . . . . . . . . . . . . . . . . . . . . . 32
   4.  Internationalization Considerations  . . . . . . . . . . . . . 33
   5.  Security Considerations  . . . . . . . . . . . . . . . . . . . 34
   6.  IANA Considerations  . . . . . . . . . . . . . . . . . . . . . 35
   7.  References . . . . . . . . . . . . . . . . . . . . . . . . . . 40
     7.1.  Normative References . . . . . . . . . . . . . . . . . . . 40
     7.2.  Informative References . . . . . . . . . . . . . . . . . . 41
   Appendix A.  Acknowledgments . . . . . . . . . . . . . . . . . . . 43
   Appendix B.  Differences from RFC 1036 and its derivatives . . . . 44
   Appendix C.  Differences from RFC 2822 . . . . . . . . . . . . . . 45
   Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 46



Murchison, et al.       Expires November 23, 2006               [Page 7]


Internet-Draft           Netnews Article Format                 May 2006


   Intellectual Property and Copyright Statements . . . . . . . . . . 47


















































Murchison, et al.       Expires November 23, 2006               [Page 8]


Internet-Draft           Netnews Article Format                 May 2006


1.  Introduction

1.1.  Basic Concepts

   "Netnews" is a set of protocols for generating, storing and
   retrieving news "articles" (whose format is a subset of that for
   Email messages) and for exchanging them amongst a readership which is
   potentially widely distributed.  It is organized around "newsgroups",
   with the expectation that each reader will be able to see all
   articles posted to each newsgroup in which he participates.  These
   protocols most commonly use a flooding algorithm which propagates
   copies throughout a network of participating servers.  Typically,
   only one copy is stored per server, and each server makes it
   available on demand to readers able to access that server.

1.2.  Scope

   This document specifies the syntax of Netnews articles in the context
   of the "Internet Message Format" [RFC2822] and "Multipurpose Internet
   Mail Extensions (MIME)" [RFC2045].  This document supersedes
   [RFC1036], updating it to reflect current practice and incorporating
   changes and clarifications specified in other documents such as [Son-
   of-1036].

   This is the first in a set of documents that obsolete [RFC1036].
   This document focuses on the syntax and semantics of Netnews
   articles.  [I-D.ietf-usefor-usepro] is also a standards-track
   document, and describes the protocol issues of Netnews articles,
   independent of transport protocols such as [I-D.ietf-nntpext-base].
   A best common practice document, [I-D.ietf-usefor-useage], describes
   implementation recommendations to improve interoperability and
   usability.

   This specification is intended as a definition of what article
   content format is to be passed between systems.  Although some news
   systems locally store articles in this format (which eliminates the
   need for translation between formats), local storage is outside of
   the scope of this standard.

1.3.  Requirements Notation

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
   document are to be interpreted as described in [RFC2119].

1.4.  Syntax Notation

   Header fields defined in this specification use the Augmented Backus-



Murchison, et al.       Expires November 23, 2006               [Page 9]


Internet-Draft           Netnews Article Format                 May 2006


   Naur Form (ABNF) notation (including the Core Rules) specified in
   [RFC4234] and many constructs defined in [RFC2822], [RFC2045] as
   updated by [RFC2231], and [RFC3986], specifically:

   token         = <see RFC2045 Section 5.1>
   dot-atom-text = <see RFC2045 Section 5.1>

   parameter     = <see RFC2231 Section 7>
   attribute     = <see RFC2231 Section 7>

   FWS           = <see RFC2822 Section 3.2.3>
   CFWS          = <see RFC2822 Section 3.2.3>
   atext         = <see RFC2822 Section 3.2.4>
   dot-atom-text = <see RFC2822 Section 3.2.4>
   phrase        = <see RFC2822 Section 3.2.6>
   utext         = <see RFC2822 Section 3.2.6>
   date-time     = <see RFC2822 Section 3.3>
   mailbox       = <see RFC2822 Section 3.4>
   mailbox-list  = <see RFC2822 Section 3.4>
   address-list  = <see RFC2822 Section 3.4>
   fields        = <see RFC2822 Section 3.6>

   IPv6address   = <see RFC3986 Section 3.2.2>
   IPv4address   = <see RFC3986 Section 3.2.2>

   ALPHA         = <see RFC4234 Appendix B.1>
   CRLF          = <see RFC4234 Appendix B.1>
   DIGIT         = <see RFC4234 Appendix B.1>
   DQUOTE        = <see RFC4234 Appendix B.1>
   SP            = <see RFC4234 Appendix B.1>

   Additionally, Section 3.1.3 specifies a stricter definition of
   <msg-id> than the syntax in [RFC2822] Section 3.6.4.

1.5.  Definitions

   An "article" is the unit of Netnews, analogous to an [RFC2822]
   "message".  A "proto-article" is one that has not yet been injected
   into the news system.  In contrast to an "article", a "proto-article"
   may lack some mandatory header fields.

   A "message identifier" (Section 3.1.3) is a unique identifier for an
   article, usually supplied by the "user agent" that posted it or,
   failing that, by the "news server".  It distinguishes the article
   from every other article ever posted anywhere.  Articles with the
   same message identifier are treated as if they are the same article
   regardless of any differences in the body or header fields.




Murchison, et al.       Expires November 23, 2006              [Page 10]


Internet-Draft           Netnews Article Format                 May 2006


   A "newsgroup" is a single forum having a name and intended for
   articles on a specific topic.  An article is "posted to" a single
   newsgroup or several newsgroups.  When an article is posted to more
   than one newsgroup, it is said to be "crossposted"; note that this
   differs from posting the same text as part of each of several
   articles, one per newsgroup.

   A newsgroup may be "moderated", in which case submissions are not
   posted directly, but mailed to a "moderator" for consideration and
   possible posting.  Moderators are typically human but may be
   implemented partially or entirely in software.

   A "poster" is the person or software that composes and submits a
   possibly compliant article to a "user agent".  The poster is
   analogous to an [RFC2822] author.

   A "reader" is the person or software reading news articles.

   A "followup" is an article containing a response to the contents of
   an earlier article, its "precursor".  Every followup includes a
   "References" header field identifying that precursor (but note that
   non-followup articles may also use a References header field).

   A "control message" is an article which is marked as containing
   control information; a news server receiving such an article may
   (subject to the policies observed at that site) take actions beyond
   just filing and passing on the article.

   A "news server" is software that may accept articles from a "user
   agent", and/or make articles available to user agents, and/or
   exchange articles with other news servers.

   A "user agent" is software that may help posters submit proto-
   articles to a news server, and/or fetch articles from a news server
   and present them to a reader, and/or assist the reader in creating
   articles and followups.

   The generic term "agent" is used when describing requirements that
   apply to both user agents and news servers.

1.6.  Structure of This Document

   This document uses a cite by reference methodology, rather than
   repeating the contents of other standards, which could otherwise
   result in subtle differences and interoperability challenges.
   Although this document is as a result rather short, it requires
   complete understanding and implementation of the normative references
   to be compliant.



Murchison, et al.       Expires November 23, 2006              [Page 11]


Internet-Draft           Netnews Article Format                 May 2006


   Section 2 defines the format of Netnews articles.  Section 3 details
   the header fields necessary to make an article suitable for the
   Netnews environment.
















































Murchison, et al.       Expires November 23, 2006              [Page 12]


Internet-Draft           Netnews Article Format                 May 2006


2.  Format

2.1.  Base

   An article is said to be conformant to this specification if it
   conforms to the format specified in [RFC2822] Section 3 and to the
   additional requirements of this specification.

   An article that uses the obsolete syntax specified in Section 4 of
   [RFC2822] is NOT conformant to this specification, except for
   following two cases:

   o  Articles are conformant if they use the <obs-phrase> construct
      (use of a phrase like "John Q. Public" without the use of quotes,
      see [RFC2822] Section 4.1) but agents MUST NOT generate
      productions of such syntax.

   o  Articles are conformant if they use the "GMT" <zone>, as specified
      in Section 3.1.2.

   This document, and specifications that build upon it, specify how to
   handle conformant articles.  Handling of non-conformant articles is
   outside the scope of this specification.

   Agents conforming to this specification MUST generate only conformant
   articles.

   The text below uses ABNF to specify restrictions on the syntax
   specified in [RFC2822]; this grammar is intended to be more
   restrictive than the [RFC2822] grammar.  Articles must conform to the
   ABNF specified in [RFC2822].

   Articles must also conform to the restrictions specified here, both
   those that are expressed as text and those that are expressed as
   ABNF.

      NOTE: Other specifications use the term "header" as a synonym for
      what [RFC2822] calls "header field".  This document follows the
      terminology in Section 2 of [RFC2822] in using the terms "line",
      "header field", "header field name", "header field body", and
      "folding", based on a belief that consistent terminology among
      specifications that depend on each other makes the specifications
      easier to use in the long run.

2.2.  Header Fields

   All header fields in a news article are compliant with [RFC2822],
   however this specification is less permissive in what can be



Murchison, et al.       Expires November 23, 2006              [Page 13]


Internet-Draft           Netnews Article Format                 May 2006


   generated and accepted by news agents.  The syntax allowed for news
   articles is a strict subset of the "Internet Message Format", making
   all messages compliant with this specification inherently compliant
   with [RFC2822].  Note however that the converse is not guaranteed to
   be true in all cases.

   General rules which apply to all header fields (even those documented
   in [RFC2822] and [RFC2045]) are listed below and those that apply to
   specific header fields are described in the relevant sections of this
   document.

   o  All agents MUST generate header fields so that at least one space
      immediately follows the ':' separating the header field name and
      the header field body (for compatibility with deployed software,
      including [I-D.ietf-nntpext-base] servers).  News agents MAY
      accept header fields which do not contain the required space.

   o  Every line of a header field body (including the first and any
      that are subsequently folded) MUST contain at least one non-
      whitespace character.

         NOTE: This means that no header field body defined by or
         referenced by this document can be empty.  As a result, rather
         than using the <unstructured> syntax from Section 3.2.6 of
         [RFC2822], this document uses a stricter definition:

   unstructured    =  *WSP utext *( [FWS] utext ) *WSP

         NOTE: The [RFC2822] specification sometimes uses [FWS] at the
         beginning or end of ABNF describing header field content.  This
         specification uses *WSP in such cases, also in cases where this
         specification redefines constructs from [RFC2822].  This is
         done for consistency with the restriction described here, but
         the restriction applies to all header fields, not just those
         where ABNF is defined in this document.

   o  Compliant software MUST NOT generate (but MAY accept) header
      fields of more than 998 octets.  This is the only limit on the
      length of a header field prescribed by this standard.  However,
      specific rules to the contrary may apply in particular cases (for
      example, according to [RFC2047] lines of a header field containing
      encoded-words are limited to 76 octets).  [I-D.ietf-usefor-useage]
      includes suggested limits for convenience of display by user
      agents.

         NOTE: There is NO restriction on the number of lines into which
         a header field may be split, and hence there is NO restriction
         on the total length of a header field (in particular it may, by



Murchison, et al.       Expires November 23, 2006              [Page 14]


Internet-Draft           Netnews Article Format                 May 2006


         suitable folding, be made to exceed the 998 octets restriction
         pertaining to a single header line).

   o  The character set for header fields is US-ASCII.  Where the use of
      non-ASCII characters is required, they MUST be encoded using the
      MIME mechanisms defined in [RFC2047] and [RFC2231].

2.3.  MIME Conformance

   User agents MUST meet the definition of MIME-conformance in [RFC2049]
   and MUST also support [RFC2231].  This level of MIME Conformance
   provides support for internationalization and multimedia in message
   bodies ([RFC2045], [RFC2046], [RFC2231]), and support for
   internationalization of header fields ([RFC2047], [RFC2231]).  Note
   that [Errata] currently exist for [RFC2046] and [RFC2231].

   For the purposes of Section 5 of [RFC2047], all header fields defined
   in Section 3 of this standard are to be considered as "extension
   message header fields", permitting the use of [RFC2047] encodings
   within any <unstructured> header field, or within any <comment> or
   <phrase> permitted within any structured header field.

   User agents MAY accept and generate other MIME extension header
   fields, and in particular SHOULD accept Content-Disposition [RFC2183]
   and Content-Language [RFC3282].


























Murchison, et al.       Expires November 23, 2006              [Page 15]


Internet-Draft           Netnews Article Format                 May 2006


3.  News Header Fields

   The following news header fields extend those defined in Section 3.6
   of [RFC2822]:

   fields          =/ *( newsgroups /
                         path /
                         injection-date /
                         followup-to /
                         expires /
                         control /
                         supersedes /
                         distribution /
                         summary /
                         approved /
                         organization /
                         xref /
                         archive /
                         user-agent /
                         injection-info /
                         lines )

   Each of these header fields may occur at most once in a news article.

   The following header fields defined in this document do not allow
   comments (CFWS):

   Newsgroups
   Path
   Followup-To
   Control
   Supersedes
   Distribution
   Xref
   Lines

   This also applies to the following header field defined in [RFC2822]:

   Message-ID

   Most of these header fields are mainly of interest to news servers,
   and news servers often need to process these fields very rapidly.
   Thus some header fields prohibit comments.

3.1.  Mandatory Header Fields

   Each news article conformant with this specification MUST have
   exactly one of each of the following header fields: From, Date,



Murchison, et al.       Expires November 23, 2006              [Page 16]


Internet-Draft           Netnews Article Format                 May 2006


   Message-ID, Subject, Newsgroups, Path.

3.1.1.  From

   The From header field is the same as that specified in Section 3.6.2
   of [RFC2822] with the added restrictions detailed in Section 2.2.

   from            =  "From:" SP mailbox-list CRLF

3.1.2.  Date

   The Date header field is the same as that specified in Sections 3.3
   and 3.6.1 of [RFC2822] with the added restrictions detailed in
   Section 2.2.  However, the use of "GMT" as a time zone (part of <obs-
   zone>), although deprecated, is widespread in news articles today.
   Therefore, agents MUST accept <date-time> constructs that use the
   "GMT" zone.

   orig-date       =  "Date:" SP date-time CRLF

      NOTE: This specification does not change [RFC2822], which says
      that agents MUST NOT generate <date-time> constructs which include
      any zone names defined by <obs-zone>.

   Software that accepts dates with unknown timezones SHOULD treat such
   timezones as equivalent to "-0000" when comparing dates, as specified
   in [RFC2822] Section 4.3.

   Also note that these requirements apply wherever <date-time> is used,
   including Injection-Date and Expires in Section 3.2.1 and
   Section 3.2.4 respectively.

3.1.3.  Message-ID

   The Message-ID header field contains a single unique message
   identifier.  Netnews is more dependent on message identifier
   uniqueness and fast comparison than Email is, and some news software
   and standards [I-D.ietf-nntpext-base] might have trouble with the
   full range of possible <msg-id>s permitted by [RFC2822].  This
   section therefore restricts the syntax of <msg-id> as compared to
   Section 3.6.4 of [RFC2822].  The global uniqueness requirement for
   <msg-id> in [RFC2822] is to be understood as applying across all
   protocols using such message identifiers, and across both Email and
   Netnews in particular.







Murchison, et al.       Expires November 23, 2006              [Page 17]


Internet-Draft           Netnews Article Format                 May 2006


   message-id      =  "Message-ID:" SP *WSP msg-id *WSP CRLF

   msg-id          =  "<" id-left "@" id-right ">"
                      ; maximum length is 250 octets

   id-left         =  dot-atom-text / no-fold-quote

   id-right        =  dot-atom-text / no-fold-literal

   no-fold-quote   =  DQUOTE
                         ( "." *mqtext /
                           *mqtext "." /
                           *mqtext mqspecial *mqtext )
                         DQUOTE

   mqtext          =  atext / "." / mqspecial

   mqspecial       =  "(" / ")" /      ; same as specials except
                      "<" /            ; "\" and DQUOTE quoted
                      "[" / "]" /      ; "." doubled and ">" omitted
                      ":" / ";" /
                      "@" / "," /
                      ".." / "\\" / "\" DQUOTE

   no-fold-literal =  "[" *( mdtext / "\[" / "\]" / "\\" ) "]"

   mdtext          =  %d33-61 /        ; The rest of the US-ASCII
                      %d63-90 /        ; characters not including
                      %d94-126         ; ">", "[", "]", or "\"

   The msg-id MUST NOT be more than 250 octets in length.

      NOTE: The length restriction ensures that systems that accept
      message identifiers as a parameter when retrieving an article
      (e.g.  [I-D.ietf-nntpext-base]) can rely on a bounded length.

   Observe that msg-id includes the < and >.

   Observe also that in contrast to the corresponding header field in
   [RFC2822]:

   o  the syntax does not allow comments within the Message-ID header
      field,

   o  it ensures that no string of characters is quoted if it were
      already a <dot-atom-text> (it MUST start or end with a ".", or
      contain at least one <mqspecial>),




Murchison, et al.       Expires November 23, 2006              [Page 18]


Internet-Draft           Netnews Article Format                 May 2006


   o  it ensures that no single character is prefixed by a "\" in the
      form of a <quoted-pair> unless strictly necessary,

   o  it excludes all control characters,

   o  there is no possibility for ">" or WSP to occur inside a <msg-id>,
      whether quoted or not, and

   o  even though commonly derived from <domain>s, <id-rights>s are
      case-sensitive (and thus, once created, are not to be altered
      during subsequent transmission or copying)

   This is to simplify processing by news servers and to ensure
   interoperability with existing implementations and compliance with
   [I-D.ietf-nntpext-base].  Thus, whereas under [RFC2822] the following
   <msg-id>s would be considered semantically equivalent,

   <ab.cd@example.com>
   <"ab.cd"@example.com>
   <"ab.\cd"@example.com>

   only the first of them is syntactically permitted by this standard,
   and hence a simple comparison of octets will always suffice to
   determine the identity of two <msg-id>s.

   Also note that this updated ABNF applies wherever <msg-id> is used,
   including the References header field discussed in Section 3.2.2 and
   the Supersedes header field discussed in Section 3.2.6.

   Some software will try to match the <id-right> of a <msg-id> in a
   case-insensitive fashion; some will match it in a case-sensitive
   fashion.  Implementations MUST NOT generate two Message-IDs where the
   only difference is the case of characters in the <id-right> part.

   When generating a <msg-id>, implementations SHOULD use a domain name
   as the <id-right>.

      NOTE: Section 3.6.4 of [RFC2822] recommends that the <id-right>
      should be a domain name or a domain literal.  Domain literals are
      troublesome since many IP addresses are not globally unique;
      domain names are more likely to generate unique Message-IDs.

3.1.4.  Subject

   The Subject header field is the same as that specified in Section
   3.6.5 of [RFC2822] with the added restrictions detailed in
   Section 2.2.  Further discussion of the content of the Subject header
   field appears in [I-D.ietf-usefor-usepro] and [I-D.ietf-usefor-



Murchison, et al.       Expires November 23, 2006              [Page 19]


Internet-Draft           Netnews Article Format                 May 2006


   useage].

   subject         =  "Subject:" SP unstructured CRLF

3.1.5.  Newsgroups

   The Newsgroups header field specifies the newsgroup(s) to which the
   article is posted.

   newsgroups      =  "Newsgroups:" SP newsgroup-list CRLF

   newsgroup-list  =  *WSP newsgroup-name
                      *( [FWS] "," [FWS] newsgroup-name ) *WSP

   newsgroup-name  =  component *( "." component )

   component       =  1*component-char

   component-char  =  ALPHA / DIGIT / "+" / "-" / "_"

   Not all servers support [FWS] in the list of newsgroups.  In
   particular, folding the Newsgroups header field over several lines
   has been shown to harm propagation significantly.  [FWS] in the
   <newsgroup-list> SHOULD NOT be generated, but MUST be accepted.

   A newsgroup component SHOULD NOT consist of digits only, and SHOULD
   NOT contain uppercase letters.  Such components MAY be used only to
   refer to existing groups that do not conform to this naming scheme.

      NOTE: All-digit components conflict with one widely used storage
      scheme for articles.  Mixed case groups cause confusion between
      systems with case sensitive matching and systems with case
      insensitive matching of <newsgroup-name>s.

   <component>s beginning with underline ("_") are reserved for use by
   future versions of this standard and MUST NOT be generated by user
   agents (whether in Newsgroups header fields or in newgroup control
   messages [I-D.ietf-usefor-usepro]).  However, such names MUST be
   accepted by news servers.

   <component>s beginning with "+" and "-" are reserved for private use
   and MUST NOT be generated by user agents (whether in Newsgroups
   header fields or in newgroup control messages [I-D.ietf-usefor-
   usepro]) without a private prior agreement to do so.  However, such
   names MUST be accepted by news servers.

   The following <newsgroup-name>s are reserved, and MUST NOT be used as
   the name of a newsgroup:



Murchison, et al.       Expires November 23, 2006              [Page 20]


Internet-Draft           Netnews Article Format                 May 2006


   o  Groups whose first (or only) component is "example"

   o  The group "poster"

   The following <newsgroup-name>s have been used for specific purposes
   in various implementations and protocols, and therefore MUST NOT be
   used for the names of normal newsgroups.  They MAY be used for their
   specific purpose, or by local agreement.

   o  Groups whose first (or only) component is "to"

   o  Groups whose first (or only) component is "control"

   o  Groups which contain (or consist only of) the component "all"

   o  Groups which contain (or consist only of) the component "ctl"

   o  The group "junk"

      NOTE: "example.*" is reserved for examples in this and other
      standards; "poster" has a special meaning in the Followup-To
      header field; "to.*" is reserved for certain point-to-point
      communications in conjunction with the "ihave" control message
      [I-D.ietf-usefor-usepro]; "control.*" and "junk" have special
      meanings in some news-servers; "all" is used as a wildcard in some
      implementations; and "ctl" was formerly used to indicate a
      <control-command> within the Subject header field.

3.1.6.  Path

   The Path header field indicates the route taken by an article since
   its injection into the Netnews system.  Each agent that processes an
   article is required to prepend at least one <path-identity> to this
   header field body.  This is primarily to enable news servers to avoid
   sending articles to sites already known to have them, in particular
   the site they came from, and additionally to permit tracing the route
   articles take in moving over the network, and for gathering
   statistics.













Murchison, et al.       Expires November 23, 2006              [Page 21]


Internet-Draft           Netnews Article Format                 May 2006


   path            =  "Path:" SP *WSP path-list tail-entry *WSP CRLF

   path-list       =  *( path-identity [FWS] [path-diagnostic] "!" )

   path-diagnostic =  diag-match / diag-other / diag-deprecated

   diag-match      =  "!"                          ; another "!"

   diag-other      =  "!." diag-keyword [ "." diag-identity ] [FWS]

   diag-deprecated =  "!" IPv4address [FWS]

   diag-keyword    =  1*ALPHA                      ; see USEPRO

   diag-identity   =  path-identity / IPv4address / IPv6address

   tail-entry      =  path-nodot
                      ; may be the string "not-for-mail"

   path-identity   =  ( 1*( label "." ) toplabel ) / path-nodot

   path-nodot      =  1*( alphanum / "-" / "_" )   ; legacy names

   label           =  alphanum [ *( alphanum / "-" ) alphanum ]

   toplabel        =  ( [ label *( "-" ) ] ALPHA *( "-" ) label ) /
                      ( label *( "-" ) ALPHA [ *( "-" ) label ] ) /
                      ( label 1*( "-" ) label )

   alphanum        =  ALPHA / DIGIT                ; compare RFC3696

   A <path-identity> is a name identifying a site.  It takes the form of
   a domain name having two or more components separated by dots, or a
   single name with no dots (<path-nodot>).

   Each <path-identity> in the <path-list> (which does not include the
   <tail-entry>) indicates, from right to left, the successive agents
   through which the article has passed.  The use of the <diag-match>,
   which appears as "!!", indicates that the agent to its left verified
   the identity of the agent to its right before accepting the article
   (whereas the <path-delimiter> "!" implies no such claim).

   NOTE: Historically, the <tail-entry> indicated the name of the
   sender.  If not used for this purpose, the string "not-for-mail" is
   often used instead (since at one time the whole path could be used as
   a mail address for the sender).

   NOTE: Although case-insensitive, it is intended that the <path-



Murchison, et al.       Expires November 23, 2006              [Page 22]


Internet-Draft           Netnews Article Format                 May 2006


   keyword>s should be in upper case, to distinguish them from the
   <path-identity>s which are traditionally in lower case.

   A <path-diagnostic> is an item inserted into the Path header field
   for purposes other than to indicate the name of a site.  The use of
   these is described in [I-D.ietf-usefor-usepro].

   NOTE: One usage of a <path-diagnostic> is to record an IP address.
   The fact that IPv6 addresses are allowed means that the colon (:) is
   permitted; note that this may cause interoperability problems at
   older sites that regard ":" as a <path-delimiter> and have neighbors
   whose names have 4 or fewer characters, and where all the characters
   are valid HEX digits.

3.2.  Optional Header Fields

   None of the header fields appearing in this section is required to
   appear in every article but some of them may be required in certain
   types of articles.  Further discussion of these requirements appears
   in [I-D.ietf-usefor-usepro] and [I-D.ietf-usefor-useage].

   The header fields Reply-To, Sender, Comments, and Keywords are used
   in news articles in the same circumstances and with the same meaning
   as that specified in [RFC2822] with the added restrictions detailed
   in Section 2.2.  Multiple occurances of the Keywords header field are
   not permitted.

   sender          =  "Sender:" SP mailbox CRLF

   reply-to        =  "Reply-To:" SP address-list CRLF

   comments        =  "Comments:" SP unstructured CRLF

   keywords        =  "Keywords:" SP phrase *("," phrase) CRLF

   The MIME header fields MIME-Version, Content-Type, Content-Transfer-
   Encoding, Content-Disposition, and Content-Language are used in news
   articles in the same circumstances and with the same meanings as
   those specified in [RFC2045], [RFC2183], and [RFC3282] with the added
   restrictions detailed in Section 2.2.

   All remaining news header fields are described below.

3.2.1.  Injection-Date

   The Injection-Date header field contains the date and time that the
   article was injected into the network.  Its purpose is to prevent the
   reinjection into the news stream of "stale" articles which have



Murchison, et al.       Expires November 23, 2006              [Page 23]


Internet-Draft           Netnews Article Format                 May 2006


   already expired by the time they arrive at some news server.

   This header field MUST be inserted whenever an article is injected.
   However, software that predates this standard does not use this
   header, and therefore agents MUST accept articles without the
   Injection-Date header field.

   injection-date  =  "Injection-Date:" SP date-time CRLF


   See the remarks under Section 3.1.2 regarding the syntax of
   <date-time> and the requirements and recommendations to which it is
   subject.

      NOTE: Since clocks on various agents may not be synchronized, the
      <date-time> in this header field may not be later than the Date
      header field, as would be expected.  Agents SHOULD use the <date-
      time> they believe is correct when adding Inject-Info but SHOULD
      NOT alter the pre-existing Date header field.

   This header field is intended to replace the currently-used but
   undocumented "NNTP-Posting-Date" header field, whose use is now
   deprecated.

3.2.2.  References

   The References header field is the same as that specified in Section
   3.6.4 of [RFC2822] with the added restrictions detailed in
   Section 2.2 and those listed below:

   o  The updated <msg-id> construct defined in Section 3.1.3 MUST be
      used.

   o  Message identifiers MUST be separated with CFWS.

   o  Comments in CFWS between message identifiers can cause
      interoperability problems, so comments SHOULD NOT be generated,
      but MUST be accepted.


   references      =  "References:" SP [CFWS] msg-id *(CFWS msg-id)
                      [CFWS] CRLF

3.2.3.  Followup-To

   The Followup-To header field specifies to which newsgroup(s)
   followups should be posted.  The Followup-To header field SHOULD NOT
   appear in a message, unless its content is different from the content



Murchison, et al.       Expires November 23, 2006              [Page 24]


Internet-Draft           Netnews Article Format                 May 2006


   of the Newsgroups header field.

   followup-to     =  "Followup-To:" SP ( newsgroup-list / poster-text )
                      CRLF

   poster-text     =  *WSP %d112.111.115.116.101.114 *WSP
                      ; "poster" in lower-case

   The syntax is the same as that of the Newsgroups (Section 3.1.5)
   header field, with the exception that the keyword "poster" requests
   that followups should be mailed to the article's reply address rather
   than posted.  Agents MUST generate the keyword "poster" in lower-
   case, but MAY choose to recognize case-insensitive forms such as
   "Poster".

   As in the Newsgroups (Section 3.1.5) header field, [FWS] in the
   <newsgroup-list> SHOULD NOT be generated, but MUST be accepted.

3.2.4.  Expires

   The Expires header field specifies a date and time when the article
   is deemed to be no longer relevant and could usefully be removed
   ("expired").

   expires         =  "Expires:" SP date-time CRLF

   See the remarks under Section 3.1.2 regarding the syntax of
   <date-time> and the requirements and recommendations to which it is
   subject.

      NOTE: The Expires header field is also sometimes used in Email
      with a similar meaning [RFC2156].

3.2.5.  Control

   The Control header field marks the article as a control message, and
   specifies the desired actions (additional to the usual ones of
   storing and/or relaying the article).

   control         =  "Control:" SP *WSP control-command *WSP CRLF

   control-command =  verb *( 1*WSP argument )

   verb            =  token

   argument        =  1*( %x21-7E )

   The verb indicates what action should be taken, and the argument(s)



Murchison, et al.       Expires November 23, 2006              [Page 25]


Internet-Draft           Netnews Article Format                 May 2006


   (if any) supply details.  In some cases, the body of the article may
   also contain details.  The legal verbs and respective arguments are
   discussed in the companion document, [I-D.ietf-usefor-usepro].

   An article with a Control header field MUST NOT also have a
   Supersedes header field.

3.2.6.  Supersedes

   The Supersedes header field contains a message identifier specifying
   an article to be superseded upon the arrival of this one.  An article
   containing a Supersedes header field is equivalent to a "cancel"
   [I-D.ietf-usefor-usepro] control message for the specified article,
   followed immediately by the new article without the Supersedes header
   field.

   supersedes      =  "Supersedes:" SP *WSP msg-id *WSP CRLF

      NOTE: There is no "c" in Supersedes.

      NOTE: The Supersedes header field defined here has no connection
      with the Supersedes header field that sometimes appears in Email
      messages converted from X.400 according to [RFC2156]; in
      particular, the syntax here permits only one <msg-id> in contrast
      to the multiple <msg-id>s in that Email version.

3.2.7.  Distribution

   The Distribution header field specifies geographic or organizational
   limits on an article's propagation.

   distribution    =  "Distribution:" SP dist-list CRLF

   dist-list       =  *WSP dist-name
                      *( [FWS] "," [FWS] dist-name ) *WSP

   dist-name       =  ALPHA / DIGIT
                      *( ALPHA / DIGIT / "+" / "-" / "_" )

   The <dist-name>s "world" and "local" are predefined.  However,
   "world" SHOULD NOT be used explicitly, since it is the default when
   the Distribution header field is absent entirely.

   "All" MUST NOT be used as a <dist-name>. <dist-name>s SHOULD contain
   at least three characters, except when they are two-letter country
   names drawn from [ISO.3166.1988]. <dist-name>s are case-insensitive
   (i.e.  "US", "Us", "uS", and "us" all specify the same distribution).




Murchison, et al.       Expires November 23, 2006              [Page 26]


Internet-Draft           Netnews Article Format                 May 2006


   [FWS] in the <dist-list> SHOULD NOT be generated, but MUST be
   accepted.

3.2.8.  Summary

   The Summary header field is a short phrase summarizing the article's
   content.

   summary         =  "Summary:" SP unstructured CRLF

3.2.9.  Approved

   The Approved header field indicates the mailing addresses (and
   possibly the full names) of the persons or entities approving the
   article for posting.  Its principal uses are in moderated articles
   and in group control messages [I-D.ietf-usefor-usepro].

   approved        =  "Approved:" SP mailbox-list CRLF

   Each mailbox contained in the Approved header field MUST be that of
   one of the person(s) or entity(ies) in question, and one of those
   mailboxes MUST be that of the actual sender of the article.  Note
   that this standard does not provide any means to enforce or verify
   this requirement, but future extensions or standards may provide such
   a facility (e.g. digitial signatures).

3.2.10.  Organization

   The Organization header field is a short phrase identifying the
   poster's organization.

   organization    =  "Organization:" SP unstructured CRLF

   NOTE: There is no "s" in Organization.

3.2.11.  Xref

   The Xref header field indicates where an article was filed by the
   last news server to process it.  The article locations are used to
   keep track of crossposted articles so that user agents serviced by a
   particular news server can mark such articles as read.










Murchison, et al.       Expires November 23, 2006              [Page 27]


Internet-Draft           Netnews Article Format                 May 2006


   xref            =  "Xref:" SP *WSP server-name
                      1*( FWS location ) *WSP CRLF

   server-name     =  path-identity

   location        =  newsgroup-name ":" article-locator

   article-locator =  1*( %x21-27 / %x29-3A / %x3C-7E )
                      ; US-ASCII printable characters
                      ; except '(' and ';'

   The <server-name> is included so that software can determine which
   news server generated the header field.  The locations specify what
   newsgroups the article was filed under (which may differ from those
   in the Newsgroups header field) and where it was filed under them.
   The exact form of an <article-locator> is implementation-specific.

      NOTE: The traditional form of an <article-locator> (as required by
      [I-D.ietf-nntpext-base]) is a decimal number, with articles in
      each newsgroup numbered consecutively starting from 1.

3.2.12.  Archive

   The Archive header field provides an indication of the poster's
   intent regarding preservation of the article in publicly accessible
   long-term or permanent storage.

   archive         =  "Archive:" SP [CFWS] ("no" / "yes")
                      *( [CFWS] ";" [CFWS] archive-param ) [CFWS] CRLF

   archive-param   =  parameter

   The presence of an Archive header field in an article with a field
   body of "no" indicates that the poster does not permit redistribution
   from publicly accessible long-term or permanent archives.  The
   absence of this header field, or the presence of this header field
   with a field body of "yes", indicates that the poster is willing for
   such redistribution to take place.  Further extensions to this
   standard may provide parameters for administration of the archiving
   process.

3.2.13.  User-Agent

   The User-Agent header field contains information about the user agent
   (typically a newsreader) generating the article, for statistical
   purposes and tracing of standards violations to specific software
   needing correction.  It is intended that this header field be
   suitable for use in Email.



Murchison, et al.       Expires November 23, 2006              [Page 28]


Internet-Draft           Netnews Article Format                 May 2006


   user-agent      =  "User-Agent:" SP 1*product [CFWS] CRLF

   product         =  [CFWS] token [ [CFWS] "/" product-version ]

   product-version =  [CFWS] token

   This header field MAY contain multiple product-tokens identifying the
   user agent and any subproducts which form a significant part of it,
   listed in order of their significance for identifying the
   application.

      NOTE: Some of this information has previously been sent in non-
      standardized header fields such as X-Newsreader, X-Mailer,
      X-Posting-Agent, X-Http-User-Agent, and others.  Once a user agent
      uses User-Agent, it should have no need to send these non-standard
      header fields.

      NOTE: [RFC2616] describes a similar facility for the HTTP
      protocol.  This specification differs in that "{" and "}" are
      allowed in tokens (<product> and <product-version>) and comments
      are permitted wherever whitespace is allowed.

3.2.14.  Injection-Info

   The Injection-Info header field contains information provided by the
   injecting news server as to how an article entered the Netnews system
   and to assist in tracing its true origin.  It can also specify one or
   more addresses where complaints concerning the poster of the article
   may be sent.

   injection-info  =  "Injection-Info:" SP [CFWS] path-identity
                      [CFWS] *( ";" [CFWS] parameter ) [CFWS] CRLF

      NOTE: The syntax of <parameter> ([RFC2045] Section 5.1 as amended
      by [RFC2231]), taken in conjunction with the folding rules of
      [RFC0822], effectively allows [CFWS] to occur on either side of
      the "=" inside a <parameter>.

   The following table gives the <attribute> and the format of the
   <value> for each <parameter> defined for use with this header field.
   At most one occurrence of each such <parameter> is allowed.

   <attribute>              format of <value>
   --------------------     -----------------
   "posting-host"           a <host-value>
   "posting-account"        any <value>
   "logging-data"           any <value>
   "mail-complaints-to"     an <address-list>



Murchison, et al.       Expires November 23, 2006              [Page 29]


Internet-Draft           Netnews Article Format                 May 2006


   where

   host-value      =  dot-atom-text / IPv4address / IPv6address /
                      (dot-atom-text ":" ( IPv4address / IPv6address ))

      NOTE: Since any such <host-value> or <address-list> also has to be
      a syntactically correct <value>, it will usually be necessary to
      encapsulate it as a <quoted-string>, for example:

   posting-host = "posting@example.com:192.168.0.1"

   Other <attribute>s SHOULD NOT be used unless defined in extensions to
   this standard.  If non-standards-based <attribute>s are used, they
   MUST begin with an "x-".

   Although comments and folding of white space are permitted throughout
   the Injection-Info header field, folding SHOULD NOT be used within
   any <parameter>.  Folding SHOULD only occur before or after the ";"
   separating <parameter>s, and comments SHOULD only be used following
   the last <parameter>.

      NOTE: Some of this information has previously been sent in non-
      standardized header fields such as NNTP-Posting-Host, X-trace,
      X-Complaints-To, and others.  Once a news server uses Injection-
      Info, it should have no need to send these non-standard header
      fields.

   The "posting-host" <parameter> specifies the FQDN and/or IP address
   (IPv4address or IPv6address) of the host from which the news server
   received the article.

      NOTE: If the "posting-host" <parameter> identifies a dial-up
      point-of-presence, the "posting-account" or the "logging-data"
      <parameter> may provide additional information about the true
      origin of the article.

   The "posting-account" <parameter> identifies the source from which
   that news server received the article, in a notation that can be
   interpreted by the news server administrator.  This notation can
   include any information the administrator deems pertinent.  In order
   to limit the exposure of personal data, it SHOULD be given in a form
   that can't be interpreted by other sites.  However, to make it useful
   for rate limiting and abuse detection, two messages posted from the
   same source SHOULD have the same value of "posting-account", and two
   messages from different sources SHOULD have differing values of
   "posting-account".  The exact definition of "source" is left to the
   discretion of the news server administrator.




Murchison, et al.       Expires November 23, 2006              [Page 30]


Internet-Draft           Netnews Article Format                 May 2006


   The "logging-data" <parameter> contains information (typically a
   session number or other non-persistent means of identifying a posting
   account) that will enable the true origin of the article to be
   determined by reference to logging information kept by the news
   server.

   The "mail-complaints-to" <parameter> specifies one or more mailboxes
   for sending complaints concerning the behavior of the poster of the
   article.

   It is a matter of local policy which <parameter>s to include.  Some
   pieces of information have privacy implications; this is discussed in
   [I-D.ietf-usefor-useage].

3.3.  Obsolete Header Fields

   Early Netnews software sometimes generated header fields such as:

   Relay-Version: version B 2.10 2/13/83; site cbosgd.UUCP
   Posting-Version: version B 2.10 2/13/83; site eagle.UUCP
   Date-Received: Friday, 19-Nov-82 16:59:30 EST

   Relay-Version contained version information about the news server
   that last processed the article.  Posting-Version contained version
   information about the user agent that posted the article.  Date-
   Received contained the date when the last news server to process the
   article first saw it (in a slightly nonstandard format).

   These header fields are mentioned for historical purposes only.
   Agents MUST NOT generate articles containing these header fields.

   In addition, this standard obsoletes certain header fields defined in
   [Son-of-1036]:

   Also-Control: cancel <9urrt98y53@site.example>
   See-Also: <i4g587y@site1.example> <kgb2231+ee@site2.example>
   Article-Names: comp.foo:charter
   Article-Updates: <i4g587y@site1.example>

   Also-Control indicated a control message that was also intended to be
   filed as a normal article.  See-Also listed related articles, but
   without the specific relationship with followups that pertains to the
   References header field.  Article-Names indicated some special
   significance of that article in relation to the indicated newsgroup.
   Article-Updates indicated that an earlier article was updated,
   without at the same time being superseded.

   The header fields listed above are documented for historical purposes



Murchison, et al.       Expires November 23, 2006              [Page 31]


Internet-Draft           Netnews Article Format                 May 2006


   only.  These header fields MUST NOT be generated and SHOULD be
   ignored.

3.3.1.  Lines

   The Lines header field indicates the number of lines in the body of
   the article.

   lines           =  "Lines:" SP *WSP 1*DIGIT *WSP CRLF

   The line count includes all body lines, including the signature if
   any, including empty lines (if any) at the beginning or end of the
   body, and including the whole of all MIME message and multipart parts
   contained in the body (the single empty separator line between the
   header fields and the body is not part of the body).  The "body" here
   is the body as found in the posted article as transmitted by the user
   agent.

   Historically, this header field was used by the [I-D.ietf-nntpext-
   base] Overview facility, but its use for this purpose is now
   deprecated.  As a result, this header field is to be regarded as
   obsolescent, and it is likely to be removed entirely in a future
   version of this standard.  Servers and clients SHOULD ignore it, and
   SHOULD NOT generate it.



























Murchison, et al.       Expires November 23, 2006              [Page 32]


Internet-Draft           Netnews Article Format                 May 2006


4.  Internationalization Considerations

   Internationalization of news article header fields and bodies is
   provided using MIME mechanisms discussed in Section 2.3.  Note that
   the generation of internationalized <newsgroup-name>s for use in
   header fields is not addressed in this document.













































Murchison, et al.       Expires November 23, 2006              [Page 33]


Internet-Draft           Netnews Article Format                 May 2006


5.  Security Considerations

   The news article format specified in this document does not provide
   any security services, such as confidentiality, authentication of
   sender, or non-repudiation.  Instead, such services need to be
   layered above, using such protocols as S/MIME [RFC3851] or PGP/MIME
   [RFC3156], or below, using secure versions of news transport
   protocols.  Additionally, several currently non-standardized
   protocols [PGPVERIFY] will hopefully be standardized in the near
   future.

   Message identifiers (Section 3.1.3) in news are required to be
   unique; articles are refused (in server-to-server transfer) if the
   identifier has already been seen.  So if you can predict the
   identifier of a message, you can preempt it by posting a message
   (possibly to a quite different group) with the same message
   identifier, stopping your target message from propagating.  Agents
   that generate message identifiers for news articles SHOULD ensure
   that they are unpredictable.

   MIME security considerations are discussed in [RFC2046].  Note that
   the full range of encodings allowed for parameters in [RFC2046] and
   [RFC2231] permits constructs that simple parsers may fail to parse
   correctly; examples of hard-to-parse constructs are:

   Content-Type: multipart/mixed
     (; boundary=foo ; xyz=");bOuNdArY*=''next%20part(")

   Content-Type: multipart/digest;
     boundary (not=me) = ("yes ;-) simple (foo;bar") ; x-foo = xyzzy

   Such differences in parsing may be used as part of an attack.



















Murchison, et al.       Expires November 23, 2006              [Page 34]


Internet-Draft           Netnews Article Format                 May 2006


6.  IANA Considerations

   IANA is requested to register the following header fields in the
   Permanent Message Header Field Repository, in accordance with the
   procedures set out in [RFC3864].

      Header field name: Also-Control
      Applicable protocol: netnews
      Status: obsoleted
      Author/change controller: IETF
      Specification document(s): [Son-of-1036] (Section 6.15)

      Header field name: Approved
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.9)

      Header field name: Archive
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.12)

      Header field name: Article-Names
      Applicable protocol: netnews
      Status: obsoleted
      Author/change controller: IETF
      Specification document(s): [Son-of-1036] (Section 6.17)

      Header field name: Article-Updates
      Applicable protocol: netnews
      Status: obsoleted
      Author/change controller: IETF
      Specification document(s): [Son-of-1036] (Section 6.18)

      Header field name: Comments
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2),
      [RFC2822] (Section 3.6.5)

      Header field name: Control
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.5)



Murchison, et al.       Expires November 23, 2006              [Page 35]


Internet-Draft           Netnews Article Format                 May 2006


      Header field name: Date
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.1.2),
      [RFC2822] (Section 3.6.1)

      Header field name: Distribution
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.7)

      Header field name: Expires
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.4)

      Header field name: Followup-To
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.3)

      Header field name: From
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.1.1),
      [RFC2822] (Section 3.6.2)

      Header field name: Injection-Date
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.1)

      Header field name: Injection-Info
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.14)

      Header field name: Keywords
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF



Murchison, et al.       Expires November 23, 2006              [Page 36]


Internet-Draft           Netnews Article Format                 May 2006


      Specification document(s): This document (Section 3.2),
      [RFC2822] (Section 3.6.5)

      Header field name: Lines
      Applicable protocol: netnews
      Status: deprecated
      Author/change controller: IETF
      Specification document(s): This document (Section 3.3.1)
      Related information: [I-D.ietf-nntpext-base] (Section 8.1)

      Header field name: Message-ID
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.1.3)
      Related information: [RFC2822] (Section 3.6.4)

      Header field name: Newsgroups
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.1.5)

      Header field name: NNTP-Posting-Date
      Applicable protocol: netnews
      Status: obsoleted
      Author/change controller: IETF
      Specification document(s): none

      Header field name: NNTP-Posting-Host
      Applicable protocol: netnews
      Status: obsoleted
      Author/change controller: IETF
      Specification document(s): none

      Header field name: Organization
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.10)

      Header field name: Path
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.1.6)





Murchison, et al.       Expires November 23, 2006              [Page 37]


Internet-Draft           Netnews Article Format                 May 2006


      Header field name: References
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.2),
      [RFC2822] (Section 3.6.4)

      Header field name: Reply-To
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2),
      [RFC2822] (Section 3.6.2)

      Header field name: See-Also
      Applicable protocol: netnews
      Status: obsoleted
      Author/change controller: IETF
      Specification document(s): [Son-of-1036] (Section 6.16)

      Header field name: Sender
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2),
      [RFC2822] (Section 3.6.2)

      Header field name: Subject
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.1.4),
      [RFC2822] (Section 3.6.5)

      Header field name: Summary
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.8)

      Header field name: Supersedes
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.6)

      Header field name: User-Agent
      Applicable protocol: netnews



Murchison, et al.       Expires November 23, 2006              [Page 38]


Internet-Draft           Netnews Article Format                 May 2006


      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.13)

      Header field name: Xref
      Applicable protocol: netnews
      Status: standard
      Author/change controller: IETF
      Specification document(s): This document (Section 3.2.11)










































Murchison, et al.       Expires November 23, 2006              [Page 39]


Internet-Draft           Netnews Article Format                 May 2006


7.  References

7.1.  Normative References

   [Errata]   "RFC Editor Errata".

   [RFC2045]  Freed, N. and N. Borenstein, "Multipurpose Internet Mail
              Extensions (MIME) Part One: Format of Internet Message
              Bodies", RFC 2045, November 1996.

   [RFC2046]  Freed, N. and N. Borenstein, "Multipurpose Internet Mail
              Extensions (MIME) Part Two: Media Types", RFC 2046,
              November 1996.

   [RFC2047]  Moore, K., "MIME (Multipurpose Internet Mail Extensions)
              Part Three: Message Header Extensions for Non-ASCII Text",
              RFC 2047, November 1996.

   [RFC2049]  Freed, N. and N. Borenstein, "Multipurpose Internet Mail
              Extensions (MIME) Part Five: Conformance Criteria and
              Examples", RFC 2049, November 1996.

   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119, March 1997.

   [RFC2183]  Troost, R., Dorner, S., and K. Moore, "Communicating
              Presentation Information in Internet Messages: The
              Content-Disposition Header Field", RFC 2183, August 1997.

   [RFC2231]  Freed, N. and K. Moore, "MIME Parameter Value and Encoded
              Word Extensions: Character Sets, Languages, and
              Continuations", RFC 2231, November 1997.

   [RFC2822]  Resnick, P., "Internet Message Format", RFC 2822,
              April 2001.

   [RFC3282]  Alvestrand, H., "Content Language Headers", RFC 3282,
              May 2002.

   [RFC3986]  Berners-Lee, T., Fielding, R., and L. Masinter, "Uniform
              Resource Identifier (URI): Generic Syntax", STD 66,
              RFC 3986, January 2005.

   [RFC4234]  Crocker, D. and P. Overell, "Augmented BNF for Syntax
              Specifications: ABNF", RFC 4234, October 2005.






Murchison, et al.       Expires November 23, 2006              [Page 40]


Internet-Draft           Netnews Article Format                 May 2006


7.2.  Informative References

   [I-D.ietf-nntpext-base]
              Feather, C., "Network News Transfer Protocol",
              draft-ietf-nntpext-base-27 (work in progress), June 2005.

   [I-D.ietf-usefor-useage]
              Lindsey, C., "Usenet Best Practice",
              draft-ietf-usefor-useage-01 (work in progress),
              March 2005.

   [I-D.ietf-usefor-usepro]
              Lindsey, C., "News Article Architecture and Protocols",
              draft-ietf-usefor-usepro-05 (work in progress),
              January 2006.

   [ISO.3166.1988]
              International Organization for Standardization, "Codes for
              the representation of names of countries, 3rd edition",
              ISO Standard 3166, August 1988.

   [PGPVERIFY]
              Lawrence, D., "PGPverify", June 1999.

   [RFC0822]  Crocker, D., "Standard for the format of ARPA Internet
              text messages", STD 11, RFC 822, August 1982.

   [RFC1036]  Horton, M. and R. Adams, "Standard for interchange of
              USENET messages", RFC 1036, December 1987.

   [RFC2156]  Kille, S., "MIXER (Mime Internet X.400 Enhanced Relay):
              Mapping between X.400 and RFC 822/MIME", RFC 2156,
              January 1998.

   [RFC2616]  Fielding, R., Gettys, J., Mogul, J., Frystyk, H.,
              Masinter, L., Leach, P., and T. Berners-Lee, "Hypertext
              Transfer Protocol -- HTTP/1.1", RFC 2616, June 1999.

   [RFC3156]  Elkins, M., Del Torto, D., Levien, R., and T. Roessler,
              "MIME Security with OpenPGP", RFC 3156, August 2001.

   [RFC3851]  Ramsdell, B., "Secure/Multipurpose Internet Mail
              Extensions (S/MIME) Version 3.1 Message Specification",
              RFC 3851, July 2004.

   [RFC3864]  Klyne, G., Nottingham, M., and J. Mogul, "Registration
              Procedures for Message Header Fields", BCP 90, RFC 3864,
              September 2004.



Murchison, et al.       Expires November 23, 2006              [Page 41]


Internet-Draft           Netnews Article Format                 May 2006


   [Son-of-1036]
              Spencer, H., "News Article Format and Transmission",
              June 1994.
















































Murchison, et al.       Expires November 23, 2006              [Page 42]


Internet-Draft           Netnews Article Format                 May 2006


Appendix A.  Acknowledgments

   As this document is the result of an eight year effort, the number of
   people that have contributed to its content are too numerous to
   mention individually.  Many thanks go out to all past and present
   members of the USEFOR Working Group of the Internet Engineering Task
   Force (IETF) and the accompanying mailing list.












































Murchison, et al.       Expires November 23, 2006              [Page 43]


Internet-Draft           Netnews Article Format                 May 2006


Appendix B.  Differences from RFC 1036 and its derivatives

   This appendix contains a list of changes that have been made in the
   Netnews Article Format from earlier standards, specifically
   [RFC1036].

   o  The [RFC2822] conventions for parenthesis-enclosed <comment>s in
      header fields are supported in all newly defined header fields and
      in header fields inherited from [RFC2822].  They are, however,
      still disallowed for performance and/or compatibility reasons in
      the Message-ID, Newsgroups, Path, Followup-To, Control,
      Supersedes, Distribution, Xref and Lines header fields.

   o  Whitespace is permitted in Newsgroups header fields.

   o  An enhanced syntax for the Path header field enables the injection
      point of, and the route taken by an article to be determined with
      more precision.

   o  MIME is recognized as an integral part of Netnews.

   o  There is a new Injection-Date header to make the rejection of
      stale articles more precise and to minimize spurious rejections.

   o  There are several new optional header fields defined, notably
      Archive, Injection-Info and User-Agent, leading to increased
      functionality.

   o  Certain header fields, notably Lines, have been deprecated or made
      obsolete (Section 3.3).

   o  The convention to interpret subjects starting with the word "cmsg"
      as a control message was removed.

   o  There are numerous other small changes, clarifications and
      enhancements.















Murchison, et al.       Expires November 23, 2006              [Page 44]


Internet-Draft           Netnews Article Format                 May 2006


Appendix C.  Differences from RFC 2822

   This appendix lists the differences between the syntax allowed by the
   Netnews Article Format (this document) as compared to the Internet
   Message Format, as specified in [RFC2822].

   The Netnews article format is a strict subset of the Internet Message
   Format; all Netnews articles conform to the syntax of [RFC2822].

   The following restrictions are important:

   o  A SP (space) is REQUIRED after the colon (':') following a header
      field name.

   o  A more restricted syntax of <msg-id> (to be used by the
      Message-ID, References, and Supersedes header fields) is defined.

   o  The length of a <msg-id> MUST NOT exceed 250 octets.

   o  Comments are not allowed in the Message-ID header field.

   o  The CFWS between <msg-id>s in the References header field is not
      optional.

   o  It is legal for a parser to reject obsolete syntax, except that:

      *  The <obs-phrase> construct MUST be accepted.

      *  The obsolete <zone> "GMT" MUST be accepted within a
         <date-time>.

   o  Every line of a header field body (including the first and any
      that are subsequently folded) MUST contain at least one non-
      whitespace character.  This means that an empty header field body
      is illegal.
















Murchison, et al.       Expires November 23, 2006              [Page 45]


Internet-Draft           Netnews Article Format                 May 2006


Authors' Addresses

   Kenneth Murchison (editor)
   Carnegie Mellon University
   5000 Forbes Avenue
   Cyert Hall 285
   Pittsburgh, PA  15213
   US

   Phone: +1 412 268 2638
   Email: murch@andrew.cmu.edu


   Charles H. Lindsey
   University of Manchester
   5 Clerewood Avenue
   Heald Green
   Cheadle
   Cheshire  SK8 3JU
   GB

   Phone: +44 161 436 6131
   Email: chl@clw.cs.man.ac.uk


   Dan Kohn
   FlyDash, Inc.
   425 Alma St. #411
   Palo Alto, CA  94301
   US

   Phone: +1 415 233 1000
   Email: dan@dankohn.com


















Murchison, et al.       Expires November 23, 2006              [Page 46]


Internet-Draft           Netnews Article Format                 May 2006


Intellectual Property Statement

   The IETF takes no position regarding the validity or scope of any
   Intellectual Property Rights or other rights that might be claimed to
   pertain to the implementation or use of the technology described in
   this document or the extent to which any license under such rights
   might or might not be available; nor does it represent that it has
   made any independent effort to identify any such rights.  Information
   on the procedures with respect to rights in RFC documents can be
   found in BCP 78 and BCP 79.

   Copies of IPR disclosures made to the IETF Secretariat and any
   assurances of licenses to be made available, or the result of an
   attempt made to obtain a general license or permission for the use of
   such proprietary rights by implementers or users of this
   specification can be obtained from the IETF on-line IPR repository at
   http://www.ietf.org/ipr.

   The IETF invites any interested party to bring to its attention any
   copyrights, patents or patent applications, or other proprietary
   rights that may cover technology that may be required to implement
   this standard.  Please address the information to the IETF at
   ietf-ipr@ietf.org.


Disclaimer of Validity

   This document and the information contained herein are provided on an
   "AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS
   OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE INTERNET
   ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED,
   INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE
   INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
   WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.


Copyright Statement

   Copyright (C) The Internet Society (2006).  This document is subject
   to the rights, licenses and restrictions contained in BCP 78, and
   except as set forth therein, the authors retain all their rights.


Acknowledgment

   Funding for the RFC Editor function is currently provided by the
   Internet Society.




Murchison, et al.       Expires November 23, 2006              [Page 47]