Skip to main content

Byte Range PATCH
draft-ietf-httpapi-patch-byterange-01

Document Type Active Internet-Draft (httpapi WG)
Author Austin Wright
Last updated 2024-03-19
RFC stream Internet Engineering Task Force (IETF)
Intended RFC status (None)
Formats
Additional resources Mailing list discussion
Stream WG state WG Document
Document shepherd (None)
IESG IESG state I-D Exists
Consensus boilerplate Unknown
Telechat date (None)
Responsible AD (None)
Send notices to (None)
draft-ietf-httpapi-patch-byterange-01
Building Blocks for HTTP APIs                                  A. Wright
Internet-Draft                                             19 March 2024
Intended status: Standards Track                                        
Expires: 20 September 2024

                            Byte Range PATCH
                 draft-ietf-httpapi-patch-byterange-01

Abstract

   This document specifies a media type for PATCH payloads that
   overwrites a specific byte range, to allow random access writes, or
   allow a resource to be uploaded in several segments.

Status of This Memo

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF).  Note that other groups may also distribute
   working documents as Internet-Drafts.  The list of current Internet-
   Drafts is at https://datatracker.ietf.org/drafts/current/.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   This Internet-Draft will expire on 20 September 2024.

Copyright Notice

   Copyright (c) 2024 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents (https://trustee.ietf.org/
   license-info) in effect on the date of publication of this document.
   Please review these documents carefully, as they describe your rights
   and restrictions with respect to this document.  Code Components
   extracted from this document must include Revised BSD License text as
   described in Section 4.e of the Trust Legal Provisions and are
   provided without warranty as described in the Revised BSD License.

Wright                  Expires 20 September 2024               [Page 1]
Internet-Draft              Byte Range PATCH                  March 2024

Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
     1.1.  Notational Conventions  . . . . . . . . . . . . . . . . .   3
   2.  Modifying a content range with PATCH  . . . . . . . . . . . .   3
     2.1.  The Content-Range field . . . . . . . . . . . . . . . . .   4
     2.2.  The Content-Length field  . . . . . . . . . . . . . . . .   5
     2.3.  The Content-Offset field  . . . . . . . . . . . . . . . .   5
     2.4.  The Content-Type field  . . . . . . . . . . . . . . . . .   5
     2.5.  Other fields  . . . . . . . . . . . . . . . . . . . . . .   6
     2.6.  Applying a patch  . . . . . . . . . . . . . . . . . . . .   6
     2.7.  The multipart/byteranges media type . . . . . . . . . . .   6
     2.8.  The message/byterange media type  . . . . . . . . . . . .   7
       2.8.1.  Syntax  . . . . . . . . . . . . . . . . . . . . . . .   8
     2.9.  The application/byteranges media type . . . . . . . . . .   9
       2.9.1.  Syntax  . . . . . . . . . . . . . . . . . . . . . . .   9
     2.10. Range units . . . . . . . . . . . . . . . . . . . . . . .  11
   3.  Preserving Incomplete Uploads with "Prefer: transaction"  . .  11
   4.  Segmented document creation with PATCH  . . . . . . . . . . .  12
     4.1.  Example . . . . . . . . . . . . . . . . . . . . . . . . .  13
   5.  Registrations . . . . . . . . . . . . . . . . . . . . . . . .  14
     5.1.  message/byterange . . . . . . . . . . . . . . . . . . . .  14
     5.2.  application/byteranges  . . . . . . . . . . . . . . . . .  15
     5.3.  Content-Offset  . . . . . . . . . . . . . . . . . . . . .  16
     5.4.  "transaction" preference  . . . . . . . . . . . . . . . .  16
   6.  Security Considerations . . . . . . . . . . . . . . . . . . .  16
     6.1.  Unallocated ranges  . . . . . . . . . . . . . . . . . . .  16
     6.2.  Document Size Hints . . . . . . . . . . . . . . . . . . .  17
   7.  References  . . . . . . . . . . . . . . . . . . . . . . . . .  17
     7.1.  Normative References  . . . . . . . . . . . . . . . . . .  17
     7.2.  Informative References  . . . . . . . . . . . . . . . . .  18
   Appendix A.  Discussion . . . . . . . . . . . . . . . . . . . . .  18
     A.1.  Indeterminate Length Uploads  . . . . . . . . . . . . . .  18
     A.2.  Sparse Documents  . . . . . . . . . . . . . . . . . . . .  19
     A.3.  Recovering from interrupted PUT . . . . . . . . . . . . .  19
     A.4.  Splicing and Binary Diff  . . . . . . . . . . . . . . . .  19
   Author's Address  . . . . . . . . . . . . . . . . . . . . . . . .  19

1.  Introduction

   Filesystem interfaces typically provide some way to write at a
   specific position in a file.  While HTTP supports reading byte range
   offsets using the Range header (Section 14 of [RFC9110]), this
   technique cannot generally be used in PUT, because the server may
   ignore the Content-Range header while executing the write, causing
   data corruption.  However, by using a method and media type that the
   server must understand, writes to byte ranges with Content-Range
   semantics becomes possible even when server support is undetermined.

Wright                  Expires 20 September 2024               [Page 2]
Internet-Draft              Byte Range PATCH                  March 2024

   This media type is intended for use in a wide variety of applications
   where overwriting specific parts of the file is desired.  This
   includes idempotently writing data to a stream, appending data to a
   file, overwriting specific byte ranges, or writing to multiple
   regions in a single operation (for example, appending audio to a
   recording in progress while updating metadata at the beginning of the
   file).

1.1.  Notational Conventions

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
   "OPTIONAL" in this document are to be interpreted as described in BCP
   14 [RFC2119] [RFC8174] when, and only when, they appear in all
   capitals, as shown here.

   This document uses ABNF as defined in [RFC5234] and imports grammar
   rules from [RFC8941] and [RFC9112].

   For brevity, example HTTP requests or responses may add newlines or
   whitespace, or omit some headers necessary for message transfer.

   The term "byte" is used in the [RFC9110] sense to mean "octet."
   Ranges are zero-indexed and inclusive.  For example, "bytes 0-0"
   means the first byte of the document, and "bytes 1-2" is a range with
   two bytes, starting one byte into the document.  Ranges of zero bytes
   are described by an address offset rather than a range.  For example,
   "at byte 5" would separate the byte ranges 0-4 and 5-9.

2.  Modifying a content range with PATCH

   The Content-Range field in a PUT request requires support by the
   server in order to be processed correctly.  Without specific support,
   the server's normal behavior would be to ignore the header, replacing
   the entire resource with just the part that changed, causing data
   corruption.  To mitigate this, Content-Range may be used in
   conjunction with the PATCH method [RFC5789] as part of a media type
   whose semantics are to write at the specified byte offset.  This
   document re-uses the "multipart/byteranges" media type, and defines
   the "message/byterange" media type, for this purpose.

   A byte range patch lists one or more _parts_. Each part specifies two
   essential components:

Wright                  Expires 20 September 2024               [Page 3]
Internet-Draft              Byte Range PATCH                  March 2024

   1.  Part fields: a list of HTTP fields that specify metadata,
       including the range being written to, the length of the body, and
       information about the target document that cannot be listed in
       the PATCH headers, e.g.  Content-Type (where it would describe
       the patch itself, rather than the document being updated).

   2.  A part body: the actual data to write to the specified location.

   Each part MUST indicate a single contiguous range to be written to.
   Servers MUST reject byte range patches that don't contain a known
   range with a 422 or 400 error.  (This would mean the client may be
   using a yet-undefined mechanism to specify the target range.)

   The simplest form to represent a byte range patch is the "message/
   byterange" media type, which is similar to an HTTP message:

   Content-Range: bytes 2-5/12

   cdef

   This patch represents an instruction to write the four bytes "cdef"
   at an offset of 2 bytes.  A document listing the digits 0-9 in a row,
   would look like this after applying the patch:

   01cdef6789␍␊

   Although this example is a text document with a line terminator,
   patches are only carried as binary data, and can potentially carry or
   overwrite parts of multi-byte characters.

2.1.  The Content-Range field

   The Content-Range field (as seen inside a patch document) is used to
   specify where in the target document the part body will be written.

   The client MAY indicate the anticipated final size of the document by
   providing the complete-length form, for example bytes 0-11/12.  This
   value of complete-length does not affect the write, however the
   server MAY use it for other purposes, especially for preallocating an
   optimal amount of space, and deciding when an upload in multiple
   parts has finished.

   If the client does not know or care about the final length of the
   document, it MAY use * in place of complete-length.  For example,
   bytes 0-11/*. Most random access writes will follow this form.

Wright                  Expires 20 September 2024               [Page 4]
Internet-Draft              Byte Range PATCH                  March 2024

   The unsatisfied-range form (e.g. bytes */1000) sets the size of the
   document, but without writing any data.  It may be used to truncate a
   document (to represent inverse of an append operation), or to
   allocate space for a document without specifying any of its contents.

   If the "last-pos" is is unknown because the upload is indeterminate
   length (the Content-Length of the request is not known from the
   start, or the upload might continue indefinitely), then the Content-
   Offset field must be used instead.

2.2.  The Content-Length field

   A "Content-Length" part field, if provided, describes the length of
   the part body.  (To describe the size of the entire target resource,
   see the Content-Range field.)

   If provided, it MUST exactly match the length of the range specified
   in the Content-Range field, and servers MUST error when the Content-
   Length mismatches the length of the range.

2.3.  The Content-Offset field

   The Content-Range field specifies an offset to write the content at,
   when the end of the write is not known.  It is used instead of
   Content-Range when the Content-Length is not known.

   Its syntax is specified as a Structured Field [RFC8941]:

   Content-Offset = sf-item

   The value indicates how much of the target document is to be skipped
   over, typically the number of bytes.  It MUST be an sf-integer.

   It accepts two parameters:

   The "unit" parameter indicates the unit associated with the value.  A
   missing "unit" parameter is equivelant to providing unit=bytes.

   The "complete-length" parameter is equivelant to the complete-length
   value in Content-Range.  When provided, it MUST be an sf-integer
   specifying the intended final length of the document.  When missing,
   the complete-length is unknown.

2.4.  The Content-Type field

   A "Content-Type" part field MUST have the same effect as if provided
   in a PUT request uploading the entire resource (patch applied).  Its
   use is typically limited to creating resources.

Wright                  Expires 20 September 2024               [Page 5]
Internet-Draft              Byte Range PATCH                  March 2024

2.5.  Other fields

   Other part fields in the patch document SHOULD have the same meaning
   as if it is a message-level header in a PUT request uploading the
   entire resource (patch applied).

   Use of such fields SHOULD be limited to cases where the meaning in
   the HTTP request headers would be different, where they would
   describe the entire patch, rather than the part.  For example, the
   "Content-Type" field.

2.6.  Applying a patch

   Servers SHOULD NOT accept requests that write beyond, and not
   adjacent to, the end of the resource.  This would create a sparse
   file, where some bytes are undefined.  For example, writing at byte
   601 of a resource where bytes 0-599 are defined; this would leave
   byte 600 undefined.  Servers that accept sparse writes MUST NOT
   disclose uninitialized content, and SHOULD fill in undefined regions
   with zeros.

   The expected length of the write can be computed from the part
   fields.  If the actual length of the part body mismatches the
   expected length, this MUST be treated the same as a network
   interruption at the shorter length, but anticipating the longer
   length.  Recovering from this interruption may involve rolling back
   the entire request, or saving as many bytes as possible.  The client
   can then recover as it would recover from a network interruption.

2.7.  The multipart/byteranges media type

   The following is a request with a "multipart/byteranges" body to
   write two ranges in a document:

Wright                  Expires 20 September 2024               [Page 6]
Internet-Draft              Byte Range PATCH                  March 2024

   PATCH /uploads/foo HTTP/1.1
   Content-Type: multipart/byteranges; boundary=THIS_STRING_SEPARATES
   Content-Length: 206
   If-Match: "xyzzy"
   If-Unmodified-Since: Sat, 29 Oct 1994 19:43:31 GMT

   --THIS_STRING_SEPARATES
   Content-Range: bytes 2-6/25
   Content-Type: text/plain

   23456
   --THIS_STRING_SEPARATES
   Content-Range: bytes 17-21/25
   Content-Type: text/plain

   78901
   --THIS_STRING_SEPARATES--

   The syntax for multipart messages is defined in [RFC2046],
   Section 5.1.1.  While the body cannot contain the boundary, servers
   MAY use the Content-Length field to skip to the boundary (potentially
   ignoring a boundary in the body, which would be an error by the
   client).

   The multipart/byteranges type may be used for operations where
   multiple regions must be updated at the same time; clients expect all
   the changes to be recorded as a single operation, or that if there's
   an interruption, all of the parts from the request will be rolled
   back.

2.8.  The message/byterange media type

   When making a request with a single byte range, there is no need for
   a multipart boundary marker.  This document defines a new media type
   "message/byterange" with the same semantics as a single byte range in
   a multipart/byteranges message, but with a simplified syntax.

   The "message/byterange" form may be used in a request as so:

Wright                  Expires 20 September 2024               [Page 7]
Internet-Draft              Byte Range PATCH                  March 2024

   PATCH /uploads/foo HTTP/1.1
   Content-Type: message/byterange
   Content-Length: 272
   If-Match: "xyzzy"
   If-Unmodified-Since: Sat, 29 Oct 1994 19:43:31 GMT

   Content-Range: bytes 100-299/600
   Content-Type: text/plain

   [200 bytes...]

   This represents a request to modify a 600-byte document, overwriting
   200 bytes of it, starting at a 100-byte offset.

2.8.1.  Syntax

   The syntax re-uses concepts from the "multipart/byteranges" media
   type, except it omits the multipart separator, and so only allows a
   single range to be specified.  It is also similar to the "message/
   http" media type, except the first line (the status line or request
   line) is omitted; a message/byterange document can be fed into a
   message/http parser by first prepending a line like "PATCH /
   HTTP/1.1".

   It follows the syntax of HTTP message headers and body.  It MUST
   include the Content-Range header field.  If the message length is
   known by the sender, it SHOULD contain the Content-Length header
   field.  Unknown or nonapplicable header fields MUST be ignored.

   The field-line and message-body productions are specified in
   [RFC9112].

   byterange-document = *( field-line CRLF )
                        CRLF
                        [ message-body ]

   This document has the same semantics as a single part in a
   "multipart/byteranges" document (Section 5.1.1 of [RFC2046]) or any
   response with a 206 (Partial Content) status code (Section 15.3.7 of
   [RFC9110]).  A "message/byterange" document may be trivially
   transformed into a "multipart/byteranges" document by prepending a
   dash-boundary and CRLF, and appending a close-delimiter (a CRLF,
   dash-boundary, terminating "--", and optional CRLF).

Wright                  Expires 20 September 2024               [Page 8]
Internet-Draft              Byte Range PATCH                  March 2024

2.9.  The application/byteranges media type

   The "application/byteranges" has the same semantics as "multipart/
   byteranges" but follows a binary format similar to "message/bhttp"
   [RFC9292], which may be more suitable for some clients and servers,
   as all variable length strings are tagged with their length.

2.9.1.  Syntax

   Parsing starts by looking for a "Known-Length Message" or an
   "Indeterminate-Length Message".  One or the other is distinguished by
   the different Framing Indicator.

   The remainder of the message is parsed by reading fields, then the
   content, by a method depending on if the message is known-length or
   indeterminate-length.  If there are additional parts, they begin
   immediately after the end of a Content.

   The "Known-Length Field Section", "Known-Length Content",
   "Indeterminate-Length Field Section", "Indeterminate-Length Content",
   "Indeterminate-Length Content Chunk", "Field Line" definitions are
   identical to their definition in message/bhttp.  They are used in one
   or more Known-Length Message and/or Indeterminate-Length Message
   productions, concatenated together.

Wright                  Expires 20 September 2024               [Page 9]
Internet-Draft              Byte Range PATCH                  March 2024

   Patch {
     Message (..) ...
   }

   Known-Length Message {
     Framing Indicator (i) = 8,
     Known-Length Field Section (..),
     Known-Length Content (..),
   }

   Indeterminate-Length Message  {
     Framing Indicator (i) = 10,
     Indeterminate-Length Field Section (..),
     Indeterminate-Length Content (..),
   }

   Known-Length Field Section {
     Length (i),
     Field Line (..) ...,
   }

   Known-Length Content {
     Content Length (i),
     Content (..),
   }

   Indeterminate-Length Field Section {
     Field Line (..) ...,
     Content Terminator (i) = 0,
   }

   Indeterminate-Length Content {
     Indeterminate-Length Content Chunk (..) ...,
     Content Terminator (i) = 0,
   }

   Indeterminate-Length Content Chunk {
     Chunk Length (i) = 1..,
     Chunk (..),
   }

   Field Line {
     Name Length (i) = 1..,
     Name (..),
     Value Length (i),
     Value (..),
   }

Wright                  Expires 20 September 2024              [Page 10]
Internet-Draft              Byte Range PATCH                  March 2024

2.10.  Range units

   Currently, the only defined range unit is "bytes", however this may
   be other, yet-to-be-defined values.

   In the case of "bytes", the bytes that are read are exactly the same
   as the bytes that are changed.  However, other units may define write
   semantics different from a read, if symmetric behavior would not make
   sense.  For example, if a Content-Range field adds an item in a JSON
   array, this write may add a leading or trailing comma, not
   technically part of the item itself, in order to keep the resulting
   document well-formed.

   Even though the length in alternate units isn't changed, the byte
   length might.  This might only be acceptable to servers storing these
   values in a database or memory structure, rather than on a byte-based
   filesystem.

3.  Preserving Incomplete Uploads with "Prefer: transaction"

   The stateless design of HTTP generally implies that a request is
   atomic (otherwise parties would need to keep track of the state of a
   request while it's in progress).  Clients need not need be concerned
   with the side-effects of only the first half of an upload being
   honored, if there's an error partway through.

   However, some clients may desire partial state changes, particularly
   when remaking the upload is more expensive than recovering from an
   interruption.  In these cases, clients will want an incomplete upload
   to be preserved as much as possible, so they may re-synchronize the
   state and pick up from where the incomplete request was terminated.

   The client's preference for atomic or upload-preserving behavior may
   be signaled by a Prefer header:

   Prefer: transaction=atomic
   Prefer: transaction=persist

   The transaction=atomic preference indicates that the request SHOULD
   apply only when a successful response is returned, and not any time
   before the end of the upload.

   The transaction=persist preference indicates that uploaded data
   SHOULD be continuously stored as soon as possible, so that if the
   upload is interrupted, it is possible to resume the upload from where
   it left off.

Wright                  Expires 20 September 2024              [Page 11]
Internet-Draft              Byte Range PATCH                  March 2024

   This preference is generally applicable to any HTTP request (and not
   merely for PATCH or byte range patches).  Servers SHOULD indicate
   when this preference was honored, using a "Preference-Applied"
   response header.  For example:

   Preference-Applied: transaction=persist

   Servers may consider signaling this in a 103 Early Hints response,
   since once the final response is written, this may no longer be
   useful information.

4.  Segmented document creation with PATCH

   As an alternative to using PUT to create a new resource, the contents
   of a resource may be uploaded in segments, written across several
   PATCH requests.

   A user-agent may also use PATCH to recover from an interrupted PUT
   request, if it was expected to create a new resource.  The server
   will store the data sent to it by the user agent, but will not
   finalize the upload until the final length of the document is known
   and received.

   1.  The client makes a PUT or PATCH request to a URL, a portion of
       which is generated by the client, to be unpredictable to other
       clients.  This first request creates the resource, and should
       include If-None-Match: * to verify the target does not exist.  If
       a PUT request, the server reads the Content-Length header and
       stores the intended final length of the document.  If a PATCH
       request, the "Content-Range" field in the "message/byterange"
       patch is read for the final length.  The final length may also be
       undefined, and defined in a later request.

   2.  If any request is interrupted, the client may make a HEAD request
       to determine how much, if any, of the previous response was
       stored, and resumes uploading from that point.  The server will
       return 200 (OK), but this may only indicate the write has been
       saved; the server is not obligated to begin acting on the upload
       until it is complete.

Wright                  Expires 20 September 2024              [Page 12]
Internet-Draft              Byte Range PATCH                  March 2024

   3.  If the client sees from the HEAD response that additional data
       remains to be uploaded, it may make a PATCH request to resume
       uploading.  Even if no data was uploaded or the resource was not
       created, the client should attempt creating the resource with
       PATCH to mitigate the possibility of another interrupted
       connection with a server that does not save incomplete transfers.
       However if in response to PATCH, the server reports 405 (Method
       Not Allowed), 415 (Unsupported Media Type), or 501 (Not
       Implemented), then the client must resort to a PUT request.

   4.  The server detects the completion of the final request when the
       current received data matches the indicated final length.  For
       example, a Content-Range: 500-599/600 field is a write at the end
       of the resource.  The server processes the upload and returns a
       response for it.

   For building POST endpoints that support large uploads, clients can
   first upload the data to a scratch file as described above, and then
   process by submitting a POST request that links to the scratch file.

   For updating an existing large file, the client can upload to a
   scratch file, then execute a MOVE (Section 9.9 of [RFC4918]) over the
   intended target.

4.1.  Example

   A single PUT request that creates a new resource may be split apart
   into multiple PATCH requests.  Here is an example that uploads a
   600-byte document across three 200-byte segments.

   The first PATCH request creates the resource:

   PATCH /uploads/foo HTTP/1.1
   Content-Type: message/byterange
   Content-Length: 281
   If-None-Match: *

   Content-Range: bytes 0-199/600
   Content-Type: text/plain
   Content-Length: 200

   [200 bytes...]

   This request allocates a 600 byte document, and uploading the first
   200 bytes of it.  The server responds with 200, indicating that the
   complete upload was stored.

   Additional requests upload the remainder of the document:

Wright                  Expires 20 September 2024              [Page 13]
Internet-Draft              Byte Range PATCH                  March 2024

   PATCH /uploads/foo HTTP/1.1
   Content-Type: message/byterange
   Content-Length: 283
   If-None-Match: *

   Content-Range: bytes 200-399/600
   Content-Type: text/plain
   Content-Length: 200

   [200 bytes...]

   This second request also returns 200 (OK).

   A third request uploads the final portion of the document:

   PATCH /uploads/foo HTTP/1.1
   Content-Type: message/byterange
   Content-Length: 283
   If-None-Match: *

   Content-Range: bytes 200-399/600
   Content-Type: text/plain
   Content-Length: 200

   [200 bytes...]

   The server responds with 200 (OK).  Since this completely writes out
   the 600-byte document, the server may also perform final processing,
   for example, checking that the document is well formed.  The server
   MAY return an error code if there is a syntax or other error, or in
   an earlier response as soon as it it able to detect an error, however
   the exact behavior is left undefined.

5.  Registrations

5.1.  message/byterange

   Type name:  message

   Subtype name:  byterange

   Required parameters:  N/A

   Optional parameters:  N/A

   Encoding considerations:  binary

   Security considerations:  See Section 6

Wright                  Expires 20 September 2024              [Page 14]
Internet-Draft              Byte Range PATCH                  March 2024

   Interoperability considerations:  See Section 2.8 of this document

   Published specification:  This document

   Applications that use this media type:  HTTP applications that
      process filesystem-like writes to locations within a resource.

   Fragment identifier considerations:  N/A

   Additional information:  Deprecated alias names for this type:  N/A
                            Magic number(s):  N/A
                            File extension(s):  N/A
                            Macintosh file type code(s):  N/A

   Person and email address to contact for further information:  See Aut
      hors' Addresses section.

   Intended usage:  COMMON

   Restrictions on usage:  None.

   Author:  See Authors' Addresses section.

   Change controller:  IESG

5.2.  application/byteranges

   Type name:  application

   Subtype name:  byteranges

   Required parameters:  N/A

   Optional parameters:  N/A

   Encoding considerations:  binary

   Security considerations:  See Section 6

   Interoperability considerations:  See Section 2.9 of this document

   Published specification:  This document

   Applications that use this media type:  HTTP applications that
      process filesystem-like writes to locations within a resource.

   Fragment identifier considerations:  N/A

Wright                  Expires 20 September 2024              [Page 15]
Internet-Draft              Byte Range PATCH                  March 2024

   Additional information:  Deprecated alias names for this type:  N/A
                            Magic number(s):  N/A
                            File extension(s):  N/A
                            Macintosh file type code(s):  N/A

   Person and email address to contact for further information:  See Aut
      hors' Addresses section.

   Intended usage:  COMMON

   Restrictions on usage:  None.

   Author:  See Authors' Addresses section.

   Change controller:  IESG

5.3.  Content-Offset

   Field name:  Content-Offset

   Status: permanent

   Specification document:  This document.

5.4.  "transaction" preference

   Preference:  transaction

   Value:  either "atomic" or "persist"

   Description:  Specify if the client would prefer incomplete uploads
      to be saved, or committed only on success.

   Reference:  Section 3

6.  Security Considerations

6.1.  Unallocated ranges

   A byterange patch may permit writes to offsets beyond the end of the
   resource.  This may have non-obvious behavior.

   Servers supporting sparse files MUST NOT return uninitialized memory
   or storage contents.  Uninitialized regions may be initialized prior
   to executing the write, or this may be left to the filesystem if it
   can guarantee that unallocated space will be read as a constant
   value.

Wright                  Expires 20 September 2024              [Page 16]
Internet-Draft              Byte Range PATCH                  March 2024

   If a server fills in unallocated space by initializing it, servers
   SHOULD protect against patches that make writes to very large
   offsets.  Servers may account for this by treating it as a write by
   the client, similar to "Document Size Hints" below.

6.2.  Document Size Hints

   A byte range patch is, overall, designed to require server resources
   that's proportional to the patch size.  One possible exception to
   this rule is the complete-length part of the Content-Range field,
   which hints at the final upload size.  Generally, this does not
   require the server to (immediately) allocate this amount of data.
   However, some servers may choose to begin preallocating disk space
   right away, which could be a very expensive operation compared to the
   actual size of the request.

   In general, servers SHOULD treat the complete-length hint the same as
   a PUT request of that size, and issue a 400 (Client Error).
   // 413 (Payload Too Large) might not be appropriate for this
   // situation, as it would indicate the patch is too large and the
   // client should break up the patches into smaller chunks, rather
   // than the intended final upload size being too large.

7.  References

7.1.  Normative References

   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119,
              DOI 10.17487/RFC2119, March 1997,
              <https://www.rfc-editor.org/rfc/rfc2119>.

   [RFC5234]  Crocker, D., Ed. and P. Overell, "Augmented BNF for Syntax
              Specifications: ABNF", STD 68, RFC 5234,
              DOI 10.17487/RFC5234, January 2008,
              <https://www.rfc-editor.org/rfc/rfc5234>.

   [RFC8174]  Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC
              2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174,
              May 2017, <https://www.rfc-editor.org/rfc/rfc8174>.

   [RFC8941]  Nottingham, M. and P. Kamp, "Structured Field Values for
              HTTP", RFC 8941, DOI 10.17487/RFC8941, February 2021,
              <https://www.rfc-editor.org/rfc/rfc8941>.

Wright                  Expires 20 September 2024              [Page 17]
Internet-Draft              Byte Range PATCH                  March 2024

   [RFC9110]  Fielding, R., Ed., Nottingham, M., Ed., and J. Reschke,
              Ed., "HTTP Semantics", STD 97, RFC 9110,
              DOI 10.17487/RFC9110, June 2022,
              <https://www.rfc-editor.org/rfc/rfc9110>.

   [RFC9112]  Fielding, R., Ed., Nottingham, M., Ed., and J. Reschke,
              Ed., "HTTP/1.1", STD 99, RFC 9112, DOI 10.17487/RFC9112,
              June 2022, <https://www.rfc-editor.org/rfc/rfc9112>.

7.2.  Informative References

   [RFC2046]  Freed, N. and N. Borenstein, "Multipurpose Internet Mail
              Extensions (MIME) Part Two: Media Types", RFC 2046,
              DOI 10.17487/RFC2046, November 1996,
              <https://www.rfc-editor.org/rfc/rfc2046>.

   [RFC4918]  Dusseault, L., Ed., "HTTP Extensions for Web Distributed
              Authoring and Versioning (WebDAV)", RFC 4918,
              DOI 10.17487/RFC4918, June 2007,
              <https://www.rfc-editor.org/rfc/rfc4918>.

   [RFC5789]  Dusseault, L. and J. Snell, "PATCH Method for HTTP",
              RFC 5789, DOI 10.17487/RFC5789, March 2010,
              <https://www.rfc-editor.org/rfc/rfc5789>.

   [RFC9292]  Thomson, M. and C. A. Wood, "Binary Representation of HTTP
              Messages", RFC 9292, DOI 10.17487/RFC9292, August 2022,
              <https://www.rfc-editor.org/rfc/rfc9292>.

Appendix A.  Discussion

   // This section to be removed before final publication.

A.1.  Indeterminate Length Uploads

   There is no standard way for a Content-Range header to indicate an
   unknown or indeterminate-length body starting at a certain offset;
   the design of partial content messages requires that the sender know
   the total length before transmission.  However it seems it should be
   possible to generate an indeterminate-length partial content response
   (e.g. return a continuously growing audio file starting at a 4MB
   offset).  Fixing this would require a new header, update to HTTP, or
   a revision of HTTP.

Wright                  Expires 20 September 2024              [Page 18]
Internet-Draft              Byte Range PATCH                  March 2024

A.2.  Sparse Documents

   This pattern can enable multiple, parallel uploads to a document at
   the same time.  For example, uploading a large log file from multiple
   devices.  However, this document does not define any ways for clients
   to track the unwritten regions in sparse documents, and the existing
   conditional request headers are designed to cause conflicts.
   Parallel uploads may require a byte-level locking scheme or conflict-
   free operators.  This may be addressed in a later document.

A.3.  Recovering from interrupted PUT

   Servers do not necessarily save the results of an incomplete upload;
   since most clients prefer atomic writes, many servers will discard an
   incomplete upload.  A mechanism to indicate a preference for atomic
   vs. non-atomic writes may be defined at a later time.

   Byte range PATCH cannot by itself be used to recover from an
   interrupted PUT that updates an existing document.  If the server
   operation is atomic, the entire operation will be lost.  If the
   server saves the upload, it may not possible to know how much of the
   request was received by the server, and what was old content that
   already existed.

   One technique would be to use a 1xx interim response to indicate a
   location where the partial upload is being stored.  If PUT request is
   interrupted, the client can make PATCH requests to this temporary,
   non-atomic location to complete the upload.  When the last part is
   uploaded, the original interrupted PUT request will finish.

A.4.  Splicing and Binary Diff

   Operations more complicated than standard filesystem operations are
   out of scope for this media type.  A feature of byte range patch is
   an upper limit on the complexity of applying the patch.  In contrast,
   prepending, splicing, replace, or other complicated file operations
   could potentially require the entire file on disk be rewritten.

   Consider registering a media type for VCDIFF in this document, under
   the topic of "Media type registrations for byte-level patching".

Author's Address

   Austin Wright
   Email: aaa@bzfx.net

Wright                  Expires 20 September 2024              [Page 19]