Network Working Group                                         D. Wessels
Internet-Draft                                                  k claffy
                                                 National Laboratory for
                                                Applied Network Research
                                                                   /UCSD
                                                         5 November 1996
                                                    Expire in six months

                Internet Cache Protocol (ICP), version 2
                   <draft-wessels-icp-v2-00.txt>

Status of this Memo


   This document is an Internet-Draft. Internet-Drafts are working
   documents of the Internet Engineering Task Force (IETF), its areas,
   and its working groups. Note that other groups may also distribute
   working documents as Internet-Drafts.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time. It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as ``work in progress.''

   To learn the current status of any Internet-Draft, please check the
   ``1id-abstracts.txt'' listing contained in the Internet-Drafts Shadow
   Directories on ftp.is.co.za (Africa), nic.nordu.net (Europe),
   munnari.oz.au (Pacific Rim), ds.internic.net (US East Coast), or
   ftp.isi.edu (US West Coast).



Abstract


   This draft document describes the Internet Cache Protocol (ICP)
   currently implemented in a few World-Wide Web proxy cache packages.
   ICP was initially developed by Peter Danzig, et. al. at the
   Univerisity of Southern California.  It evolved as an important part
   of hierarchical caching on the Harvest research project.


Introduction


   ICP is a packet format used for communicating between WWW caches.
   ICP is primarily used in a cache hierarchy to locate specific WWW
   objects in neighbor caches.  One cache will send an ICP query to its
   neighbors.  The neighbors will send back ICP replies indicating a
   ``HIT'' or a ``MISS.''  ICP can also be used to multiplex



Wessels                                                         [Page 1]


Internet-Draft                                               August 1996


   transmission of multiple object streams over a single TCP connection.

   In current practice, ICP is implemented on top of UDP, but there is
   no requirement that it be limited to UDP.  There are some functions
   which work best over UDP, and others which work best over TCP.

   In addition to it use as an object location protocol, the packets ICP
   are often used for ``cache selection.''  Failure to receive a reply
   from a cache may inidicate a network or system failure.  The order in
   which the replies arrive may be used to select the ``closest''
   neighbor.


ICP Packet Format

    0                   1                   2                   3
   0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |     Opcode    |    Version    |        Packet Length        |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                         Request Number                      |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                            Options                          |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                            Padding                          |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                       Sender Host Address                   |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                                                             |
   |                            Payload                          |
   /                                                             /
   /                                                             /
   |                                                             |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

   NOTE: All fields must be converted to network byte order before
   transmission.

   Opcode
      One of the opcodes defined below.

   Version
      The ICP protocol version number.  At the time of this writing,
      both versions two and three are in use.

   Packet Length
      The total length of the ICP packet.




Wessels                                                         [Page 2]


Internet-Draft                                               August 1996


   Request Number
      An opaque identifier.  When set in a query packet, the reply
      packets must contain the same value.

   Options
      A bitfield of options.  See ``ICP Options'' below.

   Padding
      Four bytes of padding; a legacy from previous versions.

   Sender Host Address
      The IPv4 address of the host sending the ICP packet.  This field
      should probably not be trusted over what is  provided by
      getpeername(), accept(), and recvfrom().

   Payload
      The contents of the Payload field vary depending on the Opcode,
      but most often it contains a null-terminated URL string.



ICP Opcodes


   Value    Name
   -----    -----------------
       0    ICP_OP_INVALID
       1    ICP_OP_QUERY
       2    ICP_OP_HIT
       3    ICP_OP_MISS
       4    ICP_OP_ERR
       5    ICP_OP_SEND
       6    ICP_OP_SENDA
       7    ICP_OP_DATABEG
       8    ICP_OP_DATA
       9    ICP_OP_DATAEND
      10    ICP_OP_SECHO
      11    ICP_OP_DECHO
   12-20    UNUSED
      21    ICP_OP_RELOADING
      22    ICP_OP_DENIED
      23    ICP_OP_HIT_OBJ

   ICP_OP_INVALID
      Used to detect invalid packets.

   ICP_OP_QUERY
      A query packet.  NOTE this opcode has a different payload format



Wessels                                                         [Page 3]


Internet-Draft                                               August 1996


      than most of the others.  First is the requestor's IPv4 address,
      followed by a URL.  The requestor address is often left null.

      Payload Format:

       0                   1                   2                   3
      0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
      |                     Requestor Host Address                  |
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
      |                                                             |
      /                       Null-Terminated URL                   /
      /                                                             /
      |                                                             |
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

      In response to an ICP_OP_QUERY, the receipient must return one of:
      ICP_OP_HIT, ICP_OP_MISS, ICP_OP_ERR, ICP_OP_RELOADING,
      ICP_OP_DENIED, or ICP_OP_HIT_OBJ.

   ICP_OP_SECHO
      Similar to ICP_OP_QUERY, but for use in simulating a query to an
      origin server.  When ICP is used to choose the closest neighbor,
      the origin server can be included in the algorithm by bouncing an
      ICP_OP_SECHO packet off it's echo port.  The payload is simply the
      null-terminated URL.

   ICP_OP_DECHO
      Similar to ICP_OP_QUERY, but for use in simulating a query to an
      cache which does not use ICP.  When ICP is used to choose the
      closest neighbor, a non-ICP cache can be included in the algorithm
      by bouncing an ICP_OP_DECHO packet off it's echo port.  The
      payload is simply the null-terminated URL.

      NOTE: one problem with this approach is that while a system's echo
      port may be functioning perfectly, the cache software may not be
      running at all.


   The following six ICP opcodes are sent in response to an ICP_OP_QUERY
   packet.  For each one the payload must be the null-terminated URL
   string.  Both the URL string and the Request Number field must be
   exactly the same as from the ICP_OP_QUERY packet.

   ICP_OP_HIT
      Sending an ICP_OP_HIT response indicates that the requested URL
      exists in this cache.




Wessels                                                         [Page 4]


Internet-Draft                                               August 1996


   ICP_OP_MISS
      Sending an ICP_OP_MISS response indicates that the requested URL
      does not exist in this cache.

   ICP_OP_ERR
      Sending an ICP_OP_ERR indicates some kind of error in parsing or
      handling the query packet.

   ICP_OP_RELOADING
      Sending an ICP_OP_RELOADING response indicates that this cache is
      up, but is in a ``startup'' mode.  A cache in startup mode may
      wish to return ICP_OP_HIT for cache hits, but not ICP_OP_MISS for
      misses.  ICP_OP_RELOADING essentially means ``I am up and running,
      but please don't fetch this URL from me now.''

   ICP_OP_DENIED
      Sending an ICP_OP_DENIED response indicates that the querying site
      is not allowed to retrieve the named object from this cache.
      Caches and proxies may implement complex access controls.  This
      reply can only be interpreted to mean ``you are not allowed to
      request this particular URL from me at this particular time.''

   ICP_OP_HIT_OBJ
      Just like an ICP_OP_HIT response, but the actual object data has
      been included in this reply packet.   Many requested objects are
      small enough that it makes sense to include them in the query
      response and avoid the need to make a subsequent HTTP request for
      the object.

      An ICP_OP_HIT_OBJ reply will be sent only if the ICP_FLAG_HIT_OBJ
      flag is set in the query packet Options field.

      Payload Format:

       0                   1                   2                   3
      0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
      |                                                             |
      /                       Null-Terminated URL                   /
      /                                                             /
      |                                                             |
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
      |         Object Size           |                             |
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+                             |
      |                                                             |
      /                          Object Data                        /
      /                                                             /
      |                                                             |



Wessels                                                         [Page 5]


Internet-Draft                                               August 1996


      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

      The protocol does not impose any limits on the maximum size of the
      Object Data.  Software that the author is familiar with will use
      ICP_OP_HIT_OBJ over ICP_OP_HIT if the entire packet is less than
      the system's maximum UDP transmission size.

      The receiving application should check to make sure it actually
      receives Object Size bytes of data.  If it does not, then it
      should treat the ICP_OP_HIT_OBJ reply as though it were a normal
      ICP_OP_HIT.

      NOTE: the Object Size field does not necessarily begin on a 32-bit
      boundary as shown in the diagram above.  It begins immediately
      following the NULL byte of the URL string.

   The following six opcodes are used for multiplexing object
   transmission over a TCP connection.  These are not in use at the time
   of this writing.

   ICP_OP_SEND
      Request for object data (non authoritative).

   ICP_OP_SENDA
      Request for object data (authoritative).

   ICP_OP_DATABEG
      Beginning of data transmission.

   ICP_OP_DATA
      Middle of data transmission.

   ICP_OP_DATAEND
      End of data transmission.






ICP Options


   0x80000000  ICP_FLAG_HIT_OBJ This flag is set in an ICP_OP_QUERY
   packet indicating that it is okay to respond with an ICP_OP_HIT_OBJ
   packet if the object data will fit.





Wessels                                                         [Page 6]


Internet-Draft                                               August 1996


Missing Features

   Note the lack of an HTTP method in the ICP query.  The current
   version can only be used for GET requests.


Security Considerations

   Security is an issue with ICP over UDP because of its connectionless
   nature.  A proxy/cache must only process ICP packets received from
   its known neighbors.  Packets from unknown addresses must be
   discarded.

   The ICP_OP_HIT_OBJ packet is especially senstive to security issues
   since it contains actual object data in the response.  In combination
   with IP spoofing it is most likely possible to pollute the cache with
   invalid objects.

Author's  Address:

   Duane Wessels
   National Laboratory for Applied Network Research/UCSD
   wessels@nlanr.net

   K Claffy
   National Laboratory for Applied Network Research/UCSD
   kc@nlanr.net
























Wessels                                                         [Page 7]