draft-ietf-tsvwg-port-randomization-04

Transport Area Working Group                                   M. Larsen
(tsvwg)                                                      TietoEnator
Internet-Draft                                                   F. Gont
Intended status: BCP                                             UTN/FRH
Expires: January 3, 2010                                    July 2, 2009


                           Port Randomization
                 draft-ietf-tsvwg-port-randomization-04

Status of this Memo

   This Internet-Draft is submitted to IETF in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as Internet-
   Drafts.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/ietf/1id-abstracts.txt.

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html.

   This Internet-Draft will expire on January 3, 2010.

Copyright Notice

   Copyright (c) 2009 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents in effect on the date of
   publication of this document (http://trustee.ietf.org/license-info).
   Please review these documents carefully, as they describe your rights
   and restrictions with respect to this document.








Larsen & Gont            Expires January 3, 2010                [Page 1]


Internet-Draft             Port Randomization                  July 2009


Abstract

   Recently, awareness has been raised about a number of "blind" attacks
   that can be performed against the Transmission Control Protocol (TCP)
   and similar protocols.  The consequences of these attacks range from
   throughput-reduction to broken connections or data corruption.  These
   attacks rely on the attacker's ability to guess or know the five-
   tuple (Protocol, Source Address, Destination Address, Source Port,
   Destination Port) that identifies the transport protocol instance to
   be attacked.  This document describes a number of simple and
   efficient methods for the selection of the client port number, such
   that the possibility of an attacker guessing the exact value is
   reduced.  While this is not a replacement for cryptographic methods
   for protecting the connection, the described port number obfuscation
   algorithms provide improved security/obfuscation with very little
   effort and without any key management overhead.  The algorithms
   described in this document are local policies that may be
   incrementally deployed, and that do not violate the specifications of
   any of the transport protocols that may benefit from them, such as
   TCP, UDP, UDP-lite, SCTP, DCCP, and RTP.































Larsen & Gont            Expires January 3, 2010                [Page 2]


Internet-Draft             Port Randomization                  July 2009


Table of Contents

   1.  Introduction . . . . . . . . . . . . . . . . . . . . . . . . .  4
   2.  Ephemeral Ports  . . . . . . . . . . . . . . . . . . . . . . .  6
     2.1.  Traditional Ephemeral Port Range . . . . . . . . . . . . .  6
     2.2.  Ephemeral port selection . . . . . . . . . . . . . . . . .  6
     2.3.  Collision of connection-id's . . . . . . . . . . . . . . .  7
   3.  Obfuscating the Ephemeral Ports  . . . . . . . . . . . . . . .  9
     3.1.  Characteristics of a good ephemeral port obfuscation
           algorithm  . . . . . . . . . . . . . . . . . . . . . . . .  9
     3.2.  Ephemeral port number range  . . . . . . . . . . . . . . . 10
     3.3.  Ephemeral Port Obfuscation Algorithms  . . . . . . . . . . 11
       3.3.1.  Algorithm 1: Simple port randomization algorithm . . . 11
       3.3.2.  Algorithm 2: Another simple port randomization
               algorithm  . . . . . . . . . . . . . . . . . . . . . . 13
       3.3.3.  Algorithm 3: Simple hash-based algorithm . . . . . . . 13
       3.3.4.  Algorithm 4: Double-hash obfuscation algorithm . . . . 15
       3.3.5.  Algorithm 5: Random-increments port selection
               algorithm  . . . . . . . . . . . . . . . . . . . . . . 17
     3.4.  Secret-key considerations for hash-based port
           obfuscation algorithms . . . . . . . . . . . . . . . . . . 19
     3.5.  Choosing an ephemeral port obfuscation algorithm . . . . . 20
   4.  Port obfuscation and Network Address Port Translation
       (NAPT) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
   5.  Security Considerations  . . . . . . . . . . . . . . . . . . . 23
   6.  Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . 24
   7.  References . . . . . . . . . . . . . . . . . . . . . . . . . . 25
     7.1.  Normative References . . . . . . . . . . . . . . . . . . . 25
     7.2.  Informative References . . . . . . . . . . . . . . . . . . 26
   Appendix A.  Survey of the algorithms in use by some popular
                implementations . . . . . . . . . . . . . . . . . . . 28
     A.1.  FreeBSD  . . . . . . . . . . . . . . . . . . . . . . . . . 28
     A.2.  Linux  . . . . . . . . . . . . . . . . . . . . . . . . . . 28
     A.3.  NetBSD . . . . . . . . . . . . . . . . . . . . . . . . . . 28
     A.4.  OpenBSD  . . . . . . . . . . . . . . . . . . . . . . . . . 28
     A.5.  OpenSolaris  . . . . . . . . . . . . . . . . . . . . . . . 28
   Appendix B.  Changes from previous versions of the draft (to
                be removed by the RFC Editor before publication
                of this document as a RFC . . . . . . . . . . . . . . 29
     B.1.  Changes from draft-ietf-tsvwg-port-randomization-03  . . . 29
     B.2.  Changes from draft-ietf-tsvwg-port-randomization-02  . . . 29
     B.3.  Changes from draft-ietf-tsvwg-port-randomization-01  . . . 29
     B.4.  Changes from draft-ietf-tsvwg-port-randomization-00  . . . 29
     B.5.  Changes from draft-larsen-tsvwg-port-randomization-02  . . 29
     B.6.  Changes from draft-larsen-tsvwg-port-randomization-01  . . 30
     B.7.  Changes from draft-larsen-tsvwg-port-randomization-00  . . 30
     B.8.  Changes from draft-larsen-tsvwg-port-randomisation-00  . . 30
   Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 31



Larsen & Gont            Expires January 3, 2010                [Page 3]


Internet-Draft             Port Randomization                  July 2009


1.  Introduction

   Recently, awareness has been raised about a number of "blind" attacks
   (i.e., attacks that can be performed without the need to sniff the
   packets that correspond to the transport protocol instance to be
   attacked) that can be performed against the Transmission Control
   Protocol (TCP) [RFC0793] and similar protocols.  The consequences of
   these attacks range from throughput-reduction to broken connections
   or data corruption [I-D.ietf-tcpm-icmp-attacks] [RFC4953] [Watson].

   All these attacks rely on the attacker's ability to guess or know the
   five-tuple (Protocol, Source Address, Source port, Destination
   Address, Destination Port) that identifies the transport protocol
   instance to be attacked.

   Services are usually located at fixed, 'well-known' ports [IANA] at
   the host supplying the service (the server).  Client applications
   connecting to any such service will contact the server by specifying
   the server IP address and service port number.  The IP address and
   port number of the client are normally left unspecified by the client
   application and thus chosen automatically by the client networking
   stack.  Ports chosen automatically by the networking stack are known
   as ephemeral ports [Stevens].

   While the server IP address and well-known port and the client IP
   address may be accurately guessed by an attacker, the ephemeral port
   of the client is usually unknown and must be guessed.

   This document describes a number of algorithms for the selection of
   the ephemeral ports, such that the possibility of an off-path
   attacker guessing the exact value is reduced.  They are not a
   replacement for cryptographic methods of protecting a connection such
   as IPsec [RFC4301], the TCP MD5 signature option [RFC2385], or the
   TCP Authentication Option [I-D.ietf-tcpm-tcp-auth-opt].  For example,
   they do not provide any mitigation in those scenarios in which the
   attacker is able to sniff the packets that correspond to the
   transport protocol connection to be attacked.  However, the proposed
   algorithms provide improved obfuscation with very little effort and
   without any key management overhead.

   The mechanisms described in this document are local modifications
   that may be incrementally deployed, and that does not violate the
   specifications of any of the transport protocols that may benefit
   from it, such as TCP [RFC0793], UDP [RFC0768], SCTP [RFC4960], DCCP
   [RFC4340], UDP-lite [RFC3828], and RTP [RFC3550].

   Since these mechanisms are obfuscation techniques, focus has been on
   a reasonable compromise between the level of obfuscation and the ease



Larsen & Gont            Expires January 3, 2010                [Page 4]


Internet-Draft             Port Randomization                  July 2009


   of implementation.  Thus the algorithms must be computationally
   efficient, and not require substantial state.

   We note that while the technique of mitigating "blind" attacks by
   obfuscating the ephemeral port election is well-known as "port
   randomization", the goal of the algorithms described in this document
   is to reduce the chances of an attacker guessing the ephemeral ports
   selected for new connections, rather than to actually produce
   mathematically random sequences of ephemeral ports.

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
   document are to be interpreted as described in RFC 2119 [RFC2119].






































Larsen & Gont            Expires January 3, 2010                [Page 5]


Internet-Draft             Port Randomization                  July 2009


2.  Ephemeral Ports

2.1.  Traditional Ephemeral Port Range

   The Internet Assigned Numbers Authority (IANA) assigns the unique
   parameters and values used in protocols developed by the Internet
   Engineering Task Force (IETF), including well-known ports [IANA].
   IANA has traditionally reserved the following use of the 16-bit port
   range of TCP and UDP:

   o  The Well Known Ports, 0 through 1023.

   o  The Registered Ports, 1024 through 49151

   o  The Dynamic and/or Private Ports, 49152 through 65535

   The range for assigned ports managed by the IANA is 0-1023, with the
   remainder being registered by IANA but not assigned.

   The ephemeral port range defined by IANA has traditionally consisted
   of the 49152-65535 range.

2.2.  Ephemeral port selection

   As each communication instance is identified by the five-tuple
   {protocol, local IP address, local port, remote IP address, remote
   port}, the selection of ephemeral port numbers must result in a
   unique five-tuple.

   Selection of ephemeral ports such that they result in unique five-
   tuples is handled by some implementations by having a per-protocol
   global 'next_ephemeral' variable that is equal to the previously
   chosen ephemeral port + 1, i.e. the selection process is:


















Larsen & Gont            Expires January 3, 2010                [Page 6]


Internet-Draft             Port Randomization                  July 2009


    /* Initialization at system boot time. Initialization value could be random */
    next_ephemeral = min_ephemeral;

    /* Ephemeral port selection function */
    count = max_ephemeral - min_ephemeral + 1;

    do {
        port = next_ephemeral;
        if (next_ephemeral == max_ephemeral) {
            next_ephemeral = min_ephemeral;
        } else {
            next_ephemeral++;
        }

        if (five-tuple is unique)
            return port;

        count--;

    } while (count > 0);

    return ERROR;

                                 Figure 1

   This algorithm works well provided that the number of connections for
   a each transport protocol that have a life-time longer than it takes
   to exhaust the total ephemeral port range is small, so that five-
   tuple collisions are rare.

   However, this method has the drawback that the 'next_ephemeral'
   variable and thus the ephemeral port range is shared between all
   connections and the next ports chosen by the client are easy to
   predict.  If an attacker operates an "innocent" server to which the
   client connects, it is easy to obtain a reference point for the
   current value of the 'next_ephemeral' variable.  Additionally, if an
   attacker could force a client to periodically establish a new TCP
   connection to an attacker controlled machine (or through an attacker
   observable routing path), the attacker could subtract consecutive
   source port values to obtain the number of outoing TCP connections
   established globally by the target host within that time period (up
   to wrap-around issues and 5-tuple collisions, of course).

2.3.  Collision of connection-id's

   While it is possible for the ephemeral port selection algorithm to
   verify that the selected port number results in connection-id that is
   not currently in use at that system, the resulting connection-id may



Larsen & Gont            Expires January 3, 2010                [Page 7]


Internet-Draft             Port Randomization                  July 2009


   still be in use at a remote system.  For example, consider a scenario
   in which a client establishes a TCP connection with a remote web
   server, and the web server performs the active close on the
   connection.  While the state information for this connection will
   disappear at the client side (that is, the connection will be moved
   to the fictional CLOSED state), the connection-id will remain in the
   TIME-WAIT state at the web server for 2*MSL (Maximum Segment
   Lifetime).  If the same client tried to create a new incarnation of
   the previous connection (that is, a connection with the same
   connection-id as the one in the TIME_WAIT state at the server), a
   connection-id "collision" would occur.  The effect of these
   collisions range from connection-establishment failures to TIME-WAIT
   state assassination (with the potential of data corruption)
   [RFC1337].  In scenarios in which a specific client establishes TCP
   connections with a specific service at a server, these problems
   become evident.  Therefore, an ephemeral port selection algorithm
   should ideally minimize the rate of connection-id collisions.

   A simple approach to minimize the rate of these collisions would be
   to choose port numbers incrementally, so that a given port number
   would not be reused until the rest of the port numbers in ephemeral
   port range have been used for a transport protocol instance.
   However, if a single global variable were used to keep track of the
   last ephemeral port selected, ephemeral port numbers would be
   trivially predictable, thus making it easier for an off-path attacker
   to "guess" the connection-id in use by a target connection.
   Section 3.3.3 and Section 3.3.4 describe algorithms that select port
   numbers incrementally, while still making it difficult for an off-
   path attacker to predict the ephemeral ports used for future
   connections.

   Another possible approach to minimize the rate of collisions of
   connection-id's would be for both end-points of a TCP connection to
   keep state about recent connections (e.g., have both end-points end
   up in the TIME-WAIT state).
















Larsen & Gont            Expires January 3, 2010                [Page 8]


Internet-Draft             Port Randomization                  July 2009


3.  Obfuscating the Ephemeral Ports

3.1.  Characteristics of a good ephemeral port obfuscation algorithm

   There are a number of factors to consider when designing a policy of
   selection of ephemeral ports, which include:

   o  Minimizing the predictability of the ephemeral port numbers used
      for future connections.

   o  Minimizing collisions of connection-id's

   o  Avoiding conflict with applications that depend on the use of
      specific port numbers.

   Given the goal of improving the transport protocol's resistance to
   attack by obfuscation of the five-tuple that identifies a transport-
   protocol instance, it is key to minimize the predictability of the
   ephemeral ports that will be selected for new connections.  While the
   obvious approach to address this requirement would be to select the
   ephemeral ports by simply picking a random value within the chosen
   port number range, this straightforward policy may lead to collisions
   of connection-id's, which could lead to the interoperability problems
   (namely delays in the establishment of new connections, failures in
   connection-establishment, or data curruption) discussed in
   Section 2.3.  As discussed in Section 1, it is worth noting that
   while the technique of mitigating "blind" attacks by obfuscating the
   ephemeral port election is well-known as "port randomization", the
   goal of the algorithms described in this document is to reduce the
   chances of an attacker guessing the ephemeral ports selected for new
   connections, rather than to actually produce sequences of
   mathematically random ephemeral port numbers.

   It is also worth noting that, provided adequate algorithms are in
   use, the larger the range from which ephemeral pots are selected, the
   smaller the chances of an attacker are to guess the selected port
   number.

   In scenarios in which a specific client establishes connections with
   a specific service at a server, the problems described in Section 2.3
   become evident.  A good algorithm to minimize the collisions of
   connection-id's would consider the time a given five-tuple was last
   used, and would avoid reusing the last recently used five-tuples.  A
   simple approach to minimize the rate of collisions would be to choose
   port numbers incrementally, so that a given port number would not be
   reused until the rest of the port numbers in the ephemeral port range
   have been used for a transport protocol instance.  However, if a
   single global variable were used to keep track of the last ephemeral



Larsen & Gont            Expires January 3, 2010                [Page 9]


Internet-Draft             Port Randomization                  July 2009


   port selected, ephemeral port numbers would be trivially predictable.

   It is important to note that a number of applications rely on binding
   specific port numbers that may be within the ephemeral ports range.
   If such an application was run while the corresponding port number
   was in use, the application would fail.  Therefore, transport
   protocols should avoid using those port numbers as ephemeral ports.

   Port numbers that are currently in use by a TCP in the LISTEN state
   should not be allowed for use as ephemeral ports.  If this rule is
   not complied, an attacker could potentially "steal" an incoming
   connection to a local server application by issuing a connection
   request to the victim client at roughly the same time the client
   tries to connect to the victim server application [CPNI-TCP]
   [I-D.gont-tcp-security].  If the SYN segment corresponding to the
   attacker's connection request and the SYN segment corresponding to
   the victim client "cross each other in the network", and provided the
   attacker is able to know or guess the ephemeral port used by the
   client, a TCP simultaneous open scenario would take place, and the
   incoming connection request sent by the client would be matched with
   the attacker's socket rather than with the victim server
   application's socket.

   It should be noted that most applications based on popular
   implementations of TCP API (such as the Sockets API) perform "passive
   opens" in three steps.  Firstly, the application obtains a file
   descriptor to be used for inter-process communication (e.g., by
   issuing a socket() call).  Secondly, the application binds the file
   descriptor to a local TCP port number (e.g., by issuing a bind()
   call), thus creating a TCP in the fictional CLOSED state.  Thirdly,
   the aforementioned TCP is put in the LISTEN state (e.g., by issuing a
   listen() call).  As a result, with such an implementation of the TCP
   API, even if port numbers in use for TCPs in the LISTEN state were
   not allowed for use as ephemeral ports, there is a window of time
   between the second and the third steps in which an attacker could be
   allowed to select a port number that would be later used for
   listening to incoming connections.  Therefore, these implementations
   of the TCP API should enforce a stricter requirement for the
   allocation of port numbers: port numbers that are in use by a TCP in
   the LISTEN or CLOSED states should not be allowed for allocation as
   ephemeral ports [CPNI-TCP] [I-D.gont-tcp-security].

3.2.  Ephemeral port number range

   As mentioned in Section 2.1, the ephemeral port range has
   traditionally consisted of the 49152-65535 range.  However, it should
   also include the range 1024-49151 range.




Larsen & Gont            Expires January 3, 2010               [Page 10]


Internet-Draft             Port Randomization                  July 2009


   Since this range includes user-specific server ports, this may not
   always be possible, though.  A possible workaround for this potential
   problem would be to maintain a local list of the port numbers that
   should not be allocated as ephemeral ports.  Thus, before allocating
   a port number, the ephemeral port selection function would check this
   list, avoiding the allocation of ports that may be needed for
   specific applications.

   Transport protocols SHOULD use the largest possible port range, since
   this improves the obfuscation provided by the ephemeral port
   selection algorithms.

3.3.  Ephemeral Port Obfuscation Algorithms

   Transport protocols SHOULD obfuscate the allocatation of their
   ephemeral ports, since this help to mitigate a number of attacks that
   depend on the attacker's ability to guess or know the five-tuple that
   identifies the transport protocol instance to be attacked.

   The following subsections describe a number of algorithms that could
   be implemented in order to obfuscate the selection of ephemeral port
   numbers.

3.3.1.  Algorithm 1: Simple port randomization algorithm

   In order to address the security issues discussed in Section 1 and
   Section 2.2, a number of systems have implemented simple ephemeral
   port number randomization, as follows:























Larsen & Gont            Expires January 3, 2010               [Page 11]


Internet-Draft             Port Randomization                  July 2009


       /* Ephemeral port selection function */
       num_ephemeral = max_ephemeral - min_ephemeral + 1;
       next_ephemeral = min_ephemeral + (random() % num_ephemeral);
       count = num_ephemeral;

       do {
           if(five-tuple is unique)
                   return next_ephemeral;

           if (next_ephemeral == max_ephemeral) {
               next_ephemeral = min_ephemeral;
           } else {
               next_ephemeral++;
           }

           count--;
       } while (count > 0);

       return ERROR;

                                 Figure 2

   We will refer to this algorithm as 'Algorithm 1'.

   Since the initially chosen port may already be in use with identical
   IP addresses and server port, the resulting five-tuple might not be
   unique.  Therefore, multiple ports may have to be tried and verified
   against all existing connections before a port can be chosen.

   Web proxy servers, NAPTs [RFC2663], and other middle-boxes aggregate
   multiple peers into the same port space and thus increse the
   population of used ephemeral ports, and hence the chances of
   collisions of connection-id's.  However, [Allman] has shown that at
   least in the network scenarios used for measuring the collision
   properties of the algorithms described in this document, the
   collision rate resulting from the use of the aforementioned middle-
   boxes is nevertheless very low.

   Since this algorithm performs a completely random port selection
   (i.e., without taking into account the port numbers previously
   chosen), it has the potential of reusing port numbers too quickly,
   thus possibly leading to collisions of connection-id's.  Even if a
   given five-tuple is verified to be unique by the port selection
   algorithm, the five-tuple might still be in use at the remote system.
   In such a scenario, the connection request could possibly fail
   ([Silbersack] describes this problem for the TCP case).

   This algorithm selects ephemeral port numbers randomly and thus



Larsen & Gont            Expires January 3, 2010               [Page 12]


Internet-Draft             Port Randomization                  July 2009


   reduces the chances of an attacker of guessing the ephemeral port
   selected for a target connection.  Additionally, it prevents
   attackers from obtaining the number of outgoing connections
   established by the client in some period of time.

3.3.2.  Algorithm 2: Another simple port randomization algorithm

   Another algorithm for selecting a random port number is shown in
   Figure 3, in which in the event a local connection-id collision is
   detected, another port number is selected randomly, as follows:


       /* Ephemeral port selection function */
       num_ephemeral = max_ephemeral - min_ephemeral + 1;
       next_ephemeral = min_ephemeral + (random() % num_ephemeral);
       count = num_ephemeral;

       do {
           if(five-tuple is unique)
                   return next_ephemeral;

           next_ephemeral = min_ephemeral + (random() % num_ephemeral);
           count--;
       } while (count > 0);

       return ERROR;

                                 Figure 3

   We will refer to this algorithm as 'Algorithm 2'.  This algorithm
   might be unable to select an ephemeral port (i.e., return "ERROR")
   even if there are port numbers that would result in unique five-
   tuples, when there are a large number of port numbers already in use.
   However, the results in [Allman] have shown that in common scenarios,
   one port choice is enough, and in most cases where more than one
   choice is needed two choices suffice.  Therefore, in those scenarios
   this would not be problem.

3.3.3.  Algorithm 3: Simple hash-based algorithm

   We would like to achieve the port reuse properties of the traditional
   BSD port selection algorithm (described in Section 2.2), while at the
   same time achieve the obfuscation properties of Algorithm 1 and
   Algorithm 2.

   Ideally, we would like a 'next_ephemeral' value for each set of
   (local IP address, remote IP addresses, remote port), so that the
   port reuse frequency is the lowest possible.  Each of these



Larsen & Gont            Expires January 3, 2010               [Page 13]


Internet-Draft             Port Randomization                  July 2009


   'next_ephemeral' variables should be initialized with random values
   within the ephemeral port range and would thus separate the ephemeral
   port ranges of the connections entirely.  Since we do not want to
   maintain in memory all these 'next_ephemeral' values, we propose an
   offset function F(), that can be computed from the local IP address,
   remote IP address, remote port and a secret key.  F() will yield
   (practically) different values for each set of arguments, i.e.:


    /* Initialization code at system boot time. Initialization value could be random. */
    next_ephemeral = 0;

    /* Ephemeral port selection function */
    num_ephemeral = max_ephemeral - min_ephemeral + 1;
    offset = F(local_IP, remote_IP, remote_port, secret_key);
    count = num_ephemeral;

    do {
        port = min_ephemeral + (next_ephemeral + offset) % num_ephemeral;
        next_ephemeral++;

        if(five-tuple is unique)
            return port;

        count--;

    } while (count > 0);

    return ERROR;

                                 Figure 4

   We will refer to this algorithm as 'Algorithm 3'.

   In other words, the function F() provides a per-connection fixed
   offset within the global ephemeral port range.  Both the 'offset' and
   'next_ephemeral' variables may take any value within the storage type
   range since we are restricting the resulting port similar to that
   shown in Figure 3.  This allows us to simply increment the
   'next_ephemeral' variable and rely on the unsigned integer to simply
   wrap-around.

   The function F() should be a cryptographic hash function like MD5
   [RFC1321].  The function should use both IP addresses, the remote
   port and a secret key value to compute the offset.  The remote IP
   address is the primary separator and must be included in the offset
   calculation.  The local IP address and remote port may in some cases
   be constant and not improve the connection separation, however, they



Larsen & Gont            Expires January 3, 2010               [Page 14]


Internet-Draft             Port Randomization                  July 2009


   should also be included in the offset calculation.

   Cryptographic algorithms stronger than e.g.  MD5 should not be
   necessary, given that Algorithm #3 is simply an obfuscation
   technique.  The secret should be chosen as random as possible, see
   [RFC4086] for recommendations on choosing secrets.

   Note that on multiuser systems, the function F() could include user
   specific information, thereby providing protection not only on a host
   to host basis, but on a user to service basis.  In fact, any
   identifier of the remote entity could be used, depending on
   availability an the granularity requested.  With SCTP both hostnames
   and alternative IP addresses may be included in the association
   negotiation and either of these could be used in the offset function
   F().

   When multiple unique identifiers are available, any of these can be
   chosen as input to the offset function F() since they all uniquely
   identify the remote entity.  However, in cases like SCTP where the
   ephemeral port must be unique across all IP address permutations, we
   should ideally always use the same IP address to get a single
   starting offset for each association negotiation from a given remote
   entity to minimize the possibility of collisions.  A simple numerical
   sorting of the IP addresses and always using the numerically lowest
   could achieve this.  However, since most protocols most likely will
   report the same IP addresses in the same order in each association
   setup, this sorting is most likely not necessary and the 'first one'
   can simply be used.

   The ability of hostnames to uniquely define hosts can be discussed,
   and since SCTP always includes at least one IP address, we recommend
   to use this as input to the offset function F() and ignore hostnames
   chunks when searching for ephemeral ports.

   It should be note that, as this algorithm uses a global counter
   ("next_ephemeral") for selecting ephemeral ports, if an attacker
   could force a client to periodically establish a new TCP connection
   to an attacker controlled machine (or through an attacker observable
   routing path), the attacker could subtract consecutive source port
   values to obtain the number of outoing TCP connections established
   globally by the target host within that time period (up to wrap-
   around issues and 5-tuple collisions, of course).

3.3.4.  Algorithm 4: Double-hash obfuscation algorithm

   A tradeoff between maintaining a single global 'next_ephemeral'
   variable and maintaining 2**N 'next_ephemeral' variables (where N is
   the width of of the result of F()) could be achieved as follows.  The



Larsen & Gont            Expires January 3, 2010               [Page 15]


Internet-Draft             Port Randomization                  July 2009


   system would keep an array of TABLE_LENGTH short integers, which
   would provide a separation of the increment of the 'next_ephemeral'
   variable.  This improvement could be incorporated into Algorithm 3 as
   follows:



     /* Initialization at system boot time */
     for(i = 0; i < TABLE_LENGTH; i++)
         table[i] = random() % 65536;


     /* Ephemeral port selection function */
     num_ephemeral = max_ephemeral - min_ephemeral + 1;
     offset = F(local_IP, remote_IP, remote_port, secret_key1);
     index = G(local_IP, remote_IP, remote_port, secret_key2);
     count = num_ephemeral;

     do {
         port = min_ephemeral + (offset + table[index]) % num_ephemeral;
         table[index]++;

         if(five-tuple is unique)
             return port;

        count--;

     } while (count > 0);

     return ERROR;

                                 Figure 5

   We will refer to this algorithm as 'Algorithm 4'.

   'table[]' could be initialized with mathematically random values, as
   indicated by the initialization code in Figure 5.  The function G()
   should be a cryptographic hash function like MD5 [RFC1321].  It
   should use both IP addresses, the remote port and a secret key value
   to compute a value between 0 and (TABLE_LENGTH-1).  Alternatively,
   G() could take as "offset" as input, and perform the exclusive-or
   (xor) operation between all the bytes in 'offset'.

   The array 'table[]' assures that succesive connections to the same
   end-point will use increasing ephemeral port numbers.  However,
   incrementation of the port numbers is separated into TABLE_LENGTH
   different spaces, and thus the port reuse frequency will be
   (probabilistically) lower than that of Algorithm 3.  That is, a



Larsen & Gont            Expires January 3, 2010               [Page 16]


Internet-Draft             Port Randomization                  July 2009


   connection established with some remote end-point will not
   necessarily cause the 'next_ephemeral' variable corresponding to
   other end-points to be incremented.

   It is interesting to note that the size of 'table[]' does not limit
   the number of different port sequences, but rather separates the
   *increments* into TABLE_LENGTH different spaces.  The port sequence
   will result from adding the corresponding entry of 'table[]' to the
   variable 'offset', which selects the actual port sequence (as in
   Algorithm 3).  [Allman] has found that a TABLE_LENGTH of 10 can
   result in an improvement over Algorithm 3.  Further increasing the
   TABLE_LENGTH will increase the obfuscation, and possibly further
   decrease the collision rate.

   An attacker can perform traffic analysis for any "increment space"
   into which the attacker has "visibility", namely that the attacker
   can force the client to establish a transport-protocol connection
   whose G(offset) identifies the target "increment space".  However,
   the attacker's ability to perform traffic analysis is very reduced
   when compared to the traditional BSD algorithm (described in
   Section 2.2) and Algorithm 3.  Additionally, an implementation can
   further limit the attacker's ability to perform traffic analysis by
   further separating the increment space (that is, using a larger value
   for TABLE_LENGTH).

3.3.5.  Algorithm 5: Random-increments port selection algorithm

   [Allman] introduced another port obfuscation algorithm, which offers
   a middle ground between the algorithms that select ephemeral ports
   randomly (such as those described in Section 3.3.1 and
   Section 3.3.2), and those that offer obfuscation but no randomization
   (such as those described in Section 3.3.3 and Section 3.3.4).  We
   will refer to this algorithm as 'Algorithm 5'.


















Larsen & Gont            Expires January 3, 2010               [Page 17]


Internet-Draft             Port Randomization                  July 2009


    /* Initialization code at system boot time. */
    next_ephemeral = random() % 65536;   /* Initialization value */
    N = 500;                             /* Determines the tradeoff (configurable) */

    /* Ephemeral port selection function */
    num_ephemeral = max_ephemeral - min_ephemeral + 1;

    count = num_ephemeral;

    do {
        next_ephemeral = next_ephemeral + (random() % N) + 1;
        port = min_ephemeral + (next_ephemeral % num_ephemeral);

        if(five-tuple is unique)
                return port;

         count--;
    } while (count > 0);

    return ERROR;

                                 Figure 6

   This algorithm aims at at producing a monotonically-increasing
   sequence to prevent the collision of connection-id's, while avoiding
   the use of fixed increments, which would lead to trivially-
   predictable sequences.  The value "N" allows for direct control of
   the tradeoff between the level of obfuscation and the port reuse
   frequency.  The smaller the value of "N", the more linear the more
   similar this algorithm is to the traditioanl BSD port selection
   algorithm (described in Section 2.2.  The larger the value of "N",
   the more similar this algorithm is to the algorithm described in
   Section 3.3.1 of this document.

   When the port numbers wrap, there's the risk of collisions of
   connection-id's.  Therefore, "N" should be selecting according to the
   following criteria:

   o  It should maximize the wrapping time of the ephemeral port space

   o  It should minimize collisions of connection-id's

   o  It should maximize obfuscation

   Clearly, these are competing goals, and the decision of which value
   of "N" to use is a tradeoff.  Therefore, the value of "N" should be
   configurable so that system administrators can make the tradeoff for
   themselves.



Larsen & Gont            Expires January 3, 2010               [Page 18]


Internet-Draft             Port Randomization                  July 2009


3.4.  Secret-key considerations for hash-based port obfuscation
      algorithms

   Every complex manipulation (like MD5) is no more secure than the
   input values, and in the case of ephemeral ports, the secret key.  If
   an attacker is aware of which cryptographic hash function is being
   used by the victim (which we should expect), and the attacker can
   obtain enough material (e.g. ephemeral ports chosen by the victim),
   the attacker may simply search the entire secret key space to find
   matches.

   To protect against this, the secret key should be of a reasonable
   length.  Key lengths of 32 bits should be adequate, since a 32-bit
   secret would result in approximately 65k possible secrets if the
   attacker is able to obtain a single ephemeral port (assuming a good
   hash function).  If the attacker is able to obtain more ephemeral
   ports, key lengths of 64 bits or more should be used.

   Another possible mechanism for protecting the secret key is to change
   it after some time.  If the host platform is capable of producing
   reasonable good random data, the secret key can be changed
   automatically.

   Changing the secret will cause abrupt shifts in the chosen ephemeral
   ports, and consequently collisions may occur.  That is, upon changing
   the secret, the "offset" value (see Figure 4 and Figure 5) used for
   each destination end-point will be different from that computed with
   the previous secret, ths leading to the selection of a port number
   recently used for connecting to the same end-point.

   Thus the change in secret key should be done with consideration and
   could be performed whenever one of the following events occur:

   o  The system is being bootstrapped.

   o  Some predefined/random time has expired.

   o  The secret has been used N times (i.e. we consider it insecure).

   o  There are few active connections (i.e., possibility of collision
      is low).

   o  There is little traffic (the performance overhead of collisions is
      tolerated).

   o  There is enough random data available to change the secret key
      (pseudo-random changes should not be done).




Larsen & Gont            Expires January 3, 2010               [Page 19]


Internet-Draft             Port Randomization                  July 2009


3.5.  Choosing an ephemeral port obfuscation algorithm

   [Allman] is an empyrical study of the properties of the algorithms
   described in this document, which has found that all the algorithms
   described in this document offer low collision rates -- at most 0.3%.
   That is, in those network scenarios asessed by [Allman] all of the
   algorithms described in this document perform good in terms of
   collisions of connection-id's.  However, these results may vary
   depending on the characteristics of network traffic and the specific
   network setup.

   The algorithm sketched in Figure 1 is the traditional ephemeral port
   selection algorithm implemented in BSD-derived systems.  It generates
   a global sequence of ephemeral port numbers, which makes it trivial
   for an attacker to predict the port number that will be used for a
   future transport protocol instance.  However, it is very simple, and
   leads to a low port resuse frequency.

   Algorithm 1 and Algorithm 2 have the advantage that they provide
   complete randomization.  However, they may increase the chances of
   port number collisions, which could lead to the failure of the
   connection establishment attempt.  [Allman] found that these two
   algorithms show the largest collision rates (among all the algorithms
   described in this document).

   Algorithm 3 provides complete separation in local and remote IP
   addresses and remote port space, and only limited separation in other
   dimensions (see Section 3.4).  However, implementations should
   consider the performance impact of computing the cryptographic hash
   used for the offset.

   Algorithm 4 improves Algorithm 3, usually leading to a lower port
   reuse frequency, at the expense of more processor cycles used for
   computing G(), and additional kernel memory for storing the array
   'table[]'.

   Algorithm 5 offers middle ground between the simple randomization
   algorithms (Algorithm 1 and Algorthm 2) and the hash-based algorithms
   (Algorithm 3 and Algorithm 4).  The upper limit on the random
   increments (the value "N" in Figure 6 controls the trade-off between
   randomization and port-reuse frequency.

   Finally, a special case that may preclude the utilization of
   Algorithm 3 and Algorithm 4 should be analyzed.  There exist some
   applications that contain the following code sequence:






Larsen & Gont            Expires January 3, 2010               [Page 20]


Internet-Draft             Port Randomization                  July 2009


       s = socket();
       bind(s, IP_address, port = *);


                                 Figure 7

   In some BSD-derived systems, the call to bind() will result in the
   selection of an ephemeral port number.  However, as neither the
   remote IP address nor the remote port will be available to the
   ephemeral port selection function, the hash function F() used in
   Algorithm 3 and Algorithm 4 will not have all the required arguments,
   and thus the result of the hash function will be impossible to
   compute.  Transport protocols implementating Algorithm 3 or Algorithm
   4 should consider using Algorithm 2 when facing the scenario just
   described.

   An alternative to this behavior would be to implement "lazy binding"
   in response to the bind() call.  That is, selection of an epphemeral
   port would be delayed until, e.g., connect() or send() are called.
   Thus, at that point the ephemeral port is actually selected, all the
   necessary arguments for the hash function F() would be available, and
   thus Algorithm 3 and Algorithm 4 could still be used in this
   scenario.  This policy has been implemented by Linux [Linux].




























Larsen & Gont            Expires January 3, 2010               [Page 21]


Internet-Draft             Port Randomization                  July 2009


4.  Port obfuscation and Network Address Port Translation (NAPT)

   Network Address Port Translation (NAPT) translate both the network
   address and transport-protocol port number, thus allowing the
   transport identifiers of a number of private hosts to be multiplexed
   into the transport identifiers of a single external address.
   [RFC2663]

   In those scenarios in which a NAPT is present between the two end-
   points of transport-protocol connection, the obfuscation of the
   ephemeral ports (from the point of view of the external network) will
   depend on the ephemeral port selection function at the NAPT.
   Therefore, NAPTs should consider obfuscating the ephemeral ports by
   means of any of the algorithms discussed in this document.  It should
   be noted that in some network scenarios, a NAPT may naturally obscure
   ephemeral port selections simply due to the vast range of services
   with which it establishes connections and to the overall rate of the
   traffic [Allman].

   Section 3.5 provides guidance in choosing a port obfuscation
   algorithm.






























Larsen & Gont            Expires January 3, 2010               [Page 22]


Internet-Draft             Port Randomization                  July 2009


5.  Security Considerations

   Obfuscating ephemeral ports is no replacement for cryptographic
   mechanisms, such as IPsec [RFC4301], in terms of protecting transport
   protocol instances against blind attacks.

   An eavesdropper, which can monitor the packets that correspond to the
   connection to be attacked could learn the IP addresses and port
   numbers in use (and also sequence numbers etc.) and easily attack the
   connection.  Ephemeral port obfuscation does not provide any
   additional protection against this kind of attacks.  In such
   situations, proper authentication mechanisms such as those described
   in [RFC4301] should be used.

   If the local offset function F() results in identical offsets for
   different inputs, the port-offset mechanism proposed in this document
   has no or reduced effect.

   If random numbers are used as the only source of the secret key, they
   must be chosen in accordance with the recommendations given in
   [RFC4086].

   If an attacker uses dynamically assigned IP addresses, the current
   ephemeral port offset (Algorithm 3 and Algorithm 4) for a given five-
   tuple can be sampled and subsequently used to attack an innocent peer
   reusing this address.  However, this is only possible until a re-
   keying happens as described above.  Also, since ephemeral ports are
   only used on the client side (e.g. the one initiating the
   connection), both the attacker and the new peer need to act as
   servers in the scenario just described.  While servers using dynamic
   IP addresses exist, they are not very common and with an appropriate
   re-keying mechanism the effect of this attack is limited.



















Larsen & Gont            Expires January 3, 2010               [Page 23]


Internet-Draft             Port Randomization                  July 2009


6.  Acknowledgements

   The offset function was inspired by the mechanism proposed by Steven
   Bellovin in [RFC1948] for defending against TCP sequence number
   attacks.

   The authors would like to thank (in alphabetical order) Mark Allman,
   Matthias Bethke, Stephane Bortzmeyer, Brian Carpenter, Vincent
   Deffontaines, Lars Eggert, Gorry Fairhurst, Guillermo Gont, Alfred
   Hoenes, Amit Klein, Carlos Pignataro, Kacheong Poon, Joe Touch, and
   Dan Wing for their valuable feedback on earlier versions of this
   document.

   The authors would like to thank FreeBSD's Mike Silbersack for a very
   fruitful discussion about ephemeral port selection techniques.

   Fernando Gont would like to thank Carolina Suarez for her love and
   support.

































Larsen & Gont            Expires January 3, 2010               [Page 24]


Internet-Draft             Port Randomization                  July 2009


7.  References

7.1.  Normative References

   [RFC0768]  Postel, J., "User Datagram Protocol", STD 6, RFC 768,
              August 1980.

   [RFC0793]  Postel, J., "Transmission Control Protocol", STD 7,
              RFC 793, September 1981.

   [RFC1321]  Rivest, R., "The MD5 Message-Digest Algorithm", RFC 1321,
              April 1992.

   [RFC1948]  Bellovin, S., "Defending Against Sequence Number Attacks",
              RFC 1948, May 1996.

   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119, March 1997.

   [RFC2385]  Heffernan, A., "Protection of BGP Sessions via the TCP MD5
              Signature Option", RFC 2385, August 1998.

   [RFC2663]  Srisuresh, P. and M. Holdrege, "IP Network Address
              Translator (NAT) Terminology and Considerations",
              RFC 2663, August 1999.

   [RFC3550]  Schulzrinne, H., Casner, S., Frederick, R., and V.
              Jacobson, "RTP: A Transport Protocol for Real-Time
              Applications", STD 64, RFC 3550, July 2003.

   [RFC3828]  Larzon, L-A., Degermark, M., Pink, S., Jonsson, L-E., and
              G. Fairhurst, "The Lightweight User Datagram Protocol
              (UDP-Lite)", RFC 3828, July 2004.

   [RFC4086]  Eastlake, D., Schiller, J., and S. Crocker, "Randomness
              Requirements for Security", BCP 106, RFC 4086, June 2005.

   [RFC4301]  Kent, S. and K. Seo, "Security Architecture for the
              Internet Protocol", RFC 4301, December 2005.

   [RFC4340]  Kohler, E., Handley, M., and S. Floyd, "Datagram
              Congestion Control Protocol (DCCP)", RFC 4340, March 2006.

   [RFC4960]  Stewart, R., "Stream Control Transmission Protocol",
              RFC 4960, September 2007.






Larsen & Gont            Expires January 3, 2010               [Page 25]


Internet-Draft             Port Randomization                  July 2009


7.2.  Informative References

   [FreeBSD]  The FreeBSD Project, "http://www.freebsd.org".

   [IANA]     "IANA Port Numbers",
              <http://www.iana.org/assignments/port-numbers>.

   [I-D.ietf-tcpm-icmp-attacks]
              Gont, F., "ICMP attacks against TCP",
              draft-ietf-tcpm-icmp-attacks-05 (work in progress),
              June 2009.

   [RFC1337]  Braden, B., "TIME-WAIT Assassination Hazards in TCP",
              RFC 1337, May 1992.

   [RFC4953]  Touch, J., "Defending TCP Against Spoofing Attacks",
              RFC 4953, July 2007.

   [Allman]   Allman, M., "Comments On Selecting Ephemeral Ports",  ACM
              Computer Communicatiion Review, 39(2), 2009.

   [CPNI-TCP]
              Gont, F., "CPNI Technical Note 3/2009: Security Assessment
              of the Transmission Control Protocol (TCP)",  UK Centre
              for the Protection of National Infrastructure, 2009.

   [I-D.gont-tcp-security]
              Gont, F., "Security Assessment of the Transmission Control
              Protocol (TCP)", draft-gont-tcp-security-00 (work in
              progress), February 2009.

   [Linux]    The Linux Project, "http://www.kernel.org".

   [NetBSD]   The NetBSD Project, "http://www.netbsd.org".

   [OpenBSD]  The OpenBSD Project, "http://www.openbsd.org".

   [OpenSolaris]
              OpenSolaris, "http://www.opensolaris.org".

   [Silbersack]
              Silbersack, M., "Improving TCP/IP security through
              randomization without sacrificing interoperability.",
              EuroBSDCon 2005 Conference .

   [Stevens]  Stevens, W., "Unix Network Programming, Volume 1:
              Networking APIs: Socket and XTI", Prentice Hall , 1998.




Larsen & Gont            Expires January 3, 2010               [Page 26]


Internet-Draft             Port Randomization                  July 2009


   [I-D.ietf-tcpm-tcp-auth-opt]
              Touch, J., Mankin, A., and R. Bonica, "The TCP
              Authentication Option", draft-ietf-tcpm-tcp-auth-opt-04
              (work in progress), March 2009.

   [Watson]   Watson, P., "Slipping in the Window: TCP Reset Attacks",
              CanSecWest 2004 Conference .












































Larsen & Gont            Expires January 3, 2010               [Page 27]


Internet-Draft             Port Randomization                  July 2009


Appendix A.  Survey of the algorithms in use by some popular
             implementations

A.1.  FreeBSD

   FreeBSD implements Algorithm 1, and in response to this document now
   uses a 'min_port' of 10000 and a 'max_port' of 65535.  [FreeBSD]

A.2.  Linux

   Linux implements Algorithm 3.  If the algorithm is faced with the
   corner-case scenario described in Section 3.5, Algorithm 1 is used
   instead [Linux].

A.3.  NetBSD

   NetBSD does not obfuscate its ephemeral port numbers.  It selects
   ephemeral port numbers from the range 49152-65535, starting from port
   65535, and decreasing the port number for each ephemeral port number
   selected [NetBSD].

A.4.  OpenBSD

   OpenBSD implements Algorithm 1, with a 'min_port' of 1024 and a
   'max_port' of 49151.  [OpenBSD]

A.5.  OpenSolaris

   OpenSolaris implements Algorithm 1, with a 'min_port' of 32768 and a
   'max_port' of 65535.  [OpenSolaris]





















Larsen & Gont            Expires January 3, 2010               [Page 28]


Internet-Draft             Port Randomization                  July 2009


Appendix B.  Changes from previous versions of the draft (to be removed
             by the RFC Editor before publication of this document as a
             RFC

B.1.  Changes from draft-ietf-tsvwg-port-randomization-03

   o  Addresses WGLC comments from Mark Allman.  See:
      http://www.ietf.org/mail-archive/web/tsvwg/current/msg09149.html

B.2.  Changes from draft-ietf-tsvwg-port-randomization-02

   o  Added clarification of what we mean by "port randomization".

   o  Addresses feedback sent on-list and off-list by Mark Allman.

   o  Added references to [Allman] and [CPNI-TCP].

B.3.  Changes from draft-ietf-tsvwg-port-randomization-01

   o  Added Section 2.3.

   o  Added discussion of "lazy binding in Section 3.5.

   o  Added discussion of obtaining the number of outgoing connections.

   o  Miscellaneous editorial changes

B.4.  Changes from draft-ietf-tsvwg-port-randomization-00

   o  Added Section 3.1.

   o  Changed Intended Status from "Standards Track" to "BCP".

   o  Miscellaneous editorial changes.

B.5.  Changes from draft-larsen-tsvwg-port-randomization-02

   o  Draft resubmitted as draft-ietf.

   o  Included references and text on protocols other than TCP.

   o  Added the second variant of the simple port randomization
      algorithm

   o  Reorganized the algorithms into different sections

   o  Miscellaneous editorial changes.




Larsen & Gont            Expires January 3, 2010               [Page 29]


Internet-Draft             Port Randomization                  July 2009


B.6.  Changes from draft-larsen-tsvwg-port-randomization-01

   o  No changes.  Draft resubmitted after expiration.

B.7.  Changes from draft-larsen-tsvwg-port-randomization-00

   o  Fixed a bug in expressions used to calculate number of ephemeral
      ports

   o  Added a survey of the algorithms in use by popular TCP
      implementations

   o  The whole document was reorganizaed

   o  Miscellaneous editorial changes

B.8.  Changes from draft-larsen-tsvwg-port-randomisation-00

   o  Document resubmitted after original document by M. Larsen expired
      in 2004

   o  References were included to current WG documents of the TCPM WG

   o  The document was made more general, to apply to all transport
      protocols

   o  Miscellaneous editorial changes
























Larsen & Gont            Expires January 3, 2010               [Page 30]


Internet-Draft             Port Randomization                  July 2009


Authors' Addresses

   Michael Vittrup Larsen
   TietoEnator
   Skanderborgvej 232
   Aarhus  DK-8260
   Denmark

   Phone: +45 8938 5100
   Email: michael.larsen@tietoenator.com


   Fernando Gont
   Universidad Tecnologica Nacional / Facultad Regional Haedo
   Evaristo Carriego 2644
   Haedo, Provincia de Buenos Aires  1706
   Argentina

   Phone: +54 11 4650 8472
   Email: fernando@gont.com.ar































Larsen & Gont            Expires January 3, 2010               [Page 31]

Document	Document type	This is an older version of an Internet-Draft that was ultimately published as RFC 6056. Expired & archived
	Select version	00 01 02 03 04 05 06 07 08 09 RFC 6056
	Compare versions
	Author
	Replaces	draft-larsen-tsvwg-port-randomization
	RFC stream
	Other formats	txt pdf bibtex bibxml
	Additional resources	Mailing list discussion