Internet Engineering Task Force                                L. Peluso
Internet-Draft                                      University of Napoli
Intended status: Standards Track                                T. Zseby
Expires: September 7, 2010                    Fraunhofer Institute FOKUS
                                                            S. D'Antonio
                                           CINI Consortium/University of
                                                     Napoli "Parthenope"
                                                               M. Molina
                                                                   DANTE
                                                          March 06, 2010


                       Flow Selection Techniques
              draft-ietf-ipfix-flow-selection-tech-01.txt

Abstract

   Flow selection is the process of selecting a subset of flows from all
   flows observed at an observation point.  The objective of flow
   selection is to reduce the effor for post-processing flow data and
   for transfering flow records.  The flow selection process can be
   enabled at different stages of the measurement process.  It can be
   applied directly after classification or at recording/exporting time
   by limiting the number of flows to be stored and/or exported to the
   collecting process.  This document describes motivations for flow
   selection and presents flow selection techniques.  It furthermore
   provides an information model for configuring flow selection
   techniques and discusses what information about a flow selection
   process is beneficial to be exported by adopting a suitable
   information model.

Requirements Language

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
   document are to be interpreted as described in RFC 2119 [RFC2119].

Status of this Memo

   This Internet-Draft is submitted to IETF in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as Internet-
   Drafts.

   Internet-Drafts are draft documents valid for a maximum of six months



Peluso, et al.          Expires September 7, 2010               [Page 1]


Internet-Draft          Flow Selection Techniques             March 2010


   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/ietf/1id-abstracts.txt.

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html.

   This Internet-Draft will expire on September 7, 2010.

Copyright Notice

   Copyright (c) 2010 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (http://trustee.ietf.org/license-info) in effect on the date of
   publication of this document.  Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document.  Code Components extracted from this document must
   include Simplified BSD License text as described in Section 4.e of
   the Trust Legal Provisions and are provided without warranty as
   described in the BSD License.

   This document may contain material from IETF Documents or IETF
   Contributions published or made publicly available before November
   10, 2008.  The person(s) controlling the copyright in some of this
   material may not have granted the IETF Trust the right to allow
   modifications of such material outside the IETF Standards Process.
   Without obtaining an adequate license from the person(s) controlling
   the copyright in such materials, this document may not be modified
   outside the IETF Standards Process, and derivative works of it may
   not be created outside the IETF Standards Process, except to format
   it for publication as an RFC or to translate it into languages other
   than English.













Peluso, et al.          Expires September 7, 2010               [Page 2]


Internet-Draft          Flow Selection Techniques             March 2010


Table of Contents

   1.  Introduction . . . . . . . . . . . . . . . . . . . . . . . . .  4
   2.  Scope  . . . . . . . . . . . . . . . . . . . . . . . . . . . .  4
   3.  Terminology  . . . . . . . . . . . . . . . . . . . . . . . . .  5
   4.  Position of the Flow Selection Process . . . . . . . . . . . .  5
   5.  Flow selection techniques  . . . . . . . . . . . . . . . . . .  7
     5.1.  Flow selection based on flow record content  . . . . . . .  7
     5.2.  Flow selection based on flow record arrival time or
           sequence . . . . . . . . . . . . . . . . . . . . . . . . .  7
     5.3.  Flow selection on external events  . . . . . . . . . . . .  7
   6.  Reporting of Flow Selection Information  . . . . . . . . . . .  7
     6.1.  Flow selection in the metering process . . . . . . . . . .  8
     6.2.  Flow selection in the flow recording process . . . . . . .  8
     6.3.  Flow selection in the exporting process  . . . . . . . . .  9
   7.  Information model for flow selection information exporting . . 11
     7.1.  Meter process related (TBD1-TBD2)  . . . . . . . . . . . . 12
       7.1.1.  FsMeter_UnmeasPacketCount  . . . . . . . . . . . . . . 13
       7.1.2.  FsMeter_UnmeasBytesCount . . . . . . . . . . . . . . . 13
     7.2.  Flow recording process related (TBD3-TBD8) . . . . . . . . 13
       7.2.1.  FsFrec_PacketInDroppedRecsCount  . . . . . . . . . . . 14
       7.2.2.  FsFrec_ByteInDroppedRecsCount  . . . . . . . . . . . . 14
       7.2.3.  FsFrec_FrecDroppedCount  . . . . . . . . . . . . . . . 14
       7.2.4.  FsFrec_UnexportedFrecCount . . . . . . . . . . . . . . 15
       7.2.5.  FsFrec_UnexportedPacketInFrecCount . . . . . . . . . . 15
       7.2.6.  FsFrec_UnexportedBytesInFrecCount  . . . . . . . . . . 15
     7.3.  Flow exporting process related (TBD9-TBD14)  . . . . . . . 16
       7.3.1.  FsExp_PacketInDroppedRecsCount . . . . . . . . . . . . 16
       7.3.2.  FsExp_ByteInDroppedRecsCount . . . . . . . . . . . . . 16
       7.3.3.  FsExp_FrecDroppedCount . . . . . . . . . . . . . . . . 17
       7.3.4.  FsExp_UnexportedCount  . . . . . . . . . . . . . . . . 17
       7.3.5.  FsExp_UnexportedPacketCount  . . . . . . . . . . . . . 17
       7.3.6.  FsExp_UnexportedByteInExpCount . . . . . . . . . . . . 18
   8.  Requirements put on implementations  . . . . . . . . . . . . . 18
   9.  Information Model for Configuration of Flow Selection
       Techniques . . . . . . . . . . . . . . . . . . . . . . . . . . 19
     9.1.  selectorMethod . . . . . . . . . . . . . . . . . . . . . . 19
     9.2.  flowMaxAdmitFlowRecords  . . . . . . . . . . . . . . . . . 20
     9.3.  flowRecordBytesSize  . . . . . . . . . . . . . . . . . . . 20
     9.4.  flowRecordPacketsSize  . . . . . . . . . . . . . . . . . . 21
     9.5.  flowInactivityTime . . . . . . . . . . . . . . . . . . . . 21
   10. Security Considerations  . . . . . . . . . . . . . . . . . . . 22
   11. IANA Considerations  . . . . . . . . . . . . . . . . . . . . . 22
   12. References . . . . . . . . . . . . . . . . . . . . . . . . . . 22
     12.1. Normative References . . . . . . . . . . . . . . . . . . . 22
     12.2. Informative References . . . . . . . . . . . . . . . . . . 22
   Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 23




Peluso, et al.          Expires September 7, 2010               [Page 3]


Internet-Draft          Flow Selection Techniques             March 2010


1.  Introduction

   This document describes flow selection techniques for traffic
   measurements.  [RFC5475], describes packet selection techniques,
   which describe sampling and filtering techniques for selecting a
   subset of packets The element on which this selection mechanism is
   performed is a packet and the selection decision is based on packet
   propeorties in contrats to this, flow selection techniques consider
   flows as the basic elements on which a selection process is performed
   In the IPFIX architecture the basis element on which the selection
   process is performed are the IPFIX flow records.  For several
   applications it makes sense to select only the flows of interest if
   resources are scarce.  Examples are accounting or attack detection
   applications.  One example are attack detection techniques.  If an
   attack is ongoing in the network, resources are quite scarce.
   Maintainign and exporting all flow records to the collecting process
   would increse resource demands even further with the result that data
   is randomly discarded.  A better solution is to export only a
   representative subset of flows.  For some botnet attacks many similar
   flows occur from different source addresses.  So it can be useful to
   include some meta-data ,to indicate that there are multiple flows
   active with similar characteristics.  Another example are accounting
   applications.  In many networks few large flows contribute to the
   majority of the overall traffic volume [DuLT01a], [DuLT01b].  This
   phenomenon is also referred to as "Quasi-Zipf-Law" [KuXW04] or as
   "elephant and mice phenomenon".  For accounting puposes it may be
   useful to concentrate on the large so-called "heavy hitter" flows to
   cope with a limited flow cache size or limited transmission capacity
   in times when resources are scarce.


2.  Scope

   This document describes flow selection techniques and its parameters.
   It addresses the configuration of flow selection techniques and
   defines which information should be reported by devices that perform
   flow selection.  It only describes processes directly acting on
   traffic flows during the metering phase and/or the exporting phase.
   Therefore it is assumed that flow selection is performed after
   packets are classified into flows.  This document does not address
   the flow selection effects that might result from the sampling or
   filtering of packets in the metering process before the
   classification process is performed.  Such packet selection
   techniques are described in [RFC5475] and, therefore, outside the
   scope of this document.






Peluso, et al.          Expires September 7, 2010               [Page 4]


Internet-Draft          Flow Selection Techniques             March 2010


3.  Terminology

   This document uses the terminology introduced in [RFC5470] and
   [RFC5475] In this section, some additional terms are presented which
   extend the terminology introduced in [RFC5475].

   * Flow Selection Process

      A Flow Selection Process takes a set of Flow Records as its input
      and selects a subset of that set as its output.

   * Flow Selection State

      A Flow Selection Process may maintain state information for use by
      the Flow Selection Process.  At a given time, the Flow Selection
      State may depend on flows observed at and before that time, and
      other variables.  Examples include:

        (i)   number of accounted flow records;

        (ii)  memory space available for flow recording;

        (iii) state of the pseudorandom number generators;

        (iv)  hash values calculated during selection.

   * Flow Selector

      A Flow Selector defines the action of a Flow Selection Process on
      a single flow of its input.  The Flow Selector can make use of the
      following information in determining whether a flow is selected:

        (i)   the content of the flow record;

        (ii)  any information state related to the flow recording;

        (iii) any selection state that may be maintained by the Flow
              Selection Process.


4.  Position of the Flow Selection Process

   Figure 1 shows the IPFIX reference model as defined in [RFC5470], and
   extends it by introducing the functional components where flow
   selection can take place.






Peluso, et al.          Expires September 7, 2010               [Page 5]


Internet-Draft          Flow Selection Techniques             March 2010


                       Packet(s) coming in to Observation Point(s)
                         |                                     |
                         v                                     v
        +----------------+---------------------------+   +-----+-------+
        |          Metering Process on an            |   |             |
        |             Observation Point              |   |             |
        |                                            |   |             |
        |   packet header capturing                  |   |             |
        |        |                                   |...| Metering    |
        |   timestamping                             |   | Process N   |
        |        |                                   |   |             |
        |   packet selection                         |   |             |
        |        |                                   |   |             |
        |   classification                           |   |             |
        |        |                                   |   |             |
        |   field match fitering on flow keys or     |   |             |
        |   flow state dependent packet sampling (*) |   |             |
        |        |                                   |   |             |
        |   aggregation                              |   |             |
        |        |                                   |   |             |
        |   flow recording (*)                       |   |             |
        |        |                                   |   |             |
        |        |        Timing out Flows           |   |             |
        |        |    Handle resource overloads      |   |             |
        +--------|-----------------------------------+   +-----|-------+
                 |                                             |
         Flow Records (selected by Observation Domain)    Flow Records
                 |                                             |
                 +----------------------+----------------------+
                                        |
                 +----------------------|---------------+
                 | Exporting Process    v               |
                 |      +---------------+-----------+   |
                 |      |        flow export (*)    |   |
                 |      +---------------+-----------+   |
                 |                      |               |
                 +----------------------+---------------+
                                        |
                                        v
                         IPFIX export packet to Collector

      (*) indicates where flow selection can take place.

                                 Figure 1

   In contrast to packet selection, flow selection is always applied
   after the packets are classified into flows.  Flows can be selected
   at different stages of the measurement chain:



Peluso, et al.          Expires September 7, 2010               [Page 6]


Internet-Draft          Flow Selection Techniques             March 2010


   1.  during metering [RFC5475]

   2.  during flow recording

   3.  during flow export


5.  Flow selection techniques

   We can distinguish the following selection techniques:

   1.  based on flow record content (i.e. flow characteristics that are
       described in the flow report);

   2.  based on flow record arrival time;

   3.  based on external events like the exhaustion of local resources.

5.1.  Flow selection based on flow record content

   Flow selection can be done based on fields in an IPFIX flow record.
   This can be done analogous to field match filtering for packet
   selection described in [RFC5475].  The difference here is that
   instead of packets here field of the flow record content are used for
   the selection decision.  An example would be to select flow records
   with regard to the flow size in bytes or number of packets.  Another
   application would be to select flow records based on flow start time
   or on flow keys (IP addresses, ports) of the stored flow record.

5.2.  Flow selection based on flow record arrival time or sequence

   Flow records can be selected based on their arrival time at the
   exporting process.  An example would be to select a number of flow
   records for certain periods of time.  Another option is to select
   flow records based on the order at which they arrive at the exporting
   process.  With this one can select systematically every kth record or
   select randomly a set of flow records.

5.3.  Flow selection on external events

   The selection of flow records can be alsio triggered by external
   events.  An example would be router state like number of entries in
   flow cache.


6.  Reporting of Flow Selection Information

   In this section we identify and describe in more detail some possible



Peluso, et al.          Expires September 7, 2010               [Page 7]


Internet-Draft          Flow Selection Techniques             March 2010


   causes of flow selection, along with the information that can be
   beneficial to make available to applications about it.

6.1.  Flow selection in the metering process

   The main reason for applying in the metering process a flow state
   dependent sampling is that the flow recording process may not have,
   at a certain point in time, enough positions to record all observable
   flows.  Another reason may be that there may not be enough processing
   resources to create and manage a new flow record.  To overcome with
   these limitations, a number of possible policies can be applied, the
   simplest one being not to consider for measurement the new packets
   that do not belong to already existing flow records (i.e. that would
   require the creation of a new one).  More refined policies are
   however possible, mainly aimed at the so called elephant flow
   detection, i.e. to give priority in the flow recording process to
   flows carrying more traffic.  For instance, [EsVa01] proposes
   criteria to define a packet eligible to create a new flow record
   (sample and hold, multistage filters).  Independently of the specific
   algorithms, we are concerned here about defining what information it
   makes sense to keep about the flow state dependent packet sampling
   and make available to applications (by exporting it out of an IPFIX
   device).  It is certainly possible to keep a cumulative counter of
   the total number of packets and bytes that were not considered for
   measurement because of flow state dependent sampling.  Also, it is
   possible to keep a timestamp for the first and last of these non
   measured packets.  This means, in practice, to aggregate all these
   packets in a macro flow, and keep track of its volume and duration.
   Imagining keeping more detailed information about packets not
   measured because of flow state dependent sampling would contradict
   the fact that the sampling is done because of lack of memory and/or
   processing resources.

6.2.  Flow selection in the flow recording process

   This block is optional in the IPFIX framework architecture.  However,
   we address here the case where it is present.  We already described
   in the previous section that because of lack of memory positions in
   the flow recording process some incoming packets may be discarded if
   they lead to the opening of a new flow record.  However, under
   certain circumstances, it may be advantageous to discard an existing
   flow record in the flow recording process to make room for the new
   record opened by an arriving packet.  For example, an algorithm for
   taking the decision whether to discard the new arriving packet or an
   existing flow record is described in [Moli03].  In this section we
   are not concerned about the algorithm details but about what
   information to store about this record removal.  For the same reasons
   expressed before, we argue that it does not make sense to store



Peluso, et al.          Expires September 7, 2010               [Page 8]


Internet-Draft          Flow Selection Techniques             March 2010


   separate information for each discarded flow record, as it would
   contradict the motivation itself for which the discarding is done
   (i.e. lack of memory resources).  The information that is certainly
   possible to keep with a limited effort is a cumulative counter of the
   total number of not yet exported packets and bytes belonging to flow
   records that were eliminated from the flow recording process.
   Ideally, we would also like to keep a timestamp for the first (T_fd)
   and last (T_ld) not yet exported packets belonging to all these
   discarded flow records.  This would mean, in practice, to aggregate
   all these packets in a macro flow, and keep track of its volume and
   duration.  To do so precisely, we would need to keep in each flow
   record a timestamp for the first and last non-exported packets, and
   whenever a record is discarded look at these timestamps to see if
   they are smaller or larger (respectively) of T_fd and T_ld and if yes
   update them.  Another information that can be easily kept is the
   number of these discarding events, along with a timestamp of the
   first and last of them.  This information should not be used by
   applications to re- normalize their received per flow statistics
   (because a flow may be discarded and re-created multiple times) but
   rather to keep under control the good functioning of the implemented
   policy.  Note that we consider a discarding event only when the
   discarded flow record contains some not exported traffic.  Otherwise,
   the removal of a record whose traffic was fully exported (after a
   timeout or after the arrival of specific packets, e.g.  TCP FIN or
   RST) is part of the normal functioning of an IPFIX flow metering
   system.  Note also that we consider only the case when an elimination
   of a flow record from the flow recording process leads to the
   complete loss of all the information contained in the flow record.
   If on the contrary another policy is implemented, like immediate
   exporting of the flow record before elimination, or freezing of the
   flow record and moving it in an area of memory different from which
   is considered the flow recording process for later exporting, this is
   not considered an elimination and therefore is out of the scope of
   this document.  In parallel to the information about the number of
   discarded flow records and associated packets and bytes, it is useful
   to keep cumulative information about the number of flow records
   containing not yet exported traffic that exist in the flow recording
   process, along with the cumulative number of not exported packets and
   bytes contained in them.  This information is useful also for
   exporting process related reasons, as clarified in the following
   paragraph.

6.3.  Flow selection in the exporting process

   The exporting process may implement policies for not exporting the
   whole set of flow records of the flow recording process.  In case of
   absence of the flow recording process, when the metering process
   directly feeds the exporting process (i.e. directly put the exported



Peluso, et al.          Expires September 7, 2010               [Page 9]


Internet-Draft          Flow Selection Techniques             March 2010


   packets in the IPFIX format), the following reasoning does not apply.
   The motivations for not exporting some flow records (containing non
   exported traffic) can be two: there are explicit configured policies
   or the exporting process faces resource limitation.  An example of
   explicit policy can be not to export the flows whose accounted
   traffic is below a certain threshold, or a more complex mechanism
   such as the one described in [DuLT01a] or [DuLT01b].  An example of
   resource limitation is that the exporting process has an assigned,
   limited time slot to operate or a limited predefined number of export
   packets that it can send.  There can also be hybrid cases where there
   are resource limitations and policies are applied in order to
   optimize the exported information (e.g. given that we want to export
   only N flow records, select a subset so that the overall number of
   reported packets and bytes belonging to the subset is maximized).
   Coming to the issue of which information it makes sense to keep about
   this flow selection, there are two cases to consider.  If a flow is
   not exported and because of this decision is deleted from the flow
   recording process, we are in the same case described before (where
   the deletion was triggered by the need to make room for another
   record).  The information to keep is then naturally the same as
   described before (cumulative packets and bytes for all the flows not
   exported, timestamps of the first and last packets belonging to non
   exported flow records, counter of dropping events and timestamp of
   first and last dropping event).  Only the reason for this removal is
   different.  If on the contrary a record eligible for exporting is not
   exported but it remains in the flow recording process it has always a
   chance to be exported in the future.  For an application, however, it
   would be beneficial to know what it is not currently being exported
   because of exporting process policies/resource limitations, in terms
   of flow records, packets and bytes.  This, not to re-normalize its
   estimates (it would be dangerous and error prone because the
   exporting of these records may be simply delayed), but rather to keep
   under control what is happening: for example, understand if there are
   pathologic situations where a large number of flow records and/or
   associated traffic are never exported, or if the number of flow
   records in the flow recording process is growing, etc.  When it comes
   to understanding if this information can be easily available,
   however, we recognize that there is the problem that in order to be
   aware that it has not exported a flow record, an exporting process
   should at least have browsed through it.  In other words, we would
   have to assume that there is always a full scanning of the flow
   recording process associated to the exporting process selection
   decision.  However, there may be more efficient implementations where
   this does not happen.  Therefore, even if we provide support in the
   information model for this information, defining it as mandatory in
   the protocol definition would put a constraint on the exporting
   process implementation, which is undesirable.




Peluso, et al.          Expires September 7, 2010              [Page 10]


Internet-Draft          Flow Selection Techniques             March 2010


7.  Information model for flow selection information exporting

   We formally define the elements to contain the information described
   in the previous section.  Some elements have an associated couple of
   timestamps, which we reference for brevity (when it is not ambiguous)
   as Tfirst and Tlast (instead of element_nameTfirst,
   element_nameTlast).  Note that all the following information elements
   are aimed at describing macro flows (e.g. the total number of packets
   and bytes contained in all dropped or not created flow records).
   Some of these macro flows are additive only, in the sense that they
   only add contributions to them, but never subtract.  E.g. the macro
   flow of the packets contained in flow records that are discarded from
   the flow reporting process receives a contribution when a flow record
   is discarded, and this contribution can never be subtracted.  On the
   contrary, some of the macro flows can dynamically receive and loose
   contributions.  E.g. the macro flows of packets not yet exported
   receives a contribution when a new packets arrives, and looses some
   contribution when there is an exporting event.  Associating a
   timestamp for the oldest and most recent contributions to additive
   only flow is easy, while for the others is not (would require to
   maintain full state) and that is why we did not define timestamps for
   these information elements.

   The information elements here introduced are defined in accordance
   with the IPFIX information model [RFC5102] to which reference should
   be made for more detailed information.  Furthermore, the data types
   used to formally rappresent the Flow Selection related information
   elements are those defined in section 3.1 of the IPFIX information
   model [RFC 2051].  For that reason, they are not redefined in this
   section.

   List of additional Flow Selection information elements:



















Peluso, et al.          Expires September 7, 2010              [Page 11]


Internet-Draft          Flow Selection Techniques             March 2010


              +-------+------------------------------------+
              | ID    | Name                               |
              +-------+------------------------------------+
              | TBD1  | FsMeter_UnmeasPacketCount          |
              +-------+------------------------------------+
              | TBD2  | FsMeter_UnmeasBytesCount           |
              +-------+------------------------------------+
              | TBD3  | FsFrec_PacketInDroppedRecsCount    |
              +-------+------------------------------------+
              | TBD4  | FsFrec_ByteInDroppedRecsCount      |
              +-------+------------------------------------+
              | TBD5  | FsFrec_FrecDroppedCount            |
              +-------+------------------------------------+
              | TBD6  | FsFrec_UnexportedFrecCount         |
              +-------+------------------------------------+
              | TBD7  | FsFrec_UnexportedPacketInFrecCount |
              +-------+------------------------------------+
              | TBD8  | FsRec_UnexportedBytesInFrecCount   |
              +-------+------------------------------------+
              | TBD9  | FsExp_PacketInDroppedRecsCount     |
              +-------+------------------------------------+
              | TBD10 | FsExp_BytesInDroppedRecsCount      |
              +-------+------------------------------------+
              | TBD11 | FsExp_FrecDroppedCount             |
              +-------+------------------------------------+
              | TBD12 | FsExp_UnexportedCount              |
              +-------+------------------------------------+
              | TBD13 | FsExp_UnexportedPacketCount        |
              +-------+------------------------------------+
              | TBD14 | FsExp_UnexportedByteInExpCount     |
              +-------+------------------------------------+

7.1.  Meter process related (TBD1-TBD2)

   Information Elements in this section are related to Flow Selection at
   the Matering Process.

                   +------+---------------------------+
                   | ID   | Name                      |
                   +------+---------------------------+
                   | TBD1 | FsMeter_UnmeasPacketCount |
                   +------+---------------------------+
                   | TBD2 | FsMeter_UnmeasBytesCount  |
                   +------+---------------------------+







Peluso, et al.          Expires September 7, 2010              [Page 12]


Internet-Draft          Flow Selection Techniques             March 2010


7.1.1.  FsMeter_UnmeasPacketCount

   Contains the count of packets that were not measured because of flow
   state dependent sampling, in terms of:

   TsFirst: timestamp of the first packet not measured because of flow
   state dependent sampling (Type: dateTime)

   TsLast: timestamp of the last packet not measured because of flow
   state dependent sampling (Type: dataTime)

7.1.2.  FsMeter_UnmeasBytesCount

   Description:

      This Information Elements contains the count of bytes that were
      not measured because of flow state dependent sampling

   Abstract Data Type: unsigned64

   Data Type Semantics: quantity

   ElementId: TBD2

   Status: Proposed

   Units: bytes

7.2.  Flow recording process related (TBD3-TBD8)

   Information Elements in this section are related to Flow Selection at
   the Flow Recording Process if present.

               +------+------------------------------------+
               | ID   | Name                               |
               +------+------------------------------------+
               | TBD3 | FsFrec_PacketInDroppedRecsCount    |
               +------+------------------------------------+
               | TBD4 | FsFrec_ByteInDroppedRecsCount      |
               +------+------------------------------------+
               | TBD5 | FsFrec_FrecDroppedCount            |
               +------+------------------------------------+
               | TBD6 | FsFrec_UnexportedFrecCount         |
               +------+------------------------------------+
               | TBD7 | FsFrec_UnexportedPacketInFrecCount |
               +------+------------------------------------+
               | TBD8 | FsFrec_UnexportedBytesInFrecCount  |
               +------+------------------------------------+



Peluso, et al.          Expires September 7, 2010              [Page 13]


Internet-Draft          Flow Selection Techniques             March 2010


7.2.1.  FsFrec_PacketInDroppedRecsCount

   Contains the count of non exported packets that were contained in
   flow records eliminated from the flow recording process because of
   resource limitations/policies in the flow recording process.  It is
   defined in terms of:

   TsFirst: timestamp of the first non-exported packet belonging to a
   eliminated flow record (Type: dateTime)

   TsLast: timestamp of the last non-exported packet belonging to a
   eliminated flow record (Type: dateTime)

7.2.2.  FsFrec_ByteInDroppedRecsCount

   Description:

      This Information Elements contains the count of non exported bytes
      that were contained in flow records eliminated from the flow
      recording process because of resource limitations/policies in the
      flow recording process.

   Abstract Data Type: unsigned64

   Data Type Semantics: quantity

   ElementId: TBD4

   Status: Proposed

   Units: bytes

7.2.3.  FsFrec_FrecDroppedCount

   Contains the count of flow records containing non exported packets
   eliminated from the flow recording process because of resources
   limitations/policies in the flow recording process.  It is defined in
   terms of:

   TsFirst: timestamp of the first flow record elimination event from
   the flow recording process (Type: dateTime)

   TsLast: timestamp of the last flow record elimination event from the
   flow recording process (Type: dateTime)







Peluso, et al.          Expires September 7, 2010              [Page 14]


Internet-Draft          Flow Selection Techniques             March 2010


7.2.4.  FsFrec_UnexportedFrecCount

   Description:

      This Information Elements contains the count of the flow records
      currently existing in the flow recording process containing at
      least one non exported packet.

   Abstract Data Type: unsigned32

   Data Type Semantics: quantity

   ElementId: TBD6

   Status: Proposed

   Units: flow records

7.2.5.  FsFrec_UnexportedPacketInFrecCount

   Description:

      This Information Elements contains the count of non exported
      packets contained in flow records of the flow recording process.

   Abstract Data Type: unsigned32

   Data Type Semantics: quantity

   ElementId: TBD7

   Status: Proposed

   Units: packets

7.2.6.  FsFrec_UnexportedBytesInFrecCount

   Description:

      This Information Elements contains the count of non exported bytes
      contained in flow records of the flow recording process.

   Abstract Data Type: unsigned64

   Data Type Semantics: quantity

   ElementId: TBD8




Peluso, et al.          Expires September 7, 2010              [Page 15]


Internet-Draft          Flow Selection Techniques             March 2010


   Status: Proposed

   Units: bytes

7.3.  Flow exporting process related (TBD9-TBD14)

   Information Elements in this section are related to Flow Selection at
   the Flow Exporting Process.

                +-------+--------------------------------+
                | ID    | Name                           |
                +-------+--------------------------------+
                | TBD9  | FsExp_PacketInDroppedRecsCount |
                +-------+--------------------------------+
                | TBD10 | FsExp_ByteInDroppedRecsCount   |
                +-------+--------------------------------+
                | TBD11 | FsExp_FrecDroppedCount         |
                +-------+--------------------------------+
                | TBD12 | FsExp_UnexportedCount          |
                +-------+--------------------------------+
                | TBD13 | FsExp_UnexportedPacketCount    |
                +-------+--------------------------------+
                | TBD14 | FsExp_UnexportedByteInExpCount |
                +-------+--------------------------------+

7.3.1.  FsExp_PacketInDroppedRecsCount

   Contains the count of non exported packets that were contained in
   flow records eliminated from the flow recording process because of
   resource limitations/policies in the exporting process.  It is
   defined in terms of:

   TsFirst: timestamp of the first non exported packet belonging to a
   eliminated flow record (Type: dateTime)

   TsLast: timestamp of the last non exported packet belonging to a
   eliminated flow record (Type: dateTime)

7.3.2.  FsExp_ByteInDroppedRecsCount

   Description:

      This Information Elements contains the count of non exported bytes
      that were contained in flow records eliminated from the flow
      recording process because of resource limitations/policies in the
      exporting process.

   Abstract Data Type: unsigned64



Peluso, et al.          Expires September 7, 2010              [Page 16]


Internet-Draft          Flow Selection Techniques             March 2010


   Data Type Semantics: quantity

   ElementId: TBD10

   Status: Proposed

   Units: bytes

7.3.3.  FsExp_FrecDroppedCount

   Contains the count of flow records containing non exported packets
   eliminated from the flow recording process because of resource
   limitations/policies in the exporting process.  It is defined in
   terms of:

   TsFirst: timestamp of the first flow record elimination event from
   the flow recording process (Type: dateTime)

   TsLast: timestamp of the last flow record elimination event from the
   flow recording process (Type: dateTime)

7.3.4.  FsExp_UnexportedCount

   Description:

      This Information Elements contains the count of the flow records
      currently existing in the flow recording process containing non-
      exported traffic and not being exported because of exporting
      process resource lmitations/policies.

   Abstract Data Type: unsigned32

   Data Type Semantics: quantity

   ElementId: TBD12

   Status: Proposed

   Units: flow records

7.3.5.  FsExp_UnexportedPacketCount

   Description:

      This Information Elements contains the count of non exported
      packets contained in flow records of the flow recording process
      not being exported because of exporting process resource
      limitations/policies.



Peluso, et al.          Expires September 7, 2010              [Page 17]


Internet-Draft          Flow Selection Techniques             March 2010


   Abstract Data Type: unsigned32

   Data Type Semantics: quantity

   ElementId: TBD13

   Status: Proposed

   Units: packets

7.3.6.  FsExp_UnexportedByteInExpCount

   Description:

      This Information Elements contains the count of non exported bytes
      contained in flow records of the flow recording process not being
      exported because of exporting process resource limitations/
      policies.

   Abstract Data Type: unsigned64

   Data Type Semantics: quantity

   ElementId: TBD14

   Status: Proposed

   Units: bytes


8.  Requirements put on implementations

   To support the described information model an implementation must
   keep, in the flow records, counts for non-exported packets and bytes.
   Sometimes these are referred as delta counts.  An implementation may
   also keep absolute counts for scopes not specified in this
   information model (it appears that both delta and absolute counters
   can be exported in the IPFIX information model, see [RFC5102]).  In
   addition, to fully support this information model, it would be
   required to keep in a flow record a timestamp for the first and last
   non-exported packets.  An implementation may need to keep timestamps
   for the first and last exported packets as well for scopes not
   specified in this information model, or to join the two timers for
   the last exported and first exported packets (which is of course an
   approximation) or to approximate them with the time of the exporting
   event.





Peluso, et al.          Expires September 7, 2010              [Page 18]


Internet-Draft          Flow Selection Techniques             March 2010


9.  Information Model for Configuration of Flow Selection Techniques

   This section aims at describing the representative parameters of the
   above presented flow selection techniques.  To this regard, it
   provides the basis for an information model to adopt in order to
   configure the flow selection process at an IPFIX device.  The
   information elements here introduced are defined in accordance with
   the IPFIX information model [RFC5102] to which reference should be
   made for more detailed information.  Furthermore, the data types used
   to formally rappresent the Flow Selection related information
   elements are those defined in section 3.1 of the IPFIX information
   model [RFC 2051].  For that reason, they are not redefined in this
   section.

   List of additional Flow Selection information elements:

   +-------+-------------------------+-------+-----------------------+
   | ID    | Name                    | ID    | Name                  |
   +-------+-------------------------+-------+-----------------------+
   | TBD15 | selectorMethod          | TBD18 | flowRecordPacketsSize |
   +-------+-------------------------+-------+-----------------------+
   | TBD16 | flowMaxAdmitFlowRecords | TBD19 | flowInactivityTime    |
   +-------+-------------------------+-------+-----------------------+
   | TBD17 | flowRecordBytesSize     | ...   | ...                   |
   +-------+-------------------------+-------+-----------------------+

9.1.  selectorMethod

   Description:

      This Information Element identifies the flow selection method that
      are applied by the Flow Selection process, in accordance to what
      described in the section 5 of this document.

      Same of these methods may have parameters in order to fully
      support the selected technique.  For that reason, further
      Information Elements are defined in the following subsections.

      The following flow selection methods identifiers are defined here:

   +----+----------------------------+---------------------------------+
   | ID | Method                     | Parameters                      |
   +----+----------------------------+---------------------------------+
   | 1  | Selection based on flow    | flowMaxAdmitFlowRecords         |
   |    | size count                 | flowRecordBytesSize             |
   |    |                            | flowRecordPacketsSize           |
   +----+----------------------------+---------------------------------+




Peluso, et al.          Expires September 7, 2010              [Page 19]


Internet-Draft          Flow Selection Techniques             March 2010


   +----+----------------------------+---------------------------------+
   | 2  | Selection based on flow    | flowMaxAdmitFlowRecords         |
   |    | content property match     | ...........                     |
   +----+----------------------------+---------------------------------+
   | 3  | Selection based on flow    | flowMaxAdmitFlowRecords         |
   |    | record arrival time or     | flowInactivityTime              |
   |    | sequence                   |                                 |
   +----+----------------------------+---------------------------------+
   | 4  | Selection based on         | flowMaxAdmitFlowRecords         |
   |    | external events            | ...........                     |
   +----+----------------------------+---------------------------------+

   Abstract Data Type: unsigned16

   Data Type Semantics: identifier

   ElementId: TBD15

   Status: Proposed

9.2.  flowMaxAdmitFlowRecords

   Description:

      This Information Element specifies the maximum number of elegible
      flow records which might be created in to the flow cache.  It is
      used by the Selector Process in order to identify the time when
      flow selection should be triggered.  A value of 0 means that the
      Flow Selection State related to the memory space available for
      flow recording must be used to estimate the max flow cache size.

      For example, this Information Element may be used to describe the
      configuration of a flow size count Flow Selector.

   Abstract Data Type: unsigned32

   Data Type Semantics: quantity

   ElementId: TBD16

   Status: Proposed

   Units: flow records

9.3.  flowRecordBytesSize

   Description:




Peluso, et al.          Expires September 7, 2010              [Page 20]


Internet-Draft          Flow Selection Techniques             March 2010


      This Information Element specifies the minimum number of bytes
      contained in a flow record to be considered not elegible for
      removal.  It may be used in order to identify elephant flows.

      For example, this Information Element may be used to describe the
      configuration of a flow size count Flow Selector.

   Abstract Data Type: unsigned64

   Data Type Semantics: quantity

   ElementId: TBD17

   Status: Proposed

   Units: bytes

9.4.  flowRecordPacketsSize

   Description:

      This Information Element specifies the minimum number of packets
      contained in a flow record to be considered not elegible for
      removal.  It may be used in order to identify elephant flows.

      For example, this Information Element may be used to describe the
      configuration of a flow size count Flow Selector.

   Abstract Data Type: unsigned32

   Data Type Semantics: quantity

   ElementId: TBD18

   Status: Proposed

   Units: packets

9.5.  flowInactivityTime

   Description:

      This Information Element specifies the time interval in
      microseconds during which the corresponding flow record may be
      considered still active.  It is used by the metering process
      and/or the flow recording process in order to take the decision
      whether to discard an existing flow to make room for a new one.




Peluso, et al.          Expires September 7, 2010              [Page 21]


Internet-Draft          Flow Selection Techniques             March 2010


      For example, this Information Element may be used to describe the
      configuration of a flow arrival time Flow Selector.

   Abstract Data Type: dateTimeMicroseconds

   Data Type Semantics: quantity

   ElementId: TBD19

   Status: Proposed

   Units: microseconds


10.  Security Considerations

   This document descirbes methods for flow selection techniques that
   are applied in network measurements.  If users know or can guess the
   selection policies they may craft flows in a way to avoid beeing
   selected.  Furthermore network measurements are often used for the
   detecction of network attacks.  Therefore it has to be taken into
   account that flow selection may remove flows that are of interest for
   the detection taks. [more here]


11.  IANA Considerations

   This document introduces several new information elements as an
   extension to the IPFIX information model.  IANA assignments should be
   created for the information elements described in this document.


12.  References

12.1.  Normative References

   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119, March 1997.

12.2.  Informative References

   [DuLT01a]  Duffield, N., Lund, C., and M. Thorup, "Charging from
              Sampled Network Usage", ACM Internet Measurement Workshop
              IMW 2001, San Francisco, USA, November 2001.

   [DuLT01b]  Duffield, N., Lund, C., and M. Thorup, "Properties and
              Prediction of Flow Statistics from Sampled Packet
              Streams", ACM SIGCOMM Internet Measurement Workshop 2002,



Peluso, et al.          Expires September 7, 2010              [Page 22]


Internet-Draft          Flow Selection Techniques             March 2010


              November 2002.

   [DuLT01c]  Duffield, N., Lund, C., and M. Thorup, "Learn More, sample
              less: control of volume and variance in network
              measurement", IEEE Transactions on Information Theory,
              May 2005.

   [DuLT01d]  Duffield, N., Lund, C., and M. Thorup, "Flow Sampling
              under Hard Resource Constraints", ACM IFIP Conference on
              Measurement and Modeling of Computer Systems SIGMETRICS,
              June 2004.

   [EsVa01]   Estan, C. and G,. Varghese, "New Directions in Traffic
              Measurement and Accounting: Focusing on the Elephants,
              Ignoring the Mice", ACM SIGCOMM Internet Measurement
              Workshop 2001, San Francisco (CA), November 2001.

   [FeGL98]   Feldmann, A., Rexford, J., and R. Caceres, "Efficient
              Policies for Carrying Web Traffic over Flow-Switched
              Networks", IEEE/ACM Transaction on Networking,
              December 1998.

   [KuXW04]   Kumar, K., Xu, J., Wang, J., Spatschek, O., and L. Li,
              "Space-code bloom filter for efficient per-flow traffic
              measurement", INFOCOM 2004 Twenty-third AnnualJoint
              Conference of the IEEE Computer and Communications
              Societies, March 2004.

   [Moli03]   Molina, M., "A scalable and efficient methodology for flow
              monitoring in the Internet", International Teletraffic
              Congress (ITC-18), Berlin, September 2003.

   [RFC5102]  Quittek, J., Bryant, S., Claise, B., Aitken, P., and J.
              Meyer, "Information Model for IP Flow Information Export",
              RFC 5102, January 2008.

   [RFC5470]  Sadasivan, G., Brownlee, N., Claise, B., and J. Quittek,
              "Architecture for IP Flow Information Export", RFC 5470,
              March 2009.

   [RFC5475]  Zseby, T., Molina, M., Duffield, N., Niccolini, S., and F.
              Raspall, "Sampling and Filtering Techniques for IP Packet
              Selection", RFC 5475, March 2009.








Peluso, et al.          Expires September 7, 2010              [Page 23]


Internet-Draft          Flow Selection Techniques             March 2010


Authors' Addresses

   Lorenzo Peluso
   University of Napoli
   Via Claudio 21
   Napoli  80125
   Italy

   Phone: +39 081 7683821
   Email: lorenzo.peluso@unina.it


   Tanja Zseby
   Fraunhofer Institute FOKUS
   Kaiserin-Augusta-Allee 31
   Berlin  10589
   Germany

   Phone: +49 30 3463 7153
   Email: tanja.zseby@fokus.fraunhofer.de


   Salvatore D'Antonio
   CINI Consortium/University of Napoli "Parthenope"
   Monte S.Angelo, Via Cinthia
   Napoli  80126
   Italy

   Phone: +39 081 679944
   Email: saldanto@unina.it


   Maurizio Molina
   DANTE
   Hills Road 126-130
   Cambridge  CB2 1PQ
   United Kingdom

   Phone: +44 1223 371300
   Email: maurizio.molina@dante.org.uk











Peluso, et al.          Expires September 7, 2010              [Page 24]