RTP Payload Formats for European Telecommunications Standards Institute (ETSI) European Standard ES 202 050, ES 202 211, and ES 202 212 Distributed Speech Recognition Encoding
RFC 4060

Document type: RFC - Proposed Standard (May 2005; Errata)
Document stream: IETF
Last updated: 2013-03-02

IESG State: RFC 4060 (Proposed Standard)
Responsible AD: Allison Mankin
Send notices to: csp@csperkins.org, magnus.westerlund@ericsson.com

Network Working Group                                             Q. Xie
Request for Comments: 4060                                     D. Pearce
Category: Standards Track                                       Motorola
                                                                May 2005

          RTP Payload Formats for European Telecommunications
              Standards Institute (ETSI) European Standard
                 ES 202 050, ES 202 211, and ES 202 212
                Distributed Speech Recognition Encoding

Status of This Memo

   This document specifies an Internet standards track protocol for the
   Internet community, and requests discussion and suggestions for
   improvements.  Please refer to the current edition of the "Internet
   Official Protocol Standards" (STD 1) for the standardization state
   and status of this protocol.  Distribution of this memo is unlimited.

Copyright Notice

   Copyright (C) The Internet Society (2005).

Abstract

   This document specifies RTP payload formats for encapsulating
   European Telecommunications Standards Institute (ETSI) European
   Standard ES 202 050 DSR Advanced Front-end (AFE), ES 202 211 DSR
   Extended Front-end (XFE), and ES 202 212 DSR Extended Advanced
   Front-end (XAFE) signal processing feature streams for distributed
   speech recognition (DSR) systems.

Xie & Pearce                Standards Track                     [Page 1]
RFC 4060            RTP Payloads for ETSI DSR Codecs            May 2005

Table of Contents

   1. Introduction ....................................................2
      1.1. Conventions and Acronyms ...................................3
   2. ETSI DSR Front-end Codecs .......................................4
      2.1. ES 202 050 Advanced DSR Front-end Codec ....................4
      2.2. ES 202 211 Extended DSR Front-end Codec ....................4
      2.3. ES 202 212 Extended Advanced DSR Front-end Codec ...........5
   3. DSR RTP Payload Formats .........................................6
      3.1. Common Considerations of the Three DSR RTP Payload
           Formats ....................................................6
           3.1.1. Number of FPs in Each RTP Packet ....................6
           3.1.2. Support for Discontinuous Transmission ..............6
           3.1.3. RTP Header Usage ....................................6
      3.2. Payload Format for ES 202 050 DSR ..........................7
           3.2.1. Frame Pair Formats ..................................7
      3.3. Payload Format for ES 202 211 DSR ..........................9
           3.3.1. Frame Pair Formats ..................................9
      3.4. Payload Format for ES 202 212 DSR .........................11
           3.4.1. Frame Pair Formats .................................12
   4. IANA Considerations ............................................14
      4.1. Mapping MIME Parameters into SDP ..........................15
      4.2. Usage in Offer/Answer .....................................16
      4.3. Congestion Control ........................................16
   5. Security Considerations ........................................16
   6. Acknowledgments ................................................16
   7. References .....................................................16
      7.1. Normative References ......................................16
      7.2. Informative References ....................................17

1.  Introduction

   Distributed speech recognition (DSR) technology is intended for a
   remote device acting as a thin client (a.k.a. the front-end) to
   communicate with a speech recognition server (a.k.a. a speech
   engine) over a network connection to obtain speech recognition
   services.  More details on DSR over the Internet can be found in
   RFC 3557 [10].

   To achieve interoperability between different client devices and
   speech engines, the first ETSI standard DSR front-end, ES 201 108,
   was published in early 2000 [11].  An RTP packetization for ES 201
   108 frames is defined by the IETF in RFC 3557 [10].

   In ES 202 050 [1], ETSI issued another standard for an Advanced DSR
   front-end that provides substantially improved recognition
   performance when background noise is present.  The codecs in ES 202
   050 use a slightly different frame format from that of ES 201 108,
   and thus the two do not interoperate.

   The RTP packetization for the ES 202 050 front-end defined in this
   document uses the same RTP packet format layout as that defined in
   RFC 3557 [10].  The differences are in the DSR codec frame bit
