Requirements for Distributed Control of Automatic Speech Recognition (ASR), Speaker Identification/Speaker Verification (SI/SV), and Text-to-Speech (TTS) Resources
RFC 4313
Technical Summary
This document outlines the needs and requirements for a protocol to control
distributed speech processing of audio streams. By speech processing, this
document specifically means automatic speech recognition (ASR); speaker
recognition, which includes both speaker identification (SI) and speaker
verification (SV); and text-to-speech (TTS). Other IETF protocols, such as
SIP and RTSP, address rendezvous and control for generalized media streams.
However, speech processing presents additional requirements that none of the
extant IETF protocols address.
Working Group Summary
The SPEECHSC Working Group supported the advancement of the document.
Protocol Quality
This document was reviewed for the IESG by Jon Peterson.