Use of Natural Language for Agent Communication
draft-verma-dmsc-nlip-notes-00
This document is an Internet-Draft (I-D).
Anyone may submit an I-D to the IETF.
This I-D is not endorsed by the IETF and has no formal standing in the
IETF standards process.
| Document | Type | Active Internet-Draft (individual) | |
|---|---|---|---|
| Authors | Dinesh C. Verma , Elisa Bertino , Abhay Ratnaparkhi , Sean Hughes , Luyi Xing , Zichuan Li , Xiaojing Liao , Jian Cui , Rasit Topaloglu , Mohamed Rahouti | ||
| Last updated | 2026-02-11 | ||
| RFC stream | (None) | ||
| Intended RFC status | (None) | ||
| Formats | |||
| Stream | Stream state | (No stream defined) | |
| Consensus boilerplate | Unknown | ||
| RFC Editor Note | (None) | ||
| IESG | IESG state | I-D Exists | |
| Telechat date | (None) | ||
| Responsible AD | (None) | ||
| Send notices to | (None) |
draft-verma-dmsc-nlip-notes-00
DMSC Working Group D. Verma
Internet-Draft IBM
Intended status: Informational E. Bertino
Expires: 14 August 2026 Purdue University
A. Ratnaparkhi
eBay Inc.
S. Hughes
Service Now
L. Xing
Z. Li
X. Liao
J. Cui
University of Illinois at Urbana-Champaign
R. Topaloglu
Marist University
M. Rahouti
Fordham University
10 February 2026
Use of Natural Language for Agent Communication
draft-verma-dmsc-nlip-notes-00
Abstract
In current communication protocol designs, the prevalent practice is
to define a strict Application Programming Interface (API) for
different types of interactions among software entities. There is an
explosion of APIs which makes interoperability hard, and the
standards for interoperability fragile. Specifically for agent to
agent communications, defining strict APIs for interactions such as
registration, discovery, telemetry and debugging creates significant
operational challenges. In this draft, we argue that a flexible
design option -- where generative AI is used to create a flexible
communication protocol for APIs, can lead to a common and flexible
way for a single uniform API to be used between agents, and enables a
better communication paradigm.
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at https://datatracker.ietf.org/drafts/current/.
Verma, et al. Expires 14 August 2026 [Page 1]
Internet-Draft BGP-LS-Inter-AS-Ext February 2026
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on 14 August 2026.
Copyright Notice
Copyright (c) 2026 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents (https://trustee.ietf.org/
license-info) in effect on the date of publication of this document.
Please review these documents carefully, as they describe your rights
and restrictions with respect to this document. Code Components
extracted from this document must include Revised BSD License text as
described in Section 4.e of the Trust Legal Provisions and are
provided without warranty as described in the Revised BSD License.
Table of Contents
1. Conventions used in this document . . . . . . . . . . . . . . 2
2. Terminology . . . . . . . . . . . . . . . . . . . . . . . . . 2
3. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 3
4. Other Modalities . . . . . . . . . . . . . . . . . . . . . . 4
5. Existing Standard . . . . . . . . . . . . . . . . . . . . . . 5
6. New Conventions . . . . . . . . . . . . . . . . . . . . . . . 5
7. Security Considerations . . . . . . . . . . . . . . . . . . . 6
8. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 6
9. Acknowledgement . . . . . . . . . . . . . . . . . . . . . . . 6
10. References . . . . . . . . . . . . . . . . . . . . . . . . . 6
10.1. Normative References . . . . . . . . . . . . . . . . . . 6
10.2. Informative References . . . . . . . . . . . . . . . . . 7
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 7
1. Conventions used in this document
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC 2119 [RFC2119] .
2. Terminology
The following terms are defined in this document:
* NLIP: Natural Language Interaction Protocol
Verma, et al. Expires 14 August 2026 [Page 2]
Internet-Draft BGP-LS-Inter-AS-Ext February 2026
* ntohs: Network to Host Short
* htons: Host to Network Short
3. Introduction
Contemporary application-level protocols and Application Programming
Interface (API) are designed by specifying a function name (or URI)
and a data structure. All communicating endpoints are required to
use the same data structure, and this data structure is transferred
between them using a structured representation such as JSON. This
design approach causes significant interoperability issues when the
version upgrade of any software causes a change in the data structure
that needs to be exchanged.
A more flexible communication paradigm can be obtained by leveraging
the ability of large language models to translate between
unstructured natural language and a structured representation such as
JSON or the internal structure of a programming language effectively.
The communicating parties can transmit a natural language
representation among each other, and each endpoint can maintain its
own internal representation for the structure being maintained for
its tasks.
In this draft, we argue that this flexible exchange offers various
benefits in agent to agent communications.
The natural language text exchanged between communicating parties has
the same semantics as the structures maintained at various
communication endpoints, but the format is natural language. A LLM
at each end point can translate the wire format into the local
structure format. This allows for the internal representation of
structures at endpoints to be independent of each other. This
independence allows for improved interoperability and the freedom for
each end-point to upgrade its internal structures as needed.
As an example, let us consider the simple but illustrative case where
one of the endpoints is an agent that is sending its communication
endpoint information to the other endpoint which is a registrar
providing registration services for agents. The registrar maintains
information about each endpoint as a URL, while the agent maintains
its internal information as a local host and port. The agent can
register itself to the registrar using plain text - "I am running on
host hostname on port 1430" or an equivalent natural language text
representation. The registrar can use a local LLM to translate this
text to an internal URL representation.
Verma, et al. Expires 14 August 2026 [Page 3]
Internet-Draft BGP-LS-Inter-AS-Ext February 2026
Using natural language provides two key advantages. It provides
resilience against upgrades of the endpoints. Suppose the registrar
is upgraded to a new version in which it supports each agent
information as a structure consisting of a port, a protocol and a
host address instead of the URL. This upgrade can be made without
any impact to any of the agents. On the other hand, if the
traditional approach of defining a fixed structure on the wire were
to be used, the upgrade of the registrar would require changes in
each of the agents interacting with it.
The second advantage is the simplification in the versioning of
protocols. During any definition of a standard protocol, some key
features or functions may be missed due to a variety of reasons. If
a new protocol version is defined, the protocol needs to be defined
to support a version number and a careful handling of mismatches in
the structural differences across versions. By breaking the linkage
between the internal representation of endpoint structures and the
wire structure, natural language simplifies version management
significantly.
The separation of the wire format from the internal structures can be
viewed as a modern take on the concept of ntohs and htons - concepts
developed during the early stages of computer communications
development to promote interoperability between computer systems.
When big-endian machines needed to talk to little- endian machines,
these two macros translated between network and host structure
formats. NLIP is providing the same functionality at the application
level, leveraging the ability of LLMs to translate between
unstructured natural text and structured representations.
It is recommended that two communicating parties exchange information
about their internal representation with each other. When an agent
registers with the registrar in the above example, the registrar can
send a natural text representation of the internal information to the
agent to validate that it has translated the text correctly to an
internal representation and then translated it back. The agent can
do its own internal translation and validate that the information is
consistent with its internal representation. The same exchange also
helps to validate if a field is missing or not incorporated properly.
4. Other Modalities
Natural language is not the only modality which is needed for general
communication among software entities. Some agents may require other
modality such as images, video or audio/speech as part of their
operation. Nevertheless, the same principle of separating the wire
format from the internal structures and capabilities of the software
application can be used.
Verma, et al. Expires 14 August 2026 [Page 4]
Internet-Draft BGP-LS-Inter-AS-Ext February 2026
For each modality, the wire exchange format can specify the modality
of the exchange. However, it can leave the task of interpreting the
contents and internal structure of the exchanged information to the
local AI model. The agent can use its local AI model to interpret
the format and process it.
This is an analogue of the way humans communicate using the different
senses. Our eyes process light/visual modality, our ears process the
sound/speech modality etc. Our internal intelligence processes these
signals without relying on the strict adherence to internal
structures. We can simply identify the information modality and let
the agent process the information.
5. Existing Standard
There is an existing standard which is designed based on this
principle of using natural language for exchange among agents. This
standard - Natural Language Interaction Protocol (NLIP) [ECMA430]--
defines a way for exchanged information to simply define the modality
, and let interacting agents interpret the exchanged content without
requiring a strict structure on the wire.
Experiences implementing NLIP based services and the standard have
shown that the approach is viable, has good performance and is
secure. For development and debugging, the NLIP protocol allowed for
a single interface for trouble-shooting, and our performance
evaluation showed that the protocol added little to no overhead, and
in many cases out-performed registration functions using a more
traditional API.
As the IETF community works towards defining conventions for agent to
agent communications, we want to bring this design approach to their
attention, since leveraging a protocol like NLIP will provide
tremendous flexibility and operational simplicity to software
implementing the functions of agents.
6. New Conventions
With the presence of existing standard of NLIP, the task of defining
common functions for any agent to agent communications becomes that
of defining the semantics of the information that should be
transferred, as opposed to defining a rigid structure for the
interaction. When agents need to communicate with each other, they
need to perform functions such as discover other agents, register
themselves with a registrar, or obtain the governing policies for
security or data management from other agents. While NLIP provides
the basic envelop for transfer of the information, new conventions or
standards would need to be developed for each of these functions.
Verma, et al. Expires 14 August 2026 [Page 5]
Internet-Draft BGP-LS-Inter-AS-Ext February 2026
The key difference from traditional approach is that one does not
define a rigid structure (e.g. a JSON schema or a XML structure) for
interaction but only the semantic content of the information to be
exchanged.
We would consider it a task of the IETF to define the exact
semantics. As an example for the task of registration, the semantic
specification may define that the port number and server host name of
the agent should be identified, along with the identity of the owning
organization and the provider of security certificate. This
information can be expressed in natural language and converted into
the local structured representation.
Similarly, for the task of discovery, the agent looking for specific
type of remote agent can describe their requirements in natural
language, instead of defining a rigid schema for discovery of remote
agents. Each agent can convert the requirements to their local
structure to decide whether or not they match the requirements. IETF
needs to define the semantic content for discovery requests -- e.g.
specify that agents must define the type of capabilities they are
looking for - such as image analysis, speech to text conversion,
security requirements, performance constraints, or billing limits
etc. There is no need to define a specific structure for the task.
By following this convention, the standardization process can be made
more streamlined and agents implementing the protocol interoperate
better with other agents.
7. Security Considerations
This document should not affect the security of the Internet.
8. IANA Considerations
This document includes no request to IANA.
9. Acknowledgement
This template uses extracts from templates written by Pekka Savola,
Elwyn Davies and Henrik Levkowetz.
10. References
10.1. Normative References
[ECMA430] Ecma International, "Natural Language Interaction
Protocol", Standard ECMA-430, Edition 1, December 2025,
<https://www.ecma-international.org>.
Verma, et al. Expires 14 August 2026 [Page 6]
Internet-Draft BGP-LS-Inter-AS-Ext February 2026
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119,
DOI 10.17487/RFC2119, March 1997,
<https://www.rfc-editor.org/info/rfc2119>.
10.2. Informative References
Authors' Addresses
Dinesh Verma
IBM
P.O. Box 218, Yorktown Heights
New York, 10598
United States of America
Email: dverma@us.ibm.com
Elisa Bertino
Purdue University
CS Department
West Lafayette , IN 47907
United States of America
Email: bertino@purdue.edu
Abhay Ratnaparkhi
eBay Inc.
7700 West Parmer Lane
Austin, 78729
United States of America
Email: abratnaparkhi@ebay.com
Sean Hughes
Service Now
2225 Lawson Lane
Santa Clara , CA 95054
United States of America
Email: sean.hughes@servicenow.com
Luyi Xing
University of Illinois at Urbana-Champaign
201 N Goodwin Ave
Urbana , IL 61801
United States of America
Email: lxing2@illinois.edu
Verma, et al. Expires 14 August 2026 [Page 7]
Internet-Draft BGP-LS-Inter-AS-Ext February 2026
Zichuan Li
University of Illinois at Urbana-Champaign
201 N Goodwin Ave
Urbana , IL 61801
United States of America
Email: zichuan7@illinois.edu
Xiaojing Liao
University of Illinois at Urbana-Champaign
201 N Goodwin Ave
Urbana , IL 61801
United States of America
Email: xjliao@illinois.edu
Jian Cui
University of Illinois at Urbana-Champaign
201 N Goodwin Ave
Urbana , IL 61801
United States of America
Email: jiancui3@illinois.edu
Rasit Topaloglu
Marist University
3399 North Road
Poughkeepsie , NY 12601
United States of America
Email: rasit.topaloglu@marist.edu
Mohamed Rahouti
Fordham University
113 W 60th Street
New York , NY 10023
United States of America
Email: mohamed@fordham.edu
Verma, et al. Expires 14 August 2026 [Page 8]