Skip to main content

Multicast On-path Telemetry using IOAM
draft-ietf-mboned-multicast-telemetry-12

Revision differences

Document history

Date Rev. By Action
2024-08-27
(System)
Received changes through RFC Editor sync (changed state to RFC, created became rfc relationship between draft-ietf-mboned-multicast-telemetry and RFC 9630, changed IESG state to RFC …
Received changes through RFC Editor sync (changed state to RFC, created became rfc relationship between draft-ietf-mboned-multicast-telemetry and RFC 9630, changed IESG state to RFC Published)
2024-08-20
12 (System) RFC Editor state changed to AUTH48-DONE from AUTH48
2024-08-05
12 (System) RFC Editor state changed to AUTH48
2024-08-05
12 (System) RFC Editor state changed to RFC-EDITOR from EDIT
2024-07-15
12 Carlos Pignataro Closed request for Last Call review by OPSDIR with state 'Overtaken by Events'
2024-07-15
12 Carlos Pignataro Assignment of request for Last Call review by OPSDIR to Menachem Dodge was marked no-response
2024-06-28
12 (System) IANA Action state changed to RFC-Ed-Ack from Waiting on RFC Editor
2024-06-28
12 (System) IANA Action state changed to Waiting on RFC Editor from In Progress
2024-06-28
12 (System) IANA Action state changed to In Progress from Waiting on Authors
2024-06-28
12 (System) IANA Action state changed to Waiting on Authors from In Progress
2024-06-25
12 Mike McBride New version available: draft-ietf-mboned-multicast-telemetry-12.txt
2024-06-25
12 Mike McBride New version accepted (logged-in submitter: Mike McBride)
2024-06-25
12 Mike McBride Uploaded new revision
2024-06-25
11 (System) RFC Editor state changed to EDIT
2024-06-25
11 (System) IESG state changed to RFC Ed Queue from Approved-announcement sent
2024-06-25
11 (System) Announcement was received by RFC Editor
2024-06-25
11 (System) IANA Action state changed to In Progress
2024-06-25
11 (System) Removed all action holders (IESG state changed)
2024-06-25
11 Jenny Bui IESG state changed to Approved-announcement sent from IESG Evaluation::AD Followup
2024-06-25
11 Jenny Bui IESG has approved the document
2024-06-25
11 Jenny Bui Closed "Approve" ballot
2024-06-25
11 Jenny Bui Ballot approval text was generated
2024-06-25
11 Gunter Van de Velde
[Ballot comment]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-09.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Thanks for writing up this …
[Ballot comment]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-09.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Thanks for writing up this draft and to start introducing Telemetry to multicast.

One of the items that confused me when reading trough the document was that the I and N bit were new. I have troubles understanding why a single bit is not sufficient? There are not that many flag field available, hence being conservatives is not a bad habit. Also, what is recipient node to do if t received the node-id/interface-id when it should not or if if it should receive it, but it wasn't added?

The section about Applicability introduced me some confusion on what it was trying to achieve. If the intent is to say that the introduced technology procedures can be used on these multicast control plane technologies, then why not have a short list without all the details? It makes it hard read, especially the X-PMSI section (so many acronyms in that section, not all have a reference i think)

Below you find 6 different DISCUSS items to be looked at and to see how to resolve. I think some will be easy to resolve, others may be less trivial.

And finally, in the COMMENTS section i have added a series of comments with additional context and classified them into [minor] and [major].

I hope this review and the various observations provide a way to help improve the document.

G/

#DISCUSS items resolved in draft-ietf-mboned-multicast-telemetry-10.txt
#======================================================================
##[resolved] DISCUSS1
Some multicast tree bilding technologies have been mentioned, while another set was silently ignored (maybe due to historical or lesser used?)
Can these be taken into the story flow and mentioned if considered, not considered or deemend irrelivant for Telemetry extensions?

i.e. PIM-SM (Protocol Independent Multicast - Sparse Mode), PIM-DM (Protocol Independent Multicast - Dense Mode), CBT (Core-Based Tree), DVMRP (Distance Vector Multicast Routing Protocol), MOSPF (Multicast Extensions to OSPF), Bidir-PIM (Bidirectional PIM), SR Replication Segments (SR-MPLS and SRv6 (work in progress)

##[resolved] DISCUSS2
It is unclear if the 2nd method documented in the section "Modifications to Existing Solutions" needs modification. Maybe the exact nature of the modification can be more explicit documented?

##[resolved] DISCUSS3
When "Per-hop postcard using IOAM DEX" is used and per hop it seems operationally desireable to achieve such based upon sampled packets within a multicast flow. The sampling requirements for multicast may be different from unicast traffic. This is not discussed and considered. Is there a reason it is not discussed?

##[resolved] DISCUSS4
How is postcard based telemetry achievable for high volume mcast flows when retaining each single packet, then process the telemetry and finally forwarding once all telemetry is processed. Maybe this solution is intended for low volme mcast? (assuming there is some identification what low volume means for the branch node).

##[resolved] DISCUSS55
Eric Vyncke pointed out that IPv6 needs to be considered, or at least not excluded. (i support his DISCUSS)

##[resolved] DISCUSS6
The formal procedures when using BIER are a little light. The applicability section talks about "would be possible" or has handwaving on the different encapsulation types of BIER. The various types should maybe be explicit mentioned and associated formal procedures discussed?

##Update after draft-ietf-mboned-multicast-telemetry-11: BIER section removed from the draft to resolve the blocking discuss

#DETAILED COMMENTS
#=================
##classified as [minor] and [major]

91   Multicast has many use cases.  For example, it can be used by
92   residential broadband customers across operator networks, private
93   MPLS customers, and internal customers within corporate intranet.
94   Multicast provides real time interactive online meetings or podcasts,
95   IPTV, and financial markets real-time data, which all have a reliance
96   on UDP's unreliable transport.  End-to-end QOS, therefore, should be
97   a critical component of multicast deployment in order to provide a
98   good end user experience.  In multicast real-time media streaming,
99   loss of a single packet containing a reference frame can result in
100   the inability of thousands of receivers to decode a whole sequence of
101   packets called Group-of-Picture, introducing black picture for
102   periods of a few seconds.  Unexpected long delay in propagation of a
103   packet in such real-time media streaming may equally result in the
104   packet not being received and create the same results.  Multicast
105   packet drops and delay can therefore severely affect the application
106   performance and user experience.

[minor]
This section seems to flow not so well when reading and observations are made with seemingly handwaiving to what is believed well known artifacts. In general i think this paragraph tries to describe that mcast uses UDP and that it is inherently unreliable, and that a single packet loss may result in amplified impacts across many receivers. Not only streaming servies should maybe be flaged, but loss of single packet in financial envirenment (it was mentioned in the text, but not mentioned in the negative imacts) may cause a wrong tick and inequality between brokers using such data. For these, some informative references could be appreciated.

What about following initial rewrite, assuming references are added during later moment:
"Multicast has numerous use case environments, including residential broadband services across operator networks, private MPLS customer networks, and internal corporate intranets. It enables applications such as real-time interactive online meetings, podcasts, IPTV, and financial market real-time data feeds, all of which rely on the unreliable transport of UDP.

To ensure a positive end-user experience, superior end-to-end Quality of Service (QoS) is essential in multicast deployments. In multicast real-time media streaming, the loss of a single packet containing a reference frame can prevent thousands of receivers from decoding an entire sequence of packets, known as a Group-of-Pictures (GoP), resulting in a black screen for several seconds. Similarly, unexpected delays in packet propagation can cause packets to be received late or not at all, leading to the same issues. Therefore, packet drops and delays in multicast streaming can significantly degrade application performance and user experience.
"

108   It is important to monitor the performance of the multicast traffic.
109   New on-path telemetry techniques such as In-situ OAM (IOAM)
110   [RFC9197], IOAM Direct Export (DEX) [RFC9326] IOAM Marking-based
111   Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid
112   Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step] are useful and
113   complementary to the existing active OAM performance monitoring
114   methods (e.g., ICMP ping [RFC0792]), provide promising means to
115   directly monitor the network experience of multicast traffic.
116   However, multicast traffic has some unique characteristics which pose
117   some challenges on applying such techniques in an efficient way.

[minor]
Fixed some typos and readability in the textblob with following proposal:
"
It is essential to monitor the performance of multicast traffic. New on-path telemetry techniques, such as In-situ OAM (IOAM) [RFC9197], IOAM Direct Export (DEX) [RFC9326], IOAM Marking-based Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step], complement existing active OAM performance monitoring methods like ICMP ping [RFC0792]. These techniques offer promising means to directly monitor multicast traffic. However, multicast traffic's unique characteristics present challenges in applying these techniques efficiently.
"

119   The IP multicast packet data for a particular (S, G) state is
120   identical from one branch to another on its way to multiple
121   receivers.  When adding IOAM trace data to multicast packets, each
122   replicated packet would keep the telemetry data for its entire
123   forwarding path.  Since the replicated packets all share common path
124   segments, redundant data will be collected for the same original
125   multicast packet.  Such redundancy consumes extra network bandwidth
126   unnecessarily.  For a large multicast tree, such redundancy is
127   considerable.  Alternatively, it could be more efficient to collect
128   the telemetry data using solutions such as IOAM DEX to eliminate the
129   data redundancy.  However, IOAM DEX lacks a branch identifier, making
130   telemetry data correlation and multicast-tree reconstruction
131   difficult.

[minor]
Fixing some typos and making the text flow easier to read. THis could use an example of how such IOAM trace data is redundant. for common segments.

"The IP multicast packet data for a particular (S, G) state remains identical across different branches to multiple receivers. When IOAM trace data is added to multicast packets, each replicated packet retains telemetry data for its entire forwarding path. This results in redundant data collection for common path segments, unnecessarily consuming extra network bandwidth. For large multicast trees, this redundancy is substantial. Using solutions like IOAM DEX could be more efficient by eliminating data redundancy, but IOAM DEX lacks a branch identifier, complicating telemetry data correlation and multicast tree reconstruction.
"

140 2.  Requirements for Multicast Traffic Telemetry

142   Multicast traffic is forwarded through a multicast tree.  With PIM
143   and P2MP, the forwarding tree is established and maintained by the
144   multicast routing protocol.  With BIER, no state is created in the
145   network to establish a forwarding tree; instead, a bier header
146   provides the necessary information for each packet to know the egress
147   points.  Multicast packets are only replicated at each tree branch
148   fork node for efficiency.

[major]
This sections discusses various technologies to build mcast trees, however not all of them are mentioned. Maybe the following can be added in addition to BIER to make the overview more complete.

#PIM-SM (Protocol Independent Multicast - Sparse Mode):
* Builds shared trees rooted at a Rendezvous Point (RP) and can switch to source-based trees for more efficient delivery.

#PIM-DM (Protocol Independent Multicast - Dense Mode):
* Initially floods multicast traffic to all nodes and then prunes back the unwanted branches.

#CBT (Core-Based Tree):
* Constructs a shared tree rooted at a core router, minimizing state information in the network.
* RFC 2189: "Core Based Trees (CBT) Multicast Routing Architecture"
* RFC 2201: "Core Based Trees (CBT) Multicast Routing Protocol Specification"

#DVMRP (Distance Vector Multicast Routing Protocol):
* Uses distance vector algorithms to build source-based trees, suitable for small to medium-sized networks.

#MOSPF (Multicast Extensions to OSPF):
* Extends OSPF to support multicast by building source-based trees.

#Bidir-PIM (Bidirectional PIM):
* Builds bidirectional shared trees to support efficient many-to-many communication.

#SR Replication Segments (SR-MPLS and SRv6 (work in progress)

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

[minor]
s/a few of which are/a non exclusive list is/

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

153   *  Reconstruct and visualize the multicast tree through data plane
154       monitoring.

156   *  Gather the multicast packet delay and jitter performance on each
157       path.

159   *  Find the multicast packet drop location and reason.

161   *  Gather the VPN state and tunnel information in case of P2MP
162       multicast.

[major]
This list was created with the solution being proposed already in mind and what it intends to fullfill for multicast. It is as result not fully objective list.

I believe important for multicast telemetry is also:

Scalability:
* Handle large-scale networks with numerous multicast groups and receivers.
Minimal Overhead:
* Ensure telemetry collection does not significantly impact network performance or consume excessive bandwidth.
Real-Time Data Collection:
* Provide timely insights for monitoring and troubleshooting.
Accuracy and Precision:
* Capture detailed and accurate network performance metrics.
Compatibility:
* Integrate with existing network protocols and telemetry systems.
Security:
* Protect telemetry data from unauthorized access and tampering.
Support for Various Telemetry Techniques:

164   In order to meet these requirements, we need the ability to directly
165   monitor the multicast traffic and derive data from the multicast
166   packets.  The conventional OAM mechanisms, such as multicast ping
167   [RFC6450] and trace [RFC8487], are not sufficient to meet these
168   requirements.

[minor]
I believe there is more to the eye then what is listed here. Maybe this can be taken as an opportunity to isolate from the many requirements those requirements that are addressed by the proposed solution? THis will lead to a more objective document by showing requiremets that were maybe not met.

184   If the IOAM trace option is used for on-path data collection, the
185   partial trace data will also be replicated into the packet copy for
186   each branch.  The end result is that, at the multicast tree leaves,
187   each copy of the multicast packet has a complete trace.  Most of the
188   data (except data from the last leaf branch) appear in multiple
189   copies while only one copy is sufficient.  Data redundancy introduces
190   unnecessary header overhead, wastes network bandwidth, and
191   complicates the data processing.  The larger the multicast tree, or
192   the longer the multicast path, the more severe the redundancy problem
193   becomes.

[minor]
The following rewrite provides a flow that is easier to read
"When the IOAM trace option is utilized for on-path data collection, partial trace data is replicated into the packet copy for each branch of the multicast tree. Consequently, at the leaves of the multicast tree, each copy of the multicast packet contains a complete trace. This results in data redundancy, as most of the data (except from the final leaf branch) appears in multiple copies, where only one is sufficient. This redundancy introduces unnecessary header overhead, wastes network bandwidth, and complicates data processing. The larger the multicast tree or the longer the multicast path, the more severe the redundancy problem becomes.
"

195   The postcard-based solutions (e.g., IOAM DEX), can be used to
196   eliminate such data redundancy, because each node on the tree only
197   sends a postcard covering local data.  However, they cannot track and
198   correlate the tree branches properly due to the lack of branching
199   information, so they can bring confusion about the multicast tree
200   topology.  For example, in a multicast tree, Node A has two branches,
201   one to Node B and the other to node C; further, Node B leads to Node
202   D and Node C leads to Node E.  When applying postcard-based methods,
203   one cannot tell whether or not Node D(E) is the next hop of Node B(C)
204   from the received postcards alone, unless one correlates the
205   exporting nodes with knowledge about the tree collected by other
206   means (e.g., mtrace).  Such correlation is undesirable because it
207   introduces extra work and complexity.

[major]
It is unclear what the D(E) and/or the B(C) is representing. I can guess what it
means, but for a standards track document guessing is discouraged

Would the following description be correct analysis?
"The postcard-based solutions, such as IOAM Direct Export (DEX), can eliminate data redundancy because each node on the multicast tree sends a postcard with only local data. However, these methods cannot accurately track and correlate tree branches due to the absence of branching information. For instance, in a multicast tree where Node A branches to Node B and Node C, and further, Node B leads to Node D and Node C leads to Node E, it is impossible to determine from postcards alone whether Node D is a continuation of Node B or Node C. This ambiguity necessitates additional correlation using external knowledge about the tree, such as through mtrace, which introduces extra complexity and effort.
"

213 4.  Modifications to Existing Solutions

215   We provide two solutions to address the above issues.  One is based
216   on IOAM DEX and requires an extension to the instruction header of
217   the IOAM DEX Option.  The second solution combines the IOAM trace
218   option and postcards for redundancy removal.

[major]
Two solutions for the same problem in a single standards track document seems to make it not trivial to fully implement the proposed standard. Would it make sense to flag one proposal as the preferred one and the other as the less preferred one? or maybe ballot conditions for when the first proposal is preferred above the second proposal and visa versa?
What are the pro's and cons of each?

220 4.1.  Per-hop postcard using IOAM DEX

222   One way to mitigate the postcard-based telemetry's tree tracking
223   weakness is to augment it with a branch identifier field.  Note that

[major]
Not being overly familiar with IOAM, is this intended for each single packet of the mcast flow?
or will this logic happen for a subset of identified packets? processing each packet seems not trivial in high volume mcast flows? This could be a major operational usage issue

265   Conforming to the node ID specification in IOAM [RFC9197], the node
266   ID is a 3-octet unsigned integer.  The interface index is a two-octet
267   unsigned integer.  As shown in Figure 2, the branch ID consumes 8
268   octets in total.  The three unused octets MUST be set to 0.

[major]
What to do if the recipient gets these and they are not set to 0? drop, process, alert, etc?
What if there are so many interfaces resulting in Interface index overflow?

280   Figure 3 shows that the branch ID is carried as an optional field
281   after the flow ID and sequence number optional fields in the IOAM DEX
282   option header.  Two bits "N" and "I" (i.e., the third and fourth bits
283   in the Extension-Flags field) are reserved to indicate the presence
284   of the optional branch ID field.  "N" stands for the Node ID and "I"
285   stands for the interface index.  If "N" and "I" are both set to 1,
286   the optional multicast branch ID field is present; otherwise it is
287   absent.

[major]
It was not entirely clear why exactly these bits were selected? And why there are two bits?
Would a single bit not be good enough? with 2 bits there are 4 states possible, and only one causes that the information is present. What in the other three states? what if the info is there but shouldn't or what if it should be there, but the branch info is not? What happens in those situations

311 4.2.  Per-section postcard for IOAM Trace

[minor]
Maybe the intend of what postcard based IOAM trace can be helpful for a reader of the specification. What about adding something as the following proposed section
"
The postcard-based method for IOAM trace works by each node in the network independently sending "postcards," which are packets containing telemetry data about the packet processing at that specific node. These postcards are sent directly to a collection system and not carried within the data packet itself. This method eliminates redundancy because each node only reports its own data, but it also introduces challenges in reconstructing the full path and topology of the multicast tree due to the lack of inherent branching information in the individual postcards. This reconstruction often requires additional correlation using external tools or data, adding complexity.
"

313   The second solution is a combination of the IOAM trace option and the
314   postcard-based telemetry.  To avoid data redundancy, at each branch
315   fork node, the trace data accumulated up to this node is exported by
316   a postcard before the packet is replicated.  In this solution, each

[major]
How is this achievable for high volume mcast flows to retain each single packet, processing the telemetry and then forwarding once all telemetry is processed. Maybe this solution is intended for low volume mcast? (assuming there is some identification what low volume means for the branch node).


320   the trace of each branch.  This is also necessary because each
321   replicated multicast packet can have different telemetry data
322   pertaining to this particular copy (e.g., node delay, egress
323   timestamp, and egress interface).  As a consequence, the local data
324   exported by each branch fork node can only contain partial data
325   (e.g., ingress interface and ingress timestamp).

[major]
This text does not truly compute for me. postcards are not carried within
the packet itself, but sent independently. Hence i am slightly lost how this causes different telemetry for each copy? is that not always the situation, replicated or not?

353   There is no need to modify the IOAM trace option header format as
354   specified in [RFC9197].  We just need to configure the branch fork
355   nodes to export the postcards and refresh the IOAM header and data
356   (e.g., clear the node data list and reset the Remaining Length
357   field).

[minor]
Does this means that everything required for this to work already exists? If no, then what piece of encoding and formal procedures is missing?

[major]
What does the formal procedure to clear node data list and length fields exactly mean?

359 5.  Application Considerations for Multicast Protocols

[major]
What about segment routing replication segments?
https://www.rfc-editor.org/rfc/rfc9524.html
There seems some ongoing work wrt SRV6 replication segments (currently still work in progress and expired, but nevertheless one can expect this to be developed sooner or later)

From a high level perspective this sections seems slightly overkill and i am not sure it adds a lot of value. Maybe i am missing a introduction of what this section is all about?. If this is saying that telemetry can be used for these types of tunnels, is there then need for so much text and acronyms? Why not simply list all of them and reduce the complete section to some bullet points?

366   diagnostic information.  Unlike unicast traceroute, Mtrace2 traces
367   the path that the tree building messages follow from receiver to
368   source.  It is usually initiated from an Mtrace2 client by sending an

[minor]
These follow the control plane messages to build the tree? for all tree building technologies?
how would that work for things like MOSPF for example? Maybe there is assumption that this is some PIM style of messaging involved?

382   status data through direct measurements.  There are various multicast
383   protocols that are used to forward the multicast data.  Each will
384   require their own unique on-path telemetry solution.  Mtrace2 doesn't

[minor]
I am not sure what this is saying exactly with 'multicast data'. I assume that this is saying that there are multiple multicast protocols to build forwarding trees? or is the 'multicast data' referring to something else?

388 5.2.  Application in PIM

[major]
What about IPv6
What about PIM-BIDIR? PIM-DM (even though it is non-optimal technology)
I am not sure what the intend of this section is? Is it only to say the telemetry can be useful when PIM is used?

405 5.3.  Application of MVPN X-PMSI Tunnel Encapsulation Attribute

[major]
What is this section trying to achieve? so many acronyms and very different tunnel types.

433 5.4.  Application in BIER

[major]
This section is not providing an IOAM procedures, but seems to be saying that there are BIER requirements and that there is possibility for adding additional metadata in the BIER headers. However no formal procedures are provided, but only indicated. if there are formal procedures to make such mapping, then that should be made explicitly cristal clear in the prescriptive text on how to achieve such

454 6.  Security Considerations

[minor]
high volume mcast streams can be filling up BW very rapidly. IOAM sampling will be important to protect the infrastructure
2024-06-25
11 Gunter Van de Velde [Ballot Position Update] Position for Gunter Van de Velde has been changed to No Objection from Discuss
2024-06-24
11 Mike McBride New version available: draft-ietf-mboned-multicast-telemetry-11.txt
2024-06-24
11 Mike McBride New version accepted (logged-in submitter: Mike McBride)
2024-06-24
11 Mike McBride Uploaded new revision
2024-06-17
10 Éric Vyncke
[Ballot comment]
Thanks for addressing my previous DISCUSS points ( https://mailarchive.ietf.org/arch/msg/mboned/_fcjFzCxSMIXlkRkUwVSvSLSEgg/ ).

Please note that there is at least one non-blocking COMMENT that is not …
[Ballot comment]
Thanks for addressing my previous DISCUSS points ( https://mailarchive.ietf.org/arch/msg/mboned/_fcjFzCxSMIXlkRkUwVSvSLSEgg/ ).

Please note that there is at least one non-blocking COMMENT that is not addressed, the I-D would probably benefit by addressing them (e.g., s/bier header/BIER header/ in section 2).
2024-06-17
10 Éric Vyncke [Ballot Position Update] Position for Éric Vyncke has been changed to No Objection from Discuss
2024-06-04
10 Gunter Van de Velde
[Ballot discuss]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-10.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Many thanks for addressing most …
[Ballot discuss]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-10.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Many thanks for addressing most of the observed blocking DISCUSS observations.
Below you find 1 remaining DISCUSS item.

#remaining DISCUSS item
#======================
##DISCUSS6
The formal procedures when using BIER are a little light. The applicability section talks about "would be possible" or has handwaving on the different encapsulation types of BIER. The various types should maybe be explicit mentioned and associated formal procedures discussed?

##Update after draft-ietf-mboned-multicast-telemetry-10: the diff between version -09 and -10 does not show any change in formal procedures for BIER for the various types of encapsulation. The draft seems to indicate that there is various formal procedures for method and header encoding per encapsulation method. Should those not be explicit outlined?
2024-06-04
10 Gunter Van de Velde
[Ballot comment]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-09.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Thanks for writing up this …
[Ballot comment]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-09.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Thanks for writing up this draft and to start introducing Telemetry to multicast.

One of the items that confused me when reading trough the document was that the I and N bit were new. I have troubles understanding why a single bit is not sufficient? There are not that many flag field available, hence being conservatives is not a bad habit. Also, what is recipient node to do if t received the node-id/interface-id when it should not or if if it should receive it, but it wasn't added?

The section about Applicability introduced me some confusion on what it was trying to achieve. If the intent is to say that the introduced technology procedures can be used on these multicast control plane technologies, then why not have a short list without all the details? It makes it hard read, especially the X-PMSI section (so many acronyms in that section, not all have a reference i think)

Below you find 6 different DISCUSS items to be looked at and to see how to resolve. I think some will be easy to resolve, others may be less trivial.

And finally, in the COMMENTS section i have added a series of comments with additional context and classified them into [minor] and [major].

I hope this review and the various observations provide a way to help improve the document.

G/

#DISCUSS items resolved in draft-ietf-mboned-multicast-telemetry-10.txt
#======================================================================
##[resolved] DISCUSS1
Some multicast tree bilding technologies have been mentioned, while another set was silently ignored (maybe due to historical or lesser used?)
Can these be taken into the story flow and mentioned if considered, not considered or deemend irrelivant for Telemetry extensions?

i.e. PIM-SM (Protocol Independent Multicast - Sparse Mode), PIM-DM (Protocol Independent Multicast - Dense Mode), CBT (Core-Based Tree), DVMRP (Distance Vector Multicast Routing Protocol), MOSPF (Multicast Extensions to OSPF), Bidir-PIM (Bidirectional PIM), SR Replication Segments (SR-MPLS and SRv6 (work in progress)

##[resolved] DISCUSS2
It is unclear if the 2nd method documented in the section "Modifications to Existing Solutions" needs modification. Maybe the exact nature of the modification can be more explicit documented?

##[resolved] DISCUSS3
When "Per-hop postcard using IOAM DEX" is used and per hop it seems operationally desireable to achieve such based upon sampled packets within a multicast flow. The sampling requirements for multicast may be different from unicast traffic. This is not discussed and considered. Is there a reason it is not discussed?

##[resolved] DISCUSS4
How is postcard based telemetry achievable for high volume mcast flows when retaining each single packet, then process the telemetry and finally forwarding once all telemetry is processed. Maybe this solution is intended for low volme mcast? (assuming there is some identification what low volume means for the branch node).

##[resolved] DISCUSS55
Eric Vyncke pointed out that IPv6 needs to be considered, or at least not excluded. (i support his DISCUSS)

#DETAILED COMMENTS
#=================
##classified as [minor] and [major]

91   Multicast has many use cases.  For example, it can be used by
92   residential broadband customers across operator networks, private
93   MPLS customers, and internal customers within corporate intranet.
94   Multicast provides real time interactive online meetings or podcasts,
95   IPTV, and financial markets real-time data, which all have a reliance
96   on UDP's unreliable transport.  End-to-end QOS, therefore, should be
97   a critical component of multicast deployment in order to provide a
98   good end user experience.  In multicast real-time media streaming,
99   loss of a single packet containing a reference frame can result in
100   the inability of thousands of receivers to decode a whole sequence of
101   packets called Group-of-Picture, introducing black picture for
102   periods of a few seconds.  Unexpected long delay in propagation of a
103   packet in such real-time media streaming may equally result in the
104   packet not being received and create the same results.  Multicast
105   packet drops and delay can therefore severely affect the application
106   performance and user experience.

[minor]
This section seems to flow not so well when reading and observations are made with seemingly handwaiving to what is believed well known artifacts. In general i think this paragraph tries to describe that mcast uses UDP and that it is inherently unreliable, and that a single packet loss may result in amplified impacts across many receivers. Not only streaming servies should maybe be flaged, but loss of single packet in financial envirenment (it was mentioned in the text, but not mentioned in the negative imacts) may cause a wrong tick and inequality between brokers using such data. For these, some informative references could be appreciated.

What about following initial rewrite, assuming references are added during later moment:
"Multicast has numerous use case environments, including residential broadband services across operator networks, private MPLS customer networks, and internal corporate intranets. It enables applications such as real-time interactive online meetings, podcasts, IPTV, and financial market real-time data feeds, all of which rely on the unreliable transport of UDP.

To ensure a positive end-user experience, superior end-to-end Quality of Service (QoS) is essential in multicast deployments. In multicast real-time media streaming, the loss of a single packet containing a reference frame can prevent thousands of receivers from decoding an entire sequence of packets, known as a Group-of-Pictures (GoP), resulting in a black screen for several seconds. Similarly, unexpected delays in packet propagation can cause packets to be received late or not at all, leading to the same issues. Therefore, packet drops and delays in multicast streaming can significantly degrade application performance and user experience.
"

108   It is important to monitor the performance of the multicast traffic.
109   New on-path telemetry techniques such as In-situ OAM (IOAM)
110   [RFC9197], IOAM Direct Export (DEX) [RFC9326] IOAM Marking-based
111   Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid
112   Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step] are useful and
113   complementary to the existing active OAM performance monitoring
114   methods (e.g., ICMP ping [RFC0792]), provide promising means to
115   directly monitor the network experience of multicast traffic.
116   However, multicast traffic has some unique characteristics which pose
117   some challenges on applying such techniques in an efficient way.

[minor]
Fixed some typos and readability in the textblob with following proposal:
"
It is essential to monitor the performance of multicast traffic. New on-path telemetry techniques, such as In-situ OAM (IOAM) [RFC9197], IOAM Direct Export (DEX) [RFC9326], IOAM Marking-based Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step], complement existing active OAM performance monitoring methods like ICMP ping [RFC0792]. These techniques offer promising means to directly monitor multicast traffic. However, multicast traffic's unique characteristics present challenges in applying these techniques efficiently.
"

119   The IP multicast packet data for a particular (S, G) state is
120   identical from one branch to another on its way to multiple
121   receivers.  When adding IOAM trace data to multicast packets, each
122   replicated packet would keep the telemetry data for its entire
123   forwarding path.  Since the replicated packets all share common path
124   segments, redundant data will be collected for the same original
125   multicast packet.  Such redundancy consumes extra network bandwidth
126   unnecessarily.  For a large multicast tree, such redundancy is
127   considerable.  Alternatively, it could be more efficient to collect
128   the telemetry data using solutions such as IOAM DEX to eliminate the
129   data redundancy.  However, IOAM DEX lacks a branch identifier, making
130   telemetry data correlation and multicast-tree reconstruction
131   difficult.

[minor]
Fixing some typos and making the text flow easier to read. THis could use an example of how such IOAM trace data is redundant. for common segments.

"The IP multicast packet data for a particular (S, G) state remains identical across different branches to multiple receivers. When IOAM trace data is added to multicast packets, each replicated packet retains telemetry data for its entire forwarding path. This results in redundant data collection for common path segments, unnecessarily consuming extra network bandwidth. For large multicast trees, this redundancy is substantial. Using solutions like IOAM DEX could be more efficient by eliminating data redundancy, but IOAM DEX lacks a branch identifier, complicating telemetry data correlation and multicast tree reconstruction.
"

140 2.  Requirements for Multicast Traffic Telemetry

142   Multicast traffic is forwarded through a multicast tree.  With PIM
143   and P2MP, the forwarding tree is established and maintained by the
144   multicast routing protocol.  With BIER, no state is created in the
145   network to establish a forwarding tree; instead, a bier header
146   provides the necessary information for each packet to know the egress
147   points.  Multicast packets are only replicated at each tree branch
148   fork node for efficiency.

[major]
This sections discusses various technologies to build mcast trees, however not all of them are mentioned. Maybe the following can be added in addition to BIER to make the overview more complete.

#PIM-SM (Protocol Independent Multicast - Sparse Mode):
* Builds shared trees rooted at a Rendezvous Point (RP) and can switch to source-based trees for more efficient delivery.

#PIM-DM (Protocol Independent Multicast - Dense Mode):
* Initially floods multicast traffic to all nodes and then prunes back the unwanted branches.

#CBT (Core-Based Tree):
* Constructs a shared tree rooted at a core router, minimizing state information in the network.
* RFC 2189: "Core Based Trees (CBT) Multicast Routing Architecture"
* RFC 2201: "Core Based Trees (CBT) Multicast Routing Protocol Specification"

#DVMRP (Distance Vector Multicast Routing Protocol):
* Uses distance vector algorithms to build source-based trees, suitable for small to medium-sized networks.

#MOSPF (Multicast Extensions to OSPF):
* Extends OSPF to support multicast by building source-based trees.

#Bidir-PIM (Bidirectional PIM):
* Builds bidirectional shared trees to support efficient many-to-many communication.

#SR Replication Segments (SR-MPLS and SRv6 (work in progress)

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

[minor]
s/a few of which are/a non exclusive list is/

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

153   *  Reconstruct and visualize the multicast tree through data plane
154       monitoring.

156   *  Gather the multicast packet delay and jitter performance on each
157       path.

159   *  Find the multicast packet drop location and reason.

161   *  Gather the VPN state and tunnel information in case of P2MP
162       multicast.

[major]
This list was created with the solution being proposed already in mind and what it intends to fullfill for multicast. It is as result not fully objective list.

I believe important for multicast telemetry is also:

Scalability:
* Handle large-scale networks with numerous multicast groups and receivers.
Minimal Overhead:
* Ensure telemetry collection does not significantly impact network performance or consume excessive bandwidth.
Real-Time Data Collection:
* Provide timely insights for monitoring and troubleshooting.
Accuracy and Precision:
* Capture detailed and accurate network performance metrics.
Compatibility:
* Integrate with existing network protocols and telemetry systems.
Security:
* Protect telemetry data from unauthorized access and tampering.
Support for Various Telemetry Techniques:

164   In order to meet these requirements, we need the ability to directly
165   monitor the multicast traffic and derive data from the multicast
166   packets.  The conventional OAM mechanisms, such as multicast ping
167   [RFC6450] and trace [RFC8487], are not sufficient to meet these
168   requirements.

[minor]
I believe there is more to the eye then what is listed here. Maybe this can be taken as an opportunity to isolate from the many requirements those requirements that are addressed by the proposed solution? THis will lead to a more objective document by showing requiremets that were maybe not met.

184   If the IOAM trace option is used for on-path data collection, the
185   partial trace data will also be replicated into the packet copy for
186   each branch.  The end result is that, at the multicast tree leaves,
187   each copy of the multicast packet has a complete trace.  Most of the
188   data (except data from the last leaf branch) appear in multiple
189   copies while only one copy is sufficient.  Data redundancy introduces
190   unnecessary header overhead, wastes network bandwidth, and
191   complicates the data processing.  The larger the multicast tree, or
192   the longer the multicast path, the more severe the redundancy problem
193   becomes.

[minor]
The following rewrite provides a flow that is easier to read
"When the IOAM trace option is utilized for on-path data collection, partial trace data is replicated into the packet copy for each branch of the multicast tree. Consequently, at the leaves of the multicast tree, each copy of the multicast packet contains a complete trace. This results in data redundancy, as most of the data (except from the final leaf branch) appears in multiple copies, where only one is sufficient. This redundancy introduces unnecessary header overhead, wastes network bandwidth, and complicates data processing. The larger the multicast tree or the longer the multicast path, the more severe the redundancy problem becomes.
"

195   The postcard-based solutions (e.g., IOAM DEX), can be used to
196   eliminate such data redundancy, because each node on the tree only
197   sends a postcard covering local data.  However, they cannot track and
198   correlate the tree branches properly due to the lack of branching
199   information, so they can bring confusion about the multicast tree
200   topology.  For example, in a multicast tree, Node A has two branches,
201   one to Node B and the other to node C; further, Node B leads to Node
202   D and Node C leads to Node E.  When applying postcard-based methods,
203   one cannot tell whether or not Node D(E) is the next hop of Node B(C)
204   from the received postcards alone, unless one correlates the
205   exporting nodes with knowledge about the tree collected by other
206   means (e.g., mtrace).  Such correlation is undesirable because it
207   introduces extra work and complexity.

[major]
It is unclear what the D(E) and/or the B(C) is representing. I can guess what it
means, but for a standards track document guessing is discouraged

Would the following description be correct analysis?
"The postcard-based solutions, such as IOAM Direct Export (DEX), can eliminate data redundancy because each node on the multicast tree sends a postcard with only local data. However, these methods cannot accurately track and correlate tree branches due to the absence of branching information. For instance, in a multicast tree where Node A branches to Node B and Node C, and further, Node B leads to Node D and Node C leads to Node E, it is impossible to determine from postcards alone whether Node D is a continuation of Node B or Node C. This ambiguity necessitates additional correlation using external knowledge about the tree, such as through mtrace, which introduces extra complexity and effort.
"

213 4.  Modifications to Existing Solutions

215   We provide two solutions to address the above issues.  One is based
216   on IOAM DEX and requires an extension to the instruction header of
217   the IOAM DEX Option.  The second solution combines the IOAM trace
218   option and postcards for redundancy removal.

[major]
Two solutions for the same problem in a single standards track document seems to make it not trivial to fully implement the proposed standard. Would it make sense to flag one proposal as the preferred one and the other as the less preferred one? or maybe ballot conditions for when the first proposal is preferred above the second proposal and visa versa?
What are the pro's and cons of each?

220 4.1.  Per-hop postcard using IOAM DEX

222   One way to mitigate the postcard-based telemetry's tree tracking
223   weakness is to augment it with a branch identifier field.  Note that

[major]
Not being overly familiar with IOAM, is this intended for each single packet of the mcast flow?
or will this logic happen for a subset of identified packets? processing each packet seems not trivial in high volume mcast flows? This could be a major operational usage issue

265   Conforming to the node ID specification in IOAM [RFC9197], the node
266   ID is a 3-octet unsigned integer.  The interface index is a two-octet
267   unsigned integer.  As shown in Figure 2, the branch ID consumes 8
268   octets in total.  The three unused octets MUST be set to 0.

[major]
What to do if the recipient gets these and they are not set to 0? drop, process, alert, etc?
What if there are so many interfaces resulting in Interface index overflow?

280   Figure 3 shows that the branch ID is carried as an optional field
281   after the flow ID and sequence number optional fields in the IOAM DEX
282   option header.  Two bits "N" and "I" (i.e., the third and fourth bits
283   in the Extension-Flags field) are reserved to indicate the presence
284   of the optional branch ID field.  "N" stands for the Node ID and "I"
285   stands for the interface index.  If "N" and "I" are both set to 1,
286   the optional multicast branch ID field is present; otherwise it is
287   absent.

[major]
It was not entirely clear why exactly these bits were selected? And why there are two bits?
Would a single bit not be good enough? with 2 bits there are 4 states possible, and only one causes that the information is present. What in the other three states? what if the info is there but shouldn't or what if it should be there, but the branch info is not? What happens in those situations

311 4.2.  Per-section postcard for IOAM Trace

[minor]
Maybe the intend of what postcard based IOAM trace can be helpful for a reader of the specification. What about adding something as the following proposed section
"
The postcard-based method for IOAM trace works by each node in the network independently sending "postcards," which are packets containing telemetry data about the packet processing at that specific node. These postcards are sent directly to a collection system and not carried within the data packet itself. This method eliminates redundancy because each node only reports its own data, but it also introduces challenges in reconstructing the full path and topology of the multicast tree due to the lack of inherent branching information in the individual postcards. This reconstruction often requires additional correlation using external tools or data, adding complexity.
"

313   The second solution is a combination of the IOAM trace option and the
314   postcard-based telemetry.  To avoid data redundancy, at each branch
315   fork node, the trace data accumulated up to this node is exported by
316   a postcard before the packet is replicated.  In this solution, each

[major]
How is this achievable for high volume mcast flows to retain each single packet, processing the telemetry and then forwarding once all telemetry is processed. Maybe this solution is intended for low volume mcast? (assuming there is some identification what low volume means for the branch node).


320   the trace of each branch.  This is also necessary because each
321   replicated multicast packet can have different telemetry data
322   pertaining to this particular copy (e.g., node delay, egress
323   timestamp, and egress interface).  As a consequence, the local data
324   exported by each branch fork node can only contain partial data
325   (e.g., ingress interface and ingress timestamp).

[major]
This text does not truly compute for me. postcards are not carried within
the packet itself, but sent independently. Hence i am slightly lost how this causes different telemetry for each copy? is that not always the situation, replicated or not?

353   There is no need to modify the IOAM trace option header format as
354   specified in [RFC9197].  We just need to configure the branch fork
355   nodes to export the postcards and refresh the IOAM header and data
356   (e.g., clear the node data list and reset the Remaining Length
357   field).

[minor]
Does this means that everything required for this to work already exists? If no, then what piece of encoding and formal procedures is missing?

[major]
What does the formal procedure to clear node data list and length fields exactly mean?

359 5.  Application Considerations for Multicast Protocols

[major]
What about segment routing replication segments?
https://www.rfc-editor.org/rfc/rfc9524.html
There seems some ongoing work wrt SRV6 replication segments (currently still work in progress and expired, but nevertheless one can expect this to be developed sooner or later)

From a high level perspective this sections seems slightly overkill and i am not sure it adds a lot of value. Maybe i am missing a introduction of what this section is all about?. If this is saying that telemetry can be used for these types of tunnels, is there then need for so much text and acronyms? Why not simply list all of them and reduce the complete section to some bullet points?

366   diagnostic information.  Unlike unicast traceroute, Mtrace2 traces
367   the path that the tree building messages follow from receiver to
368   source.  It is usually initiated from an Mtrace2 client by sending an

[minor]
These follow the control plane messages to build the tree? for all tree building technologies?
how would that work for things like MOSPF for example? Maybe there is assumption that this is some PIM style of messaging involved?

382   status data through direct measurements.  There are various multicast
383   protocols that are used to forward the multicast data.  Each will
384   require their own unique on-path telemetry solution.  Mtrace2 doesn't

[minor]
I am not sure what this is saying exactly with 'multicast data'. I assume that this is saying that there are multiple multicast protocols to build forwarding trees? or is the 'multicast data' referring to something else?

388 5.2.  Application in PIM

[major]
What about IPv6
What about PIM-BIDIR? PIM-DM (even though it is non-optimal technology)
I am not sure what the intend of this section is? Is it only to say the telemetry can be useful when PIM is used?

405 5.3.  Application of MVPN X-PMSI Tunnel Encapsulation Attribute

[major]
What is this section trying to achieve? so many acronyms and very different tunnel types.

433 5.4.  Application in BIER

[major]
This section is not providing an IOAM procedures, but seems to be saying that there are BIER requirements and that there is possibility for adding additional metadata in the BIER headers. However no formal procedures are provided, but only indicated. if there are formal procedures to make such mapping, then that should be made explicitly cristal clear in the prescriptive text on how to achieve such

454 6.  Security Considerations

[minor]
high volume mcast streams can be filling up BW very rapidly. IOAM sampling will be important to protect the infrastructure
2024-06-04
10 Gunter Van de Velde Ballot comment and discuss text updated for Gunter Van de Velde
2024-06-03
10 (System) Changed action holders to Warren Kumari (IESG state changed)
2024-06-03
10 (System) Sub state has been changed to AD Followup from Revised I-D Needed
2024-06-03
10 (System) IANA Review state changed to Version Changed - Review Needed from IANA OK - Actions Needed
2024-06-03
10 Mike McBride New version available: draft-ietf-mboned-multicast-telemetry-10.txt
2024-06-03
10 Mike McBride New version accepted (logged-in submitter: Mike McBride)
2024-06-03
10 Mike McBride Uploaded new revision
2024-05-31
09 Jim Guichard [Ballot Position Update] New position, No Objection, has been recorded for Jim Guichard
2024-05-30
09 (System) Changed action holders to Haoyu Song, Mike McBride, Greg Mirsky, Gyan Mishra, Hitoshi Asaeda, Tianran Zhou (IESG state changed)
2024-05-30
09 Jenny Bui IESG state changed to IESG Evaluation::Revised I-D Needed from IESG Evaluation
2024-05-30
09 Zaheduzzaman Sarker
[Ballot comment]
Thanks for working on this specification. Thanks to Bernard Aboba for his TSVART review. I can see resolutions have been reached to improve …
[Ballot comment]
Thanks for working on this specification. Thanks to Bernard Aboba for his TSVART review. I can see resolutions have been reached to improve the document but they are not present in the current version of this document. I am relaying on the responsible AD to make sure the resolutions are reflected in the futuere versions of the document before it gets approved. Hence not holding a Discuss on those points.

I have an additional comment-

- The first paragraph of the introduction appeared to be describing multicast video scnenarios which can be realized over the Internet. The lack of relevance to IOAM context gives a feeling that this specification is addresing something that is out of scope. I would suggest that we explicitly make the scenario description related to scope of the IOAN operations. If this is intented to be used over the Internet then we have issue here.
2024-05-30
09 Zaheduzzaman Sarker [Ballot Position Update] New position, No Objection, has been recorded for Zaheduzzaman Sarker
2024-05-30
09 Francesca Palombini [Ballot Position Update] New position, No Objection, has been recorded for Francesca Palombini
2024-05-29
09 Murray Kucherawy
[Ballot comment]
The shepherd writeup says:

"As per RFC7322, it might be appropriate to name one or two editors.

Rather than having six authors, …
[Ballot comment]
The shepherd writeup says:

"As per RFC7322, it might be appropriate to name one or two editors.

Rather than having six authors, was this ever considered?

Just a suggestion: There's very light use of BCP 14 here.  You could probably actually get rid of it.
2024-05-29
09 Murray Kucherawy [Ballot Position Update] New position, No Objection, has been recorded for Murray Kucherawy
2024-05-28
09 Paul Wouters [Ballot Position Update] New position, No Objection, has been recorded for Paul Wouters
2024-05-28
09 Roman Danyliw [Ballot comment]
Thank you to Roni Even for the GENART review.
2024-05-28
09 Roman Danyliw [Ballot Position Update] New position, No Objection, has been recorded for Roman Danyliw
2024-05-27
09 Orie Steele [Ballot Position Update] New position, No Objection, has been recorded for Orie Steele
2024-05-27
09 Gunter Van de Velde
[Ballot discuss]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-09.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Thanks for writing up this …
[Ballot discuss]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-09.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Thanks for writing up this draft and to start introducing Telemetry to multicast.

One of the items that confused me when reading trough the document was that the I and N bit were new. I have troubles understanding why a single bit is not sufficient? There are not that many flag field available, hence being conservatives is not a bad habit. Also, what is recipient node to do if t received the node-id/interface-id when it should not or if if it should receive it, but it wasn't added?

The section about Applicability introduced me some confusion on what it was trying to achieve. If the intent is to say that the introduced technology procedures can be used on these multicast control plane technologies, then why not have a short list without all the details? It makes it hard read, especially the X-PMSI section (so many acronyms in that section, not all have a reference i think)

Below you find 6 different DISCUSS items to be looked at and to see how to resolve. I think some will be easy to resolve, others may be less trivial.

And finally, in the COMMENTS section i have added a series of comments with additional context and classified them into [minor] and [major].

I hope this review and the various observations provide a way to help improve the document.

G/

#DISCUSS items
#=============
##DISCUSS1
Some multicast tree bilding technologies have been mentioned, while another set was silently ignored (maybe due to historical or lesser used?)
Can these be taken into the story flow and mentioned if considered, not considered or deemend irrelivant for Telemetry extensions?

i.e. PIM-SM (Protocol Independent Multicast - Sparse Mode), PIM-DM (Protocol Independent Multicast - Dense Mode), CBT (Core-Based Tree), DVMRP (Distance Vector Multicast Routing Protocol), MOSPF (Multicast Extensions to OSPF), Bidir-PIM (Bidirectional PIM), SR Replication Segments (SR-MPLS and SRv6 (work in progress)

##DISCUSS2
It is unclear if the 2nd method documented in the section "Modifications to Existing Solutions" needs modification. Maybe the exact nature of the modification can be more explicit documented?

##DISCUSS3
When "Per-hop postcard using IOAM DEX" is used and per hop it seems operationally desireable to achieve such based upon sampled packets within a multicast flow. The sampling requirements for multicast may be different from unicast traffic. This is not discussed and considered. Is there a reason it is not discussed?

###DISCUSS4
How is postcard based telemetry achievable for high volume mcast flows when retaining each single packet, then process the telemetry and finally forwarding once all telemetry is processed. Maybe this solution is intended for low volme mcast? (assuming there is some identification what low volume means for the branch node).

##DISCUSS5
Eric Vyncke pointed out that IPv6 needs to be considered, or at least not excluded. (i support his DISCUSS)

##DISCUSS6
The formal procedures when using BIER are a little light. The applicability section talks about "would be possible" or has handwaiving on the different encapsulation types of BIER. The various types shouuld maybe be explicit mentioned and associated formal procedures discussed?
2024-05-27
09 Gunter Van de Velde Ballot discuss text updated for Gunter Van de Velde
2024-05-27
09 Gunter Van de Velde
[Ballot comment]
#DETAILED COMMENTS
#=================
##classified as [minor] and [major]

91   Multicast has many use cases.  For example, it can be used by
92 …
[Ballot comment]
#DETAILED COMMENTS
#=================
##classified as [minor] and [major]

91   Multicast has many use cases.  For example, it can be used by
92   residential broadband customers across operator networks, private
93   MPLS customers, and internal customers within corporate intranet.
94   Multicast provides real time interactive online meetings or podcasts,
95   IPTV, and financial markets real-time data, which all have a reliance
96   on UDP's unreliable transport.  End-to-end QOS, therefore, should be
97   a critical component of multicast deployment in order to provide a
98   good end user experience.  In multicast real-time media streaming,
99   loss of a single packet containing a reference frame can result in
100   the inability of thousands of receivers to decode a whole sequence of
101   packets called Group-of-Picture, introducing black picture for
102   periods of a few seconds.  Unexpected long delay in propagation of a
103   packet in such real-time media streaming may equally result in the
104   packet not being received and create the same results.  Multicast
105   packet drops and delay can therefore severely affect the application
106   performance and user experience.

[minor]
This section seems to flow not so well when reading and observations are made with seemingly handwaiving to what is believed well known artifacts. In general i think this paragraph tries to describe that mcast uses UDP and that it is inherently unreliable, and that a single packet loss may result in amplified impacts across many receivers. Not only streaming servies should maybe be flaged, but loss of single packet in financial envirenment (it was mentioned in the text, but not mentioned in the negative imacts) may cause a wrong tick and inequality between brokers using such data. For these, some informative references could be appreciated.

What about following initial rewrite, assuming references are added during later moment:
"Multicast has numerous use case environments, including residential broadband services across operator networks, private MPLS customer networks, and internal corporate intranets. It enables applications such as real-time interactive online meetings, podcasts, IPTV, and financial market real-time data feeds, all of which rely on the unreliable transport of UDP.

To ensure a positive end-user experience, superior end-to-end Quality of Service (QoS) is essential in multicast deployments. In multicast real-time media streaming, the loss of a single packet containing a reference frame can prevent thousands of receivers from decoding an entire sequence of packets, known as a Group-of-Pictures (GoP), resulting in a black screen for several seconds. Similarly, unexpected delays in packet propagation can cause packets to be received late or not at all, leading to the same issues. Therefore, packet drops and delays in multicast streaming can significantly degrade application performance and user experience.
"

108   It is important to monitor the performance of the multicast traffic.
109   New on-path telemetry techniques such as In-situ OAM (IOAM)
110   [RFC9197], IOAM Direct Export (DEX) [RFC9326] IOAM Marking-based
111   Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid
112   Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step] are useful and
113   complementary to the existing active OAM performance monitoring
114   methods (e.g., ICMP ping [RFC0792]), provide promising means to
115   directly monitor the network experience of multicast traffic.
116   However, multicast traffic has some unique characteristics which pose
117   some challenges on applying such techniques in an efficient way.

[minor]
Fixed some typos and readability in the textblob with following proposal:
"
It is essential to monitor the performance of multicast traffic. New on-path telemetry techniques, such as In-situ OAM (IOAM) [RFC9197], IOAM Direct Export (DEX) [RFC9326], IOAM Marking-based Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step], complement existing active OAM performance monitoring methods like ICMP ping [RFC0792]. These techniques offer promising means to directly monitor multicast traffic. However, multicast traffic's unique characteristics present challenges in applying these techniques efficiently.
"

119   The IP multicast packet data for a particular (S, G) state is
120   identical from one branch to another on its way to multiple
121   receivers.  When adding IOAM trace data to multicast packets, each
122   replicated packet would keep the telemetry data for its entire
123   forwarding path.  Since the replicated packets all share common path
124   segments, redundant data will be collected for the same original
125   multicast packet.  Such redundancy consumes extra network bandwidth
126   unnecessarily.  For a large multicast tree, such redundancy is
127   considerable.  Alternatively, it could be more efficient to collect
128   the telemetry data using solutions such as IOAM DEX to eliminate the
129   data redundancy.  However, IOAM DEX lacks a branch identifier, making
130   telemetry data correlation and multicast-tree reconstruction
131   difficult.

[minor]
Fixing some typos and making the text flow easier to read. THis could use an example of how such IOAM trace data is redundant. for common segments.

"The IP multicast packet data for a particular (S, G) state remains identical across different branches to multiple receivers. When IOAM trace data is added to multicast packets, each replicated packet retains telemetry data for its entire forwarding path. This results in redundant data collection for common path segments, unnecessarily consuming extra network bandwidth. For large multicast trees, this redundancy is substantial. Using solutions like IOAM DEX could be more efficient by eliminating data redundancy, but IOAM DEX lacks a branch identifier, complicating telemetry data correlation and multicast tree reconstruction.
"

140 2.  Requirements for Multicast Traffic Telemetry

142   Multicast traffic is forwarded through a multicast tree.  With PIM
143   and P2MP, the forwarding tree is established and maintained by the
144   multicast routing protocol.  With BIER, no state is created in the
145   network to establish a forwarding tree; instead, a bier header
146   provides the necessary information for each packet to know the egress
147   points.  Multicast packets are only replicated at each tree branch
148   fork node for efficiency.

[major]
This sections discusses various technologies to build mcast trees, however not all of them are mentioned. Maybe the following can be added in addition to BIER to make the overview more complete.

#PIM-SM (Protocol Independent Multicast - Sparse Mode):
* Builds shared trees rooted at a Rendezvous Point (RP) and can switch to source-based trees for more efficient delivery.

#PIM-DM (Protocol Independent Multicast - Dense Mode):
* Initially floods multicast traffic to all nodes and then prunes back the unwanted branches.

#CBT (Core-Based Tree):
* Constructs a shared tree rooted at a core router, minimizing state information in the network.
* RFC 2189: "Core Based Trees (CBT) Multicast Routing Architecture"
* RFC 2201: "Core Based Trees (CBT) Multicast Routing Protocol Specification"

#DVMRP (Distance Vector Multicast Routing Protocol):
* Uses distance vector algorithms to build source-based trees, suitable for small to medium-sized networks.

#MOSPF (Multicast Extensions to OSPF):
* Extends OSPF to support multicast by building source-based trees.

#Bidir-PIM (Bidirectional PIM):
* Builds bidirectional shared trees to support efficient many-to-many communication.

#SR Replication Segments (SR-MPLS and SRv6 (work in progress)

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

[minor]
s/a few of which are/a non exclusive list is/

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

153   *  Reconstruct and visualize the multicast tree through data plane
154       monitoring.

156   *  Gather the multicast packet delay and jitter performance on each
157       path.

159   *  Find the multicast packet drop location and reason.

161   *  Gather the VPN state and tunnel information in case of P2MP
162       multicast.

[major]
This list was created with the solution being proposed already in mind and what it intends to fullfill for multicast. It is as result not fully objective list.

I believe important for multicast telemetry is also:

Scalability:
* Handle large-scale networks with numerous multicast groups and receivers.
Minimal Overhead:
* Ensure telemetry collection does not significantly impact network performance or consume excessive bandwidth.
Real-Time Data Collection:
* Provide timely insights for monitoring and troubleshooting.
Accuracy and Precision:
* Capture detailed and accurate network performance metrics.
Compatibility:
* Integrate with existing network protocols and telemetry systems.
Security:
* Protect telemetry data from unauthorized access and tampering.
Support for Various Telemetry Techniques:

164   In order to meet these requirements, we need the ability to directly
165   monitor the multicast traffic and derive data from the multicast
166   packets.  The conventional OAM mechanisms, such as multicast ping
167   [RFC6450] and trace [RFC8487], are not sufficient to meet these
168   requirements.

[minor]
I believe there is more to the eye then what is listed here. Maybe this can be taken as an opportunity to isolate from the many requirements those requirements that are addressed by the proposed solution? THis will lead to a more objective document by showing requiremets that were maybe not met.

184   If the IOAM trace option is used for on-path data collection, the
185   partial trace data will also be replicated into the packet copy for
186   each branch.  The end result is that, at the multicast tree leaves,
187   each copy of the multicast packet has a complete trace.  Most of the
188   data (except data from the last leaf branch) appear in multiple
189   copies while only one copy is sufficient.  Data redundancy introduces
190   unnecessary header overhead, wastes network bandwidth, and
191   complicates the data processing.  The larger the multicast tree, or
192   the longer the multicast path, the more severe the redundancy problem
193   becomes.

[minor]
The following rewrite provides a flow that is easier to read
"When the IOAM trace option is utilized for on-path data collection, partial trace data is replicated into the packet copy for each branch of the multicast tree. Consequently, at the leaves of the multicast tree, each copy of the multicast packet contains a complete trace. This results in data redundancy, as most of the data (except from the final leaf branch) appears in multiple copies, where only one is sufficient. This redundancy introduces unnecessary header overhead, wastes network bandwidth, and complicates data processing. The larger the multicast tree or the longer the multicast path, the more severe the redundancy problem becomes.
"

195   The postcard-based solutions (e.g., IOAM DEX), can be used to
196   eliminate such data redundancy, because each node on the tree only
197   sends a postcard covering local data.  However, they cannot track and
198   correlate the tree branches properly due to the lack of branching
199   information, so they can bring confusion about the multicast tree
200   topology.  For example, in a multicast tree, Node A has two branches,
201   one to Node B and the other to node C; further, Node B leads to Node
202   D and Node C leads to Node E.  When applying postcard-based methods,
203   one cannot tell whether or not Node D(E) is the next hop of Node B(C)
204   from the received postcards alone, unless one correlates the
205   exporting nodes with knowledge about the tree collected by other
206   means (e.g., mtrace).  Such correlation is undesirable because it
207   introduces extra work and complexity.

[major]
It is unclear what the D(E) and/or the B(C) is representing. I can guess what it
means, but for a standards track document guessing is discouraged

Would the following description be correct analysis?
"The postcard-based solutions, such as IOAM Direct Export (DEX), can eliminate data redundancy because each node on the multicast tree sends a postcard with only local data. However, these methods cannot accurately track and correlate tree branches due to the absence of branching information. For instance, in a multicast tree where Node A branches to Node B and Node C, and further, Node B leads to Node D and Node C leads to Node E, it is impossible to determine from postcards alone whether Node D is a continuation of Node B or Node C. This ambiguity necessitates additional correlation using external knowledge about the tree, such as through mtrace, which introduces extra complexity and effort.
"

213 4.  Modifications to Existing Solutions

215   We provide two solutions to address the above issues.  One is based
216   on IOAM DEX and requires an extension to the instruction header of
217   the IOAM DEX Option.  The second solution combines the IOAM trace
218   option and postcards for redundancy removal.

[major]
Two solutions for the same problem in a single standards track document seems to make it not trivial to fully implement the proposed standard. Would it make sense to flag one proposal as the preferred one and the other as the less preferred one? or maybe ballot conditions for when the first proposal is preferred above the second proposal and visa versa?
What are the pro's and cons of each?

220 4.1.  Per-hop postcard using IOAM DEX

222   One way to mitigate the postcard-based telemetry's tree tracking
223   weakness is to augment it with a branch identifier field.  Note that

[major]
Not being overly familiar with IOAM, is this intended for each single packet of the mcast flow?
or will this logic happen for a subset of identified packets? processing each packet seems not trivial in high volume mcast flows? This could be a major operational usage issue

265   Conforming to the node ID specification in IOAM [RFC9197], the node
266   ID is a 3-octet unsigned integer.  The interface index is a two-octet
267   unsigned integer.  As shown in Figure 2, the branch ID consumes 8
268   octets in total.  The three unused octets MUST be set to 0.

[major]
What to do if the recipient gets these and they are not set to 0? drop, process, alert, etc?
What if there are so many interfaces resulting in Interface index overflow?

280   Figure 3 shows that the branch ID is carried as an optional field
281   after the flow ID and sequence number optional fields in the IOAM DEX
282   option header.  Two bits "N" and "I" (i.e., the third and fourth bits
283   in the Extension-Flags field) are reserved to indicate the presence
284   of the optional branch ID field.  "N" stands for the Node ID and "I"
285   stands for the interface index.  If "N" and "I" are both set to 1,
286   the optional multicast branch ID field is present; otherwise it is
287   absent.

[major]
It was not entirely clear why exactly these bits were selected? And why there are two bits?
Would a single bit not be good enough? with 2 bits there are 4 states possible, and only one causes that the information is present. What in the other three states? what if the info is there but shouldn't or what if it should be there, but the branch info is not? What happens in those situations

311 4.2.  Per-section postcard for IOAM Trace

[minor]
Maybe the intend of what postcard based IOAM trace can be helpful for a reader of the specification. What about adding something as the following proposed section
"
The postcard-based method for IOAM trace works by each node in the network independently sending "postcards," which are packets containing telemetry data about the packet processing at that specific node. These postcards are sent directly to a collection system and not carried within the data packet itself. This method eliminates redundancy because each node only reports its own data, but it also introduces challenges in reconstructing the full path and topology of the multicast tree due to the lack of inherent branching information in the individual postcards. This reconstruction often requires additional correlation using external tools or data, adding complexity.
"

313   The second solution is a combination of the IOAM trace option and the
314   postcard-based telemetry.  To avoid data redundancy, at each branch
315   fork node, the trace data accumulated up to this node is exported by
316   a postcard before the packet is replicated.  In this solution, each

[major]
How is this achievable for high volume mcast flows to retain each single packet, processing the telemetry and then forwarding once all telemetry is processed. Maybe this solution is intended for low volume mcast? (assuming there is some identification what low volume means for the branch node).


320   the trace of each branch.  This is also necessary because each
321   replicated multicast packet can have different telemetry data
322   pertaining to this particular copy (e.g., node delay, egress
323   timestamp, and egress interface).  As a consequence, the local data
324   exported by each branch fork node can only contain partial data
325   (e.g., ingress interface and ingress timestamp).

[major]
This text does not truly compute for me. postcards are not carried within
the packet itself, but sent independently. Hence i am slightly lost how this causes different telemetry for each copy? is that not always the situation, replicated or not?

353   There is no need to modify the IOAM trace option header format as
354   specified in [RFC9197].  We just need to configure the branch fork
355   nodes to export the postcards and refresh the IOAM header and data
356   (e.g., clear the node data list and reset the Remaining Length
357   field).

[minor]
Does this means that everything required for this to work already exists? If no, then what piece of encoding and formal procedures is missing?

[major]
What does the formal procedure to clear node data list and length fields exactly mean?

359 5.  Application Considerations for Multicast Protocols

[major]
What about segment routing replication segments?
https://www.rfc-editor.org/rfc/rfc9524.html
There seems some ongoing work wrt SRV6 replication segments (currently still work in progress and expired, but nevertheless one can expect this to be developed sooner or later)

From a high level perspective this sections seems slightly overkill and i am not sure it adds a lot of value. Maybe i am missing a introduction of what this section is all about?. If this is saying that telemetry can be used for these types of tunnels, is there then need for so much text and acronyms? Why not simply list all of them and reduce the complete section to some bullet points?

366   diagnostic information.  Unlike unicast traceroute, Mtrace2 traces
367   the path that the tree building messages follow from receiver to
368   source.  It is usually initiated from an Mtrace2 client by sending an

[minor]
These follow the control plane messages to build the tree? for all tree building technologies?
how would that work for things like MOSPF for example? Maybe there is assumption that this is some PIM style of messaging involved?

382   status data through direct measurements.  There are various multicast
383   protocols that are used to forward the multicast data.  Each will
384   require their own unique on-path telemetry solution.  Mtrace2 doesn't

[minor]
I am not sure what this is saying exactly with 'multicast data'. I assume that this is saying that there are multiple multicast protocols to build forwarding trees? or is the 'multicast data' referring to something else?

388 5.2.  Application in PIM

[major]
What about IPv6
What about PIM-BIDIR? PIM-DM (even though it is non-optimal technology)
I am not sure what the intend of this section is? Is it only to say the telemetry can be useful when PIM is used?

405 5.3.  Application of MVPN X-PMSI Tunnel Encapsulation Attribute

[major]
What is this section trying to achieve? so many acronyms and very different tunnel types.

433 5.4.  Application in BIER

[major]
This section is not providing an IOAM procedures, but seems to be saying that there are BIER requirements and that there is possibility for adding additional metadata in the BIER headers. However no formal procedures are provided, but only indicated. if there are formal procedures to make such mapping, then that should be made explicitly cristal clear in the prescriptive text on how to achieve such

454 6.  Security Considerations

[minor]
high volume mcast streams can be filling up BW very rapidly. IOAM sampling will be important to protect the infrastructure
2024-05-27
09 Gunter Van de Velde Ballot comment text updated for Gunter Van de Velde
2024-05-27
09 Gunter Van de Velde
[Ballot comment]
#DETAILED COMMENTS
#=================
##classified as [minor] and [major]

91   Multicast has many use cases.  For example, it can be used by
92 …
[Ballot comment]
#DETAILED COMMENTS
#=================
##classified as [minor] and [major]

91   Multicast has many use cases.  For example, it can be used by
92   residential broadband customers across operator networks, private
93   MPLS customers, and internal customers within corporate intranet.
94   Multicast provides real time interactive online meetings or podcasts,
95   IPTV, and financial markets real-time data, which all have a reliance
96   on UDP's unreliable transport.  End-to-end QOS, therefore, should be
97   a critical component of multicast deployment in order to provide a
98   good end user experience.  In multicast real-time media streaming,
99   loss of a single packet containing a reference frame can result in
100   the inability of thousands of receivers to decode a whole sequence of
101   packets called Group-of-Picture, introducing black picture for
102   periods of a few seconds.  Unexpected long delay in propagation of a
103   packet in such real-time media streaming may equally result in the
104   packet not being received and create the same results.  Multicast
105   packet drops and delay can therefore severely affect the application
106   performance and user experience.

[minor]
This section seems to flow not so well when reading and observations are made with seemingly handwaiving to what is believed well known artifacts. In general i think this paragraph tries to describe that mcast uses UDP and that it is inherently unreliable, and that a single packet loss may result in amplified impacts across many receivers. Not only streaming servies should maybe be flaged, but loss of single packet in financial envirenment (it was mentioned in the text, but not mentioned in the negative imacts) may cause a wrong tick and inequality between brokers using such data. For these, some informative references could be appreciated.

What about following initial rewrite, assuming references are added during later moment:
"Multicast has numerous use case environments, including residential broadband services across operator networks, private MPLS customer networks, and internal corporate intranets. It enables applications such as real-time interactive online meetings, podcasts, IPTV, and financial market real-time data feeds, all of which rely on the unreliable transport of UDP.

To ensure a positive end-user experience, superior end-to-end Quality of Service (QoS) is essential in multicast deployments. In multicast real-time media streaming, the loss of a single packet containing a reference frame can prevent thousands of receivers from decoding an entire sequence of packets, known as a Group-of-Pictures (GoP), resulting in a black screen for several seconds. Similarly, unexpected delays in packet propagation can cause packets to be received late or not at all, leading to the same issues. Therefore, packet drops and delays in multicast streaming can significantly degrade application performance and user experience.
"

108   It is important to monitor the performance of the multicast traffic.
109   New on-path telemetry techniques such as In-situ OAM (IOAM)
110   [RFC9197], IOAM Direct Export (DEX) [RFC9326] IOAM Marking-based
111   Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid
112   Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step] are useful and
113   complementary to the existing active OAM performance monitoring
114   methods (e.g., ICMP ping [RFC0792]), provide promising means to
115   directly monitor the network experience of multicast traffic.
116   However, multicast traffic has some unique characteristics which pose
117   some challenges on applying such techniques in an efficient way.

[minor]
Fixed some typos and readability in the textblob with following proposal:
"
It is essential to monitor the performance of multicast traffic. New on-path telemetry techniques, such as In-situ OAM (IOAM) [RFC9197], IOAM Direct Export (DEX) [RFC9326], IOAM Marking-based Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step], complement existing active OAM performance monitoring methods like ICMP ping [RFC0792]. These techniques offer promising means to directly monitor multicast traffic. However, multicast traffic's unique characteristics present challenges in applying these techniques efficiently.
"

119   The IP multicast packet data for a particular (S, G) state is
120   identical from one branch to another on its way to multiple
121   receivers.  When adding IOAM trace data to multicast packets, each
122   replicated packet would keep the telemetry data for its entire
123   forwarding path.  Since the replicated packets all share common path
124   segments, redundant data will be collected for the same original
125   multicast packet.  Such redundancy consumes extra network bandwidth
126   unnecessarily.  For a large multicast tree, such redundancy is
127   considerable.  Alternatively, it could be more efficient to collect
128   the telemetry data using solutions such as IOAM DEX to eliminate the
129   data redundancy.  However, IOAM DEX lacks a branch identifier, making
130   telemetry data correlation and multicast-tree reconstruction
131   difficult.

[minor]
Fixing some typos and making the text flow easier to read. THis could use an example of how such IOAM trace data is redundant. for common segments.

"The IP multicast packet data for a particular (S, G) state remains identical across different branches to multiple receivers. When IOAM trace data is added to multicast packets, each replicated packet retains telemetry data for its entire forwarding path. This results in redundant data collection for common path segments, unnecessarily consuming extra network bandwidth. For large multicast trees, this redundancy is substantial. Using solutions like IOAM DEX could be more efficient by eliminating data redundancy, but IOAM DEX lacks a branch identifier, complicating telemetry data correlation and multicast tree reconstruction.
"

140 2.  Requirements for Multicast Traffic Telemetry

142   Multicast traffic is forwarded through a multicast tree.  With PIM
143   and P2MP, the forwarding tree is established and maintained by the
144   multicast routing protocol.  With BIER, no state is created in the
145   network to establish a forwarding tree; instead, a bier header
146   provides the necessary information for each packet to know the egress
147   points.  Multicast packets are only replicated at each tree branch
148   fork node for efficiency.

[major]
This sections discusses various technologies to build mcast trees, however not all of them are mentioned. Maybe the following can be added in addition to BIER to make the overview more complete.

PIM-SM (Protocol Independent Multicast - Sparse Mode):
* Builds shared trees rooted at a Rendezvous Point (RP) and can switch to source-based trees for more efficient delivery.
PIM-DM (Protocol Independent Multicast - Dense Mode):
* Initially floods multicast traffic to all nodes and then prunes back the unwanted branches.
CBT (Core-Based Tree):
* Constructs a shared tree rooted at a core router, minimizing state information in the network.
* RFC 2189: "Core Based Trees (CBT) Multicast Routing Architecture"
* RFC 2201: "Core Based Trees (CBT) Multicast Routing Protocol Specification"
DVMRP (Distance Vector Multicast Routing Protocol):
* Uses distance vector algorithms to build source-based trees, suitable for small to medium-sized networks.
MOSPF (Multicast Extensions to OSPF):
* Extends OSPF to support multicast by building source-based trees.
Bidir-PIM (Bidirectional PIM):
* Builds bidirectional shared trees to support efficient many-to-many communication.
SR Replication Segments (SR-MPLS and SRv6 (work in progress)

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

[minor]
s/a few of which are/a non exclusive list is/

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

153   *  Reconstruct and visualize the multicast tree through data plane
154       monitoring.

156   *  Gather the multicast packet delay and jitter performance on each
157       path.

159   *  Find the multicast packet drop location and reason.

161   *  Gather the VPN state and tunnel information in case of P2MP
162       multicast.

[major]
THis list was created with the solution being proposed already in mind and what it intends to fullfill for multicast. It is as result not fully objective list.

I believe important for multicast telemetry is also:

Scalability:
* Handle large-scale networks with numerous multicast groups and receivers.
Minimal Overhead:
* Ensure telemetry collection does not significantly impact network performance or consume excessive bandwidth.
Real-Time Data Collection:
* Provide timely insights for monitoring and troubleshooting.
Accuracy and Precision:
* Capture detailed and accurate network performance metrics.
Compatibility:
* Integrate with existing network protocols and telemetry systems.
Security:
* Protect telemetry data from unauthorized access and tampering.
Support for Various Telemetry Techniques:

164   In order to meet these requirements, we need the ability to directly
165   monitor the multicast traffic and derive data from the multicast
166   packets.  The conventional OAM mechanisms, such as multicast ping
167   [RFC6450] and trace [RFC8487], are not sufficient to meet these
168   requirements.

[minor]
I believe there is more to the eye then what is listed here. Maybe this can be taken as an opportunity to isolate from the many requirements those requirements that are addressed by the proposed solution? THis will lead to a more objective document by showing requiremets that were maybe not met.

184   If the IOAM trace option is used for on-path data collection, the
185   partial trace data will also be replicated into the packet copy for
186   each branch.  The end result is that, at the multicast tree leaves,
187   each copy of the multicast packet has a complete trace.  Most of the
188   data (except data from the last leaf branch) appear in multiple
189   copies while only one copy is sufficient.  Data redundancy introduces
190   unnecessary header overhead, wastes network bandwidth, and
191   complicates the data processing.  The larger the multicast tree, or
192   the longer the multicast path, the more severe the redundancy problem
193   becomes.

[minor]
The following rewrite provides a flow that is easier to read
"When the IOAM trace option is utilized for on-path data collection, partial trace data is replicated into the packet copy for each branch of the multicast tree. Consequently, at the leaves of the multicast tree, each copy of the multicast packet contains a complete trace. This results in data redundancy, as most of the data (except from the final leaf branch) appears in multiple copies, where only one is sufficient. This redundancy introduces unnecessary header overhead, wastes network bandwidth, and complicates data processing. The larger the multicast tree or the longer the multicast path, the more severe the redundancy problem becomes.
"

195   The postcard-based solutions (e.g., IOAM DEX), can be used to
196   eliminate such data redundancy, because each node on the tree only
197   sends a postcard covering local data.  However, they cannot track and
198   correlate the tree branches properly due to the lack of branching
199   information, so they can bring confusion about the multicast tree
200   topology.  For example, in a multicast tree, Node A has two branches,
201   one to Node B and the other to node C; further, Node B leads to Node
202   D and Node C leads to Node E.  When applying postcard-based methods,
203   one cannot tell whether or not Node D(E) is the next hop of Node B(C)
204   from the received postcards alone, unless one correlates the
205   exporting nodes with knowledge about the tree collected by other
206   means (e.g., mtrace).  Such correlation is undesirable because it
207   introduces extra work and complexity.

[major]
It is unclear what the D(E) and/or the B(C) is representing. I can guess what it
means, but for a standards track document guessing is discouraged

WOuld the following description be correct analysis?
"The postcard-based solutions, such as IOAM Direct Export (DEX), can eliminate data redundancy because each node on the multicast tree sends a postcard with only local data. However, these methods cannot accurately track and correlate tree branches due to the absence of branching information. For instance, in a multicast tree where Node A branches to Node B and Node C, and further, Node B leads to Node D and Node C leads to Node E, it is impossible to determine from postcards alone whether Node D is a continuation of Node B or Node C. This ambiguity necessitates additional correlation using external knowledge about the tree, such as through mtrace, which introduces extra complexity and effort.
"

213 4.  Modifications to Existing Solutions

215   We provide two solutions to address the above issues.  One is based
216   on IOAM DEX and requires an extension to the instruction header of
217   the IOAM DEX Option.  The second solution combines the IOAM trace
218   option and postcards for redundancy removal.

[major]
TWo solutions for the same problem in a single standards track document seems to make it not trivial to fully implemen the proposed standard. WOuld it make sense to flag one proposal as the prefered one and the other as the less prefered one? or maybe ballot conditions for when the first proposal is prefered above the second proposal and visa versa?
WHat ar ethe pro's and cons of each?

220 4.1.  Per-hop postcard using IOAM DEX

222   One way to mitigate the postcard-based telemetry's tree tracking
223   weakness is to augment it with a branch identifier field.  Note that

[major]
Not being overly familiar with IOAM, is this intended for each single packet of the mcast flow?
or will this logic happen for a subset of identified packets? processing each packet seems nt trivial in high volume mcast flows? THis could be a major operational usage issue

265   Conforming to the node ID specification in IOAM [RFC9197], the node
266   ID is a 3-octet unsigned integer.  The interface index is a two-octet
267   unsigned integer.  As shown in Figure 2, the branch ID consumes 8
268   octets in total.  The three unused octets MUST be set to 0.

[major]
What to do if the recipient gets these and they are not set to 0? drop, process, alert, etc?
What if there are so many interfaces thaht the Interface index is overflow?

280   Figure 3 shows that the branch ID is carried as an optional field
281   after the flow ID and sequence number optional fields in the IOAM DEX
282   option header.  Two bits "N" and "I" (i.e., the third and fourth bits
283   in the Extension-Flags field) are reserved to indicate the presence
284   of the optional branch ID field.  "N" stands for the Node ID and "I"
285   stands for the interface index.  If "N" and "I" are both set to 1,
286   the optional multicast branch ID field is present; otherwise it is
287   absent.

[major]
It was not entirely clear why exactlly these bits were selected? And why there are two bits?
WOUld a single bit noy be good enough? with 2 bits there are 4 states possible, and oly one causes that the information is present. WHat in the other three states? what if the info is there but shouldn't or what if it should be there, but the branch info is not? WHat happens in those situations

311 4.2.  Per-section postcard for IOAM Trace

[minor]
Maybe the intend of what postcard based IOAM trace can be helpful for a reader of the specification. WHat about adding something as the following proposed section
"
The postcard-based method for IOAM trace works by each node in the network independently sending "postcards," which are packets containing telemetry data about the packet processing at that specific node. These postcards are sent directly to a collection system and not carried within the data packet itself. This method eliminates redundancy because each node only reports its own data, but it also introduces challenges in reconstructing the full path and topology of the multicast tree due to the lack of inherent branching information in the individual postcards. This reconstruction often requires additional correlation using external tools or data, adding complexity.
"

313   The second solution is a combination of the IOAM trace option and the
314   postcard-based telemetry.  To avoid data redundancy, at each branch
315   fork node, the trace data accumulated up to this node is exported by
316   a postcard before the packet is replicated.  In this solution, each

[major]
How is this achievable for high volume mcast flows to retain each single packet, processing the telemetry and then forwarding once all telemetry is processed. Maybe this solution is intended for low volme mcast? (assuming there is some identification what low volume means for the branch node).


320   the trace of each branch.  This is also necessary because each
321   replicated multicast packet can have different telemetry data
322   pertaining to this particular copy (e.g., node delay, egress
323   timestamp, and egress interface).  As a consequence, the local data
324   exported by each branch fork node can only contain partial data
325   (e.g., ingress interface and ingress timestamp).

[major]
This text does not truely compute for me. postcards are not carried within
the packet itself, but sent independently. Hence i am slightly lost how this causes different telemetry for each copy? is that not always the situation, replicated or not?

353   There is no need to modify the IOAM trace option header format as
354   specified in [RFC9197].  We just need to configure the branch fork
355   nodes to export the postcards and refresh the IOAM header and data
356   (e.g., clear the node data list and reset the Remaining Length
357   field).

[minor]
Does this means that everything required for this to work already exists? If no, then what piece of encoding and formal procedures is missing?

[major]
WHat does the formal procedure to clear node data list and length fields exactly mean?

359 5.  Application Considerations for Multicast Protocols

[major]
What about segment routing replication segments?
https://www.rfc-editor.org/rfc/rfc9524.html
There seems some ongoing work wrt SRV6 replication segments (currently still work in progress and expired, but nevertheless one can expect this to be developed sooner or later)

From a high level perspective this sections seems slightly overkill and i am not sure it adds alot of value. Maybe i am missing a introduction of what this section is all about?. If this is saying that telemetry can be used for these types of tunnels, is there then need for so much text and acronyms? WHy not simply list all of them and reduce the complete section to some bullet points?

366   diagnostic information.  Unlike unicast traceroute, Mtrace2 traces
367   the path that the tree building messages follow from receiver to
368   source.  It is usually initiated from an Mtrace2 client by sending an

[minor]
These follow the control plane messages to build the tree? for all tree building technologies?
how would that work for things like MOSPF for example? Maybe there is assumtion that this is some PIM style of messaging involved?

382   status data through direct measurements.  There are various multicast
383   protocols that are used to forward the multicast data.  Each will
384   require their own unique on-path telemetry solution.  Mtrace2 doesn't

[minor]
I am not sure what this is saying exactly with 'multicast data'. I assume that this is saying that there are multiple multicast protcols to build forwarding trees? or is the 'multicast data' refering to somethign else?

388 5.2.  Application in PIM

[major]
What about IPv6
What about PIM-BIDIR? PIM-DM (eventhough it is non-optimal technology)
I am not sure what the intend of this section is? Is it only to say the telemetry can be usefull when PIM is used?

405 5.3.  Application of MVPN X-PMSI Tunnel Encapsulation Attribute

[major]
What is this section trying to achieve? so many acronyms and very different tunnel types.

433 5.4.  Application in BIER

[major]
This section is not providing an IOAM procedures, but seems to be saying that there are BIER requirements and that there is possibility for adding additional metadata in the BIER headers. However no formal procedures are provided, but only indicated. if there are formal procedures to make such mapping, then that should be made explicitly cristal clear in the prescriptive text on how to acieve such

454 6.  Security Considerations

[minor]
high volume mcast streams can be filling up BW very rapidly. IOAM sampling will be important to protect the infrastructure
2024-05-27
09 Gunter Van de Velde Ballot comment text updated for Gunter Van de Velde
2024-05-27
09 Gunter Van de Velde
[Ballot discuss]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-09.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Thanks for writing up this …
[Ballot discuss]
# Gunter Van de Velde, RTG AD, comments for draft-ietf-mboned-multicast-telemetry-09.txt

Please find https://www.ietf.org/blog/handling-iesg-ballot-positions/ documenting the handling of ballots.

Thanks for writing up this draft and to start introducing Telemetry to multicast.

One of the items that confused me when reading trough the document was that the I and N bit were new. I have troubles understanding why a single bit is not sufficient? There are not that many flag field available, hence being sonservative is ot a bad habit. ALso, what is recipient node to do if t received the node-id/interface-id when it should not or if if it should receive it, but it wasn't added?

The section og Applicability introduced me some confusion on what it was trying to achieve. If the intent is to say that the introduced techology procedures can be used on these multicast ontrol plane technologies, then why not have a short list without all the details? It makes it hard read, especially the X-PMSI section (so many acronyms in that section, not all have a reference i think)

Below you find 6 different DISCUSS items to be looked at and to see how to resolve. I think some will be easy to resolve, others may be less trivial.

And finally, in the COMMENTS section i have added a series of comments with additional context and classified them into [minor] and [major].

I hope this review and the various observations provide a way to help improve the document.

G/

#DISCUSS items
#=============
##DISCUSS1
Some multicast tree bilding technologies have been mentioned, while another set was silently ignored (maybe due to historical or lesser used?)
Can these be taken into the story flow and mentioned if considered, not considered or deemend irrelivant for Telemetry extensions?

i.e. PIM-SM (Protocol Independent Multicast - Sparse Mode), PIM-DM (Protocol Independent Multicast - Dense Mode), CBT (Core-Based Tree), DVMRP (Distance Vector Multicast Routing Protocol), MOSPF (Multicast Extensions to OSPF), Bidir-PIM (Bidirectional PIM), SR Replication Segments (SR-MPLS and SRv6 (work in progress)

##DISCUSS2
It is unclear if the 2nd method documented in the section "Modifications to Existing Solutions" needs modification. Maybe the exact nature of the modification can be more explicit documented?

##DISCUSS3
When "Per-hop postcard using IOAM DEX" is used and per hop it seems operationally desireable to achieve such based upon sampled packets within a multicast flow. The sampling requirements for multicast may be different from unicast traffic. This is not discussed and considered. Is there a reason it is not discussed?

###DISCUSS4
How is postcard based telemetry achievable for high volume mcast flows when retaining each single packet, then process the telemetry and finally forwarding once all telemetry is processed. Maybe this solution is intended for low volme mcast? (assuming there is some identification what low volume means for the branch node).

##DISCUSS5
Eric Vyncke pointed out that IPv6 needs to be considered, or at least not excluded. (i support his DISCUSS)

##DISCUSS6
The formal procedures when using BIER are a little light. The applicability section talks about "would be possible" or has handwaiving on the different encapsulation types of BIER. The various types shouuld maybe be explicit mentioned and associated formal procedures discussed?
2024-05-27
09 Gunter Van de Velde
[Ballot comment]
#DETAILED COMMENTS
#=================
##classified as [minor] and [major]

91   Multicast has many use cases.  For example, it can be used by
92 …
[Ballot comment]
#DETAILED COMMENTS
#=================
##classified as [minor] and [major]

91   Multicast has many use cases.  For example, it can be used by
92   residential broadband customers across operator networks, private
93   MPLS customers, and internal customers within corporate intranet.
94   Multicast provides real time interactive online meetings or podcasts,
95   IPTV, and financial markets real-time data, which all have a reliance
96   on UDP's unreliable transport.  End-to-end QOS, therefore, should be
97   a critical component of multicast deployment in order to provide a
98   good end user experience.  In multicast real-time media streaming,
99   loss of a single packet containing a reference frame can result in
100   the inability of thousands of receivers to decode a whole sequence of
101   packets called Group-of-Picture, introducing black picture for
102   periods of a few seconds.  Unexpected long delay in propagation of a
103   packet in such real-time media streaming may equally result in the
104   packet not being received and create the same results.  Multicast
105   packet drops and delay can therefore severely affect the application
106   performance and user experience.

[minor]
This section seems to flow not so well when reading and observations are made with seemingly handwaiving to what is believed well known artifacts. In general i think this paragraph tries to describe that mcast uses UDP and that it is inherently unreliable, and that a single packet loss may result in amplified impacts across many receivers. Not only streaming servies should maybe be flaged, but loss of single packet in financial envirenment (it was mentioned in the text, but not mentioned in the negative imacts) may cause a wrong tick and inequality between brokers using such data. For these, some informative references could be appreciated.

What about following initial rewrite, assuming references are added during later moment:
"Multicast has numerous use cases environments, including residential broadband services across operator networks, private MPLS customer networks, and internal corporate intranets. It enables applications such as real-time interactive online meetings, podcasts, IPTV, and financial market real-time data feeds, all of which rely on the unreliable transport of UDP.

To ensure a positive end-user experience, superior end-to-end Quality of Service (QoS) is essential in multicast deployments. In multicast real-time media streaming, the loss of a single packet containing a reference frame can prevent thousands of receivers from decoding an entire sequence of packets, known as a Group-of-Pictures (GoP), resulting in a black screen for several seconds. Similarly, unexpected delays in packet propagation can cause packets to be received late or not at all, leading to the same issues. Therefore, packet drops and delays in multicast streaming can significantly degrade application performance and user experience.
"

108   It is important to monitor the performance of the multicast traffic.
109   New on-path telemetry techniques such as In-situ OAM (IOAM)
110   [RFC9197], IOAM Direct Export (DEX) [RFC9326] IOAM Marking-based
111   Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid
112   Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step] are useful and
113   complementary to the existing active OAM performance monitoring
114   methods (e.g., ICMP ping [RFC0792]), provide promising means to
115   directly monitor the network experience of multicast traffic.
116   However, multicast traffic has some unique characteristics which pose
117   some challenges on applying such techniques in an efficient way.

[minor]
Fixed some typos and readability in the textblob with following proposal:
"
It is essential to monitor the performance of multicast traffic. New on-path telemetry techniques, such as In-situ OAM (IOAM) [RFC9197], IOAM Direct Export (DEX) [RFC9326], IOAM Marking-based Postcard (PBT-M) [I-D.song-ippm-postcard-based-telemetry], and Hybrid Two-Step (HTS) [I-D.ietf-ippm-hybrid-two-step], complement existing active OAM performance monitoring methods like ICMP ping [RFC0792]. These techniques offer promising means to directly monitor multicast traffic. However, multicast traffic's unique characteristics present challenges in applying these techniques efficiently.
"

119   The IP multicast packet data for a particular (S, G) state is
120   identical from one branch to another on its way to multiple
121   receivers.  When adding IOAM trace data to multicast packets, each
122   replicated packet would keep the telemetry data for its entire
123   forwarding path.  Since the replicated packets all share common path
124   segments, redundant data will be collected for the same original
125   multicast packet.  Such redundancy consumes extra network bandwidth
126   unnecessarily.  For a large multicast tree, such redundancy is
127   considerable.  Alternatively, it could be more efficient to collect
128   the telemetry data using solutions such as IOAM DEX to eliminate the
129   data redundancy.  However, IOAM DEX lacks a branch identifier, making
130   telemetry data correlation and multicast-tree reconstruction
131   difficult.

[minor]
Fixing some typos and making the text flow easier to read. THis could use an example of how such IOAM trace data is redundant. for common segments.

"The IP multicast packet data for a particular (S, G) state remains identical across different branches to multiple receivers. When IOAM trace data is added to multicast packets, each replicated packet retains telemetry data for its entire forwarding path. This results in redundant data collection for common path segments, unnecessarily consuming extra network bandwidth. For large multicast trees, this redundancy is substantial. Using solutions like IOAM DEX could be more efficient by eliminating data redundancy, but IOAM DEX lacks a branch identifier, complicating telemetry data correlation and multicast tree reconstruction.
"

140 2.  Requirements for Multicast Traffic Telemetry

142   Multicast traffic is forwarded through a multicast tree.  With PIM
143   and P2MP, the forwarding tree is established and maintained by the
144   multicast routing protocol.  With BIER, no state is created in the
145   network to establish a forwarding tree; instead, a bier header
146   provides the necessary information for each packet to know the egress
147   points.  Multicast packets are only replicated at each tree branch
148   fork node for efficiency.

[major]
This sections discusses various technologies to build mcast trees, however not all of them are mentioned. Maybe the following can be added in addition to BIER to make the overview more complete.

PIM-SM (Protocol Independent Multicast - Sparse Mode):
* Builds shared trees rooted at a Rendezvous Point (RP) and can switch to source-based trees for more efficient delivery.
PIM-DM (Protocol Independent Multicast - Dense Mode):
* Initially floods multicast traffic to all nodes and then prunes back the unwanted branches.
CBT (Core-Based Tree):
* Constructs a shared tree rooted at a core router, minimizing state information in the network.
* RFC 2189: "Core Based Trees (CBT) Multicast Routing Architecture"
* RFC 2201: "Core Based Trees (CBT) Multicast Routing Protocol Specification"
DVMRP (Distance Vector Multicast Routing Protocol):
* Uses distance vector algorithms to build source-based trees, suitable for small to medium-sized networks.
MOSPF (Multicast Extensions to OSPF):
* Extends OSPF to support multicast by building source-based trees.
Bidir-PIM (Bidirectional PIM):
* Builds bidirectional shared trees to support efficient many-to-many communication.
SR Replication Segments (SR-MPLS and SRv6 (work in progress)

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

[minor]
s/a few of which are/a non exclusive list is/

150   There are several requirements for multicast traffic telemetry, a few
151   of which are:

153   *  Reconstruct and visualize the multicast tree through data plane
154       monitoring.

156   *  Gather the multicast packet delay and jitter performance on each
157       path.

159   *  Find the multicast packet drop location and reason.

161   *  Gather the VPN state and tunnel information in case of P2MP
162       multicast.

[major]
THis list was created with the solution being proposed already in mind and what it intends to fullfill for multicast. It is as result not fully objective list.

I believe important for multicast telemetry is also:

Scalability:
* Handle large-scale networks with numerous multicast groups and receivers.
Minimal Overhead:
* Ensure telemetry collection does not significantly impact network performance or consume excessive bandwidth.
Real-Time Data Collection:
* Provide timely insights for monitoring and troubleshooting.
Accuracy and Precision:
* Capture detailed and accurate network performance metrics.
Compatibility:
* Integrate with existing network protocols and telemetry systems.
Security:
* Protect telemetry data from unauthorized access and tampering.
Support for Various Telemetry Techniques:

164   In order to meet these requirements, we need the ability to directly
165   monitor the multicast traffic and derive data from the multicast
166   packets.  The conventional OAM mechanisms, such as multicast ping
167   [RFC6450] and trace [RFC8487], are not sufficient to meet these
168   requirements.

[minor]
I believe there is more to the eye then what is listed here. Maybe this can be taken as an opportunity to isolate from the many requirements those requirements that are addressed by the proposed solution? THis will lead to a more objective document by showing requiremets that were maybe not met.

184   If the IOAM trace option is used for on-path data collection, the
185   partial trace data will also be replicated into the packet copy for
186   each branch.  The end result is that, at the multicast tree leaves,
187   each copy of the multicast packet has a complete trace.  Most of the
188   data (except data from the last leaf branch) appear in multiple
189   copies while only one copy is sufficient.  Data redundancy introduces
190   unnecessary header overhead, wastes network bandwidth, and
191   complicates the data processing.  The larger the multicast tree, or
192   the longer the multicast path, the more severe the redundancy problem
193   becomes.

[minor]
The following rewrite provides a flow that is easier to read
"When the IOAM trace option is utilized for on-path data collection, partial trace data is replicated into the packet copy for each branch of the multicast tree. Consequently, at the leaves of the multicast tree, each copy of the multicast packet contains a complete trace. This results in data redundancy, as most of the data (except from the final leaf branch) appears in multiple copies, where only one is sufficient. This redundancy introduces unnecessary header overhead, wastes network bandwidth, and complicates data processing. The larger the multicast tree or the longer the multicast path, the more severe the redundancy problem becomes.
"

195   The postcard-based solutions (e.g., IOAM DEX), can be used to
196   eliminate such data redundancy, because each node on the tree only
197   sends a postcard covering local data.  However, they cannot track and
198   correlate the tree branches properly due to the lack of branching
199   information, so they can bring confusion about the multicast tree
200   topology.  For example, in a multicast tree, Node A has two branches,
201   one to Node B and the other to node C; further, Node B leads to Node
202   D and Node C leads to Node E.  When applying postcard-based methods,
203   one cannot tell whether or not Node D(E) is the next hop of Node B(C)
204   from the received postcards alone, unless one correlates the
205   exporting nodes with knowledge about the tree collected by other
206   means (e.g., mtrace).  Such correlation is undesirable because it
207   introduces extra work and complexity.

[major]
It is unclear what the D(E) and/or the B(C) is representing. I can guess what it
means, but for a standards track document guessing is discouraged

WOuld the following description be correct analysis?
"The postcard-based solutions, such as IOAM Direct Export (DEX), can eliminate data redundancy because each node on the multicast tree sends a postcard with only local data. However, these methods cannot accurately track and correlate tree branches due to the absence of branching information. For instance, in a multicast tree where Node A branches to Node B and Node C, and further, Node B leads to Node D and Node C leads to Node E, it is impossible to determine from postcards alone whether Node D is a continuation of Node B or Node C. This ambiguity necessitates additional correlation using external knowledge about the tree, such as through mtrace, which introduces extra complexity and effort.
"

213 4.  Modifications to Existing Solutions

215   We provide two solutions to address the above issues.  One is based
216   on IOAM DEX and requires an extension to the instruction header of
217   the IOAM DEX Option.  The second solution combines the IOAM trace
218   option and postcards for redundancy removal.

[major]
TWo solutions for the same problem in a single standards track document seems to make it not trivial to fully implemen the proposed standard. WOuld it make sense to flag one proposal as the prefered one and the other as the less prefered one? or maybe ballot conditions for when the first proposal is prefered above the second proposal and visa versa?
WHat ar ethe pro's and cons of each?

220 4.1.  Per-hop postcard using IOAM DEX

222   One way to mitigate the postcard-based telemetry's tree tracking
223   weakness is to augment it with a branch identifier field.  Note that

[major]
Not being overly familiar with IOAM, is this intended for each single packet of the mcast flow?
or will this logic happen for a subset of identified packets? processing each packet seems nt trivial in high volume mcast flows? THis could be a major operational usage issue

265   Conforming to the node ID specification in IOAM [RFC9197], the node
266   ID is a 3-octet unsigned integer.  The interface index is a two-octet
267   unsigned integer.  As shown in Figure 2, the branch ID consumes 8
268   octets in total.  The three unused octets MUST be set to 0.

[major]
What to do if the recipient gets these and they are not set to 0? drop, process, alert, etc?
What if there are so many interfaces thaht the Interface index is overflow?

280   Figure 3 shows that the branch ID is carried as an optional field
281   after the flow ID and sequence number optional fields in the IOAM DEX
282   option header.  Two bits "N" and "I" (i.e., the third and fourth bits
283   in the Extension-Flags field) are reserved to indicate the presence
284   of the optional branch ID field.  "N" stands for the Node ID and "I"
285   stands for the interface index.  If "N" and "I" are both set to 1,
286   the optional multicast branch ID field is present; otherwise it is
287   absent.

[major]
It was not entirely clear why exactlly these bits were selected? And why there are two bits?
WOUld a single bit noy be good enough? with 2 bits there are 4 states possible, and oly one causes that the information is present. WHat in the other three states? what if the info is there but shouldn't or what if it should be there, but the branch info is not? WHat happens in those situations

311 4.2.  Per-section postcard for IOAM Trace

[minor]
Maybe the intend of what postcard based IOAM trace can be helpful for a reader of the specification. WHat about adding something as the following proposed section
"
The postcard-based method for IOAM trace works by each node in the network independently sending "postcards," which are packets containing telemetry data about the packet processing at that specific node. These postcards are sent directly to a collection system and not carried within the data packet itself. This method eliminates redundancy because each node only reports its own data, but it also introduces challenges in reconstructing the full path and topology of the multicast tree due to the lack of inherent branching information in the individual postcards. This reconstruction often requires additional correlation using external tools or data, adding complexity.
"

313   The second solution is a combination of the IOAM trace option and the
314   postcard-based telemetry.  To avoid data redundancy, at each branch
315   fork node, the trace data accumulated up to this node is exported by
316   a postcard before the packet is replicated.  In this solution, each

[major]
How is this achievable for high volume mcast flows to retain each single packet, processing the telemetry and then forwarding once all telemetry is processed. Maybe this solution is intended for low volme mcast? (assuming there is some identification what low volume means for the branch node).


320   the trace of each branch.  This is also necessary because each
321   replicated multicast packet can have different telemetry data
322   pertaining to this particular copy (e.g., node delay, egress
323   timestamp, and egress interface).  As a consequence, the local data
324   exported by each branch fork node can only contain partial data
325   (e.g., ingress interface and ingress timestamp).

[major]
This text does not truely compute for me. postcards are not carried within
the packet itself, but sent independently. Hence i am slightly lost how this causes different telemetry for each copy? is that not always the situation, replicated or not?

353   There is no need to modify the IOAM trace option header format as
354   specified in [RFC9197].  We just need to configure the branch fork
355   nodes to export the postcards and refresh the IOAM header and data
356   (e.g., clear the node data list and reset the Remaining Length
357   field).

[minor]
Does this means that everything required for this to work already exists? If no, then what piece of encoding and formal procedures is missing?

[major]
WHat does the formal procedure to clear node data list and length fields exactly mean?

359 5.  Application Considerations for Multicast Protocols

[major]
What about segment routing replication segments?
https://www.rfc-editor.org/rfc/rfc9524.html
There seems some ongoing work wrt SRV6 replication segments (currently still work in progress and expired, but nevertheless one can expect this to be developed sooner or later)

From a high level perspective this sections seems slightly overkill and i am not sure it adds alot of value. Maybe i am missing a introduction of what this section is all about?. If this is saying that telemetry can be used for these types of tunnels, is there then need for so much text and acronyms? WHy not simply list all of them and reduce the complete section to some bullet points?

366   diagnostic information.  Unlike unicast traceroute, Mtrace2 traces
367   the path that the tree building messages follow from receiver to
368   source.  It is usually initiated from an Mtrace2 client by sending an

[minor]
These follow the control plane messages to build the tree? for all tree building technologies?
how would that work for things like MOSPF for example? Maybe there is assumtion that this is some PIM style of messaging involved?

382   status data through direct measurements.  There are various multicast
383   protocols that are used to forward the multicast data.  Each will
384   require their own unique on-path telemetry solution.  Mtrace2 doesn't

[minor]
I am not sure what this is saying exactly with 'multicast data'. I assume that this is saying that there are multiple multicast protcols to build forwarding trees? or is the 'multicast data' refering to somethign else?

388 5.2.  Application in PIM

[major]
What about IPv6
What about PIM-BIDIR? PIM-DM (eventhough it is non-optimal technology)
I am not sure what the intend of this section is? Is it only to say the telemetry can be usefull when PIM is used?

405 5.3.  Application of MVPN X-PMSI Tunnel Encapsulation Attribute

[major]
What is this section trying to achieve? so many acronyms and very different tunnel types.

433 5.4.  Application in BIER

[major]
This section is not providing an IOAM procedures, but seems to be saying that there are BIER requirements and that there is possibility for adding additional metadata in the BIER headers. However no formal procedures are provided, but only indicated. if there are formal procedures to make such mapping, then that should be made explicitly cristal clear in the prescriptive text on how to acieve such

454 6.  Security Considerations

[minor]
high volume mcast streams can be filling up BW very rapidly. IOAM sampling will be important to protect the infrastructure
2024-05-27
09 Gunter Van de Velde [Ballot Position Update] New position, Discuss, has been recorded for Gunter Van de Velde
2024-05-25
09 Erik Kline [Ballot Position Update] New position, No Objection, has been recorded for Erik Kline
2024-05-23
09 Éric Vyncke
[Ballot discuss]

# Éric Vyncke, INT AD, comments for draft-ietf-mboned-multicast-telemetry-09

Thank you for the work put into this document. Mcast is indeed a special 'beast' …
[Ballot discuss]

# Éric Vyncke, INT AD, comments for draft-ietf-mboned-multicast-telemetry-09

Thank you for the work put into this document. Mcast is indeed a special 'beast' deserving specific metrics and measurement methods.

Please find below one blocking DISCUSS points (easy to address), some non-blocking COMMENT points (but replies would be appreciated even if only for my own education), and some nits.

Special thanks to Max Franke for the shepherd's detailed write-up including the WG consensus and the justification of the intended status and of the authors count.

I hope that this review helps to improve the document,

Regards,

-éric


# DISCUSS (blocking)

As noted in https://www.ietf.org/blog/handling-iesg-ballot-positions/, a DISCUSS ballot is a request to have a discussion on the following topics:

## Section 4.1

What happens when only either the N or the I bit is set and the other one not? I fail to understand the need of having two bits to indicate the presence of a single field. RFC 9197 does not name bits, why changing this in this I-D ? (even if I prefer myself having names for bits).

What are the default value of "unused" bits in figure 2? What is the expected behaviour of the receiver when those fields are non zero?

## Section 5.2

It is unclear whether IPv6 is supported by this document per:
```
IP Multicast packets fall within the range of 224.0.0.0 through 239.255.255.255. The telemetry solution will need to work within this IPv4 address range and provide telemetry data for this UDP traffic.
```

The second paragraph gives some "hints" but to a draft, as this I-D is intended to proposed standard, it must be clearer about IPv6 support.
2024-05-23
09 Éric Vyncke
[Ballot comment]

# COMMENTS (non-blocking)

## Goal of the document

I am unclear whether this I-D lists requirements or specificies solutions. Only section 4.1 seems …
[Ballot comment]

# COMMENTS (non-blocking)

## Goal of the document

I am unclear whether this I-D lists requirements or specificies solutions. Only section 4.1 seems to be normative, the other are plans for future work. It appears as a mix of proposed standard and informational in a single document. Strongly suggest to to move section 4.1 in its own PS I-D and keep the rest as informational.

## Abstract

`This document specifies the requirements` should it rather be "This document lists the requirements" ?

## Section 2

Add an informative reference to PIM & BIER.

s/a bier header/a BIER header/ (or do you mean beer foam ? -- sorry I am from Belgium)

Excuse my ignorance but where is VPN coming from in `Gather the VPN state and tunnel information in case of P2MP` or do yo mean tunnels ?

## Section 3

A small SVG/ASCII ART graphics will be welcome.

I also wonder whether the postcard technique won't overwhelm / DoS the postcard receivers if there is no aggregation.

OTOH, the iOAM technique is mainly useful on the receiver(s), how can the source(s) and network operator collect the data ?

## Section 4.1

What is `PBT-M`?

Figure 1 does not seem to fit the topology described above in the text...

Figure 3 (or some text) should refer to figure 2 for `Multicast Branch ID` (just to be clear).

# NITS (non-blocking / cosmetic)

## Section 5.1

s/5.1. Mtrace verson 2/5.1. Mtrace version 2/
2024-05-23
09 Éric Vyncke [Ballot Position Update] New position, Discuss, has been recorded for Éric Vyncke
2024-05-23
09 Bernard Aboba
Request for Last Call review by TSVART Completed: Ready with Nits. Reviewer: Bernard Aboba. Sent review to list. Submission of review completed at an earlier …
Request for Last Call review by TSVART Completed: Ready with Nits. Reviewer: Bernard Aboba. Sent review to list. Submission of review completed at an earlier date.
2024-05-23
09 Bernard Aboba Request for Last Call review by TSVART Completed: Ready with Nits. Reviewer: Bernard Aboba.
2024-05-22
09 Cindy Morgan Placed on agenda for telechat - 2024-05-30
2024-05-22
09 Warren Kumari Ballot has been issued
2024-05-22
09 Warren Kumari [Ballot Position Update] New position, Yes, has been recorded for Warren Kumari
2024-05-22
09 Warren Kumari Created "Approve" ballot
2024-05-22
09 Warren Kumari IESG state changed to IESG Evaluation from Waiting for AD Go-Ahead
2024-05-22
09 (System) IESG state changed to Waiting for AD Go-Ahead from In Last Call
2024-05-21
09 (System) IANA Review state changed to IANA OK - Actions Needed from IANA - Review Needed
2024-05-21
09 David Dong
(Via drafts-lastcall@iana.org): IESG/Authors/WG Chairs:

IANA has completed its review of draft-ietf-mboned-multicast-telemetry-09. If any part of this review is inaccurate, please let us know.

IANA …
(Via drafts-lastcall@iana.org): IESG/Authors/WG Chairs:

IANA has completed its review of draft-ietf-mboned-multicast-telemetry-09. If any part of this review is inaccurate, please let us know.

IANA understands that, upon approval of this document, there is a single action which we must complete.

In the IOAM DEX Extension-Flags registry in the In Situ OAM (IOAM) registry group located at:

https://www.iana.org/assignments/ioam/

two new registrations are to be made as follows:

Bit: [ TBD-at-Registration ]
Description: Multicast Branching Node ID
Reference: [ RFC-to-be ]

Bit: [ TBD-at-Registration ]
Description: Multicast Branching Interface Index
Reference: [ RFC-to-be ]

IANA understands that the authors have requested that bits 2 and 3 be assigned to these new registrations. Please note that specific values cannot be reserved. However, early allocation is available for some types of registrations. For more information, please see RFC 7120.

We understand that this is the only action required to be completed upon approval of this document.

NOTE: The actions requested in this document will not be completed until the document has been approved for publication as an RFC. This message is meant only to confirm the list of actions that will be performed.

For definitions of IANA review states, please see:

https://datatracker.ietf.org/help/state/draft/iana-review

Thank you,

David Dong
IANA Services Sr. Specialist
2024-05-20
09 Adam Montville Request for Last Call review by SECDIR Completed: Ready. Reviewer: Adam Montville. Sent review to list.
2024-05-17
09 Magnus Westerlund Request for Last Call review by TSVART is assigned to Bernard Aboba
2024-05-13
09 Carlos Pignataro Request for Last Call review by OPSDIR is assigned to Menachem Dodge
2024-05-13
09 Roni Even Request for Last Call review by GENART Completed: Ready. Reviewer: Roni Even. Sent review to list.
2024-05-09
09 Tero Kivinen Request for Last Call review by SECDIR is assigned to Adam Montville
2024-05-09
09 Jean Mahoney Request for Last Call review by GENART is assigned to Roni Even
2024-05-08
09 Liz Flynn IANA Review state changed to IANA - Review Needed
2024-05-08
09 Liz Flynn
The following Last Call announcement was sent out (ends 2024-05-22):

From: The IESG
To: IETF-Announce
CC: draft-ietf-mboned-multicast-telemetry@ietf.org, mboned-chairs@ietf.org, mboned@ietf.org, mfranke@inet.tu-berlin.de, warren@kumari.net …
The following Last Call announcement was sent out (ends 2024-05-22):

From: The IESG
To: IETF-Announce
CC: draft-ietf-mboned-multicast-telemetry@ietf.org, mboned-chairs@ietf.org, mboned@ietf.org, mfranke@inet.tu-berlin.de, warren@kumari.net
Reply-To: last-call@ietf.org
Sender:
Subject: Last Call:  (Multicast On-path Telemetry using IOAM) to Proposed Standard


The IESG has received a request from the MBONE Deployment WG (mboned) to
consider the following document: - 'Multicast On-path Telemetry using IOAM'
  as Proposed Standard

The IESG plans to make a decision in the next few weeks, and solicits final
comments on this action. Please send substantive comments to the
last-call@ietf.org mailing lists by 2024-05-22. Exceptionally, comments may
be sent to iesg@ietf.org instead. In either case, please retain the beginning
of the Subject line to allow automated sorting.

Abstract


  This document specifies the requirements of on-path telemetry for
  multicast traffic using In-situ OAM.  While In-situ OAM is
  advantageous for multicast traffic telemetry, some unique challenges
  are present.  This document provides the solutions based on the In-
  situ OAM trace option and direct export option to support the
  telemetry data correlation and the multicast tree reconstruction
  without incurring data redundancy.





The file can be obtained via
https://datatracker.ietf.org/doc/draft-ietf-mboned-multicast-telemetry/


The following IPR Declarations may be related to this I-D:

  https://datatracker.ietf.org/ipr/4633/





2024-05-08
09 Liz Flynn IESG state changed to In Last Call from Last Call Requested
2024-05-08
09 Liz Flynn Last call announcement was generated
2024-05-08
09 Warren Kumari Last call was requested
2024-05-08
09 Warren Kumari Last call announcement was generated
2024-05-08
09 Warren Kumari Ballot approval text was generated
2024-05-08
09 Warren Kumari IESG state changed to Last Call Requested from Publication Requested
2024-05-08
09 Warren Kumari Ballot writeup was changed
2024-04-05
09 Lenny Giuliano
# Document Shepherd Write-Up for Group Documents

*This version is dated 4 July 2022.*

Thank you for your service as a document shepherd. Among the …
# Document Shepherd Write-Up for Group Documents

*This version is dated 4 July 2022.*

Thank you for your service as a document shepherd. Among the responsibilities is
answering the questions in this write-up to give helpful context to Last Call
and Internet Engineering Steering Group ([IESG][1]) reviewers, and your
diligence in completing it is appreciated. The full role of the shepherd is
further described in [RFC 4858][2]. You will need the cooperation of the authors
and editors to complete these checks.

Note that some numbered items contain multiple related questions; please be sure
to answer all of them.


## Document History

1. Does the working group (WG) consensus represent the strong concurrence of a
  few individuals, with others being silent, or did it reach broad agreement?

The document reached strong consensus to advance, with contributors from both the MBONED WG as well as the IPPM WG voicing support.


2. Was there controversy about particular points, or were there decisions where
  the consensus was particularly rough?

No.


3. Has anyone threatened an appeal or otherwise indicated extreme discontent? If
  so, please summarize the areas of conflict in separate email messages to the
  responsible Area Director. (It should be in a separate email because this
  questionnaire is publicly available.)

No.


4. For protocol documents, are there existing implementations of the contents of
  the document? Have a significant number of potential implementers indicated
  plans to implement? Are any existing implementations reported somewhere,
  either in the document itself (as [RFC 7942][3] recommends) or elsewhere
  (where)?

This is not a protocol document.


## Additional Reviews

5. Do the contents of this document closely interact with technologies in other
  IETF working groups or external organizations, and would it therefore benefit
  from their review? Have those reviews occurred? If yes, describe which
  reviews took place.

This draft was originally presented at the IPPM WG but it was decided there that it should either go to the PIM or MBONED WG. At the end, MBONED was the most fitting and took on the document. During the development of the document, close coordination with IPPM occurred, which also included reviews.


6. Describe how the document meets any required formal expert review criteria,
  such as the MIB Doctor, YANG Doctor, media type, and URI type reviews.

No models or types are used in the document so no expert reviews are necessary.


7. If the document contains a YANG module, has the final version of the module
  been checked with any of the [recommended validation tools][4] for syntax and
  formatting validation? If there are any resulting errors or warnings, what is
  the justification for not fixing them at this time? Does the YANG module
  comply with the Network Management Datastore Architecture (NMDA) as specified
  in [RFC 8342][5]?

The document does not contain a YANG module.


8. Describe reviews and automated checks performed to validate sections of the
  final version of the document written in a formal language, such as XML code,
  BNF rules, MIB definitions, CBOR's CDDL, etc.

The document does not include any formal language sections.


## Document Shepherd Checks

9. Based on the shepherd's review of the document, is it their opinion that this
  document is needed, clearly written, complete, correctly designed, and ready
  to be handed off to the responsible Area Director?

Yes, the document is well written and complete. The necessity to modify IOAM to reduce data redundancy when used with multicast is obvious.


10. Several IETF Areas have assembled [lists of common issues that their
  reviewers encounter][6]. For which areas have such issues been identified
  and addressed? For which does this still need to happen in subsequent
  reviews?

The listed issues do not appear in the document:

    - DNS: The Document is not related to DNS.
    - Use of IPv6 packets with extension headers or fragments: There are no special considerations for IPv6, the Document references RFC 9486 for this topic.
    - Assumptions of how end-user networks connect to the Internet: The document makes no such assupmtions.
    - Use of QoS markings: The document does not have any QoS markings.
    - Use of MIBs and YANG modules: The document does not have any MIBs or YANG modules.


11. What type of RFC publication is being requested on the IETF stream ([Best
  Current Practice][12], [Proposed Standard, Internet Standard][13],
  [Informational, Experimental or Historic][14])? Why is this the proper type
  of RFC? Do all Datatracker state attributes correctly reflect this intent?

Proposed Standard. This is appropriate given that this document defines two
extensions to other proposed standards. The data tracker reflects this intent.


12. Have reasonable efforts been made to remind all authors of the intellectual
  property rights (IPR) disclosure obligations described in [BCP 79][7]? To
  the best of your knowledge, have all required disclosures been filed? If
  not, explain why. If yes, summarize any relevant discussion, including links
  to publicly-available messages when applicable.

Yes, the co-authors have filed their disclosures on the MBONED mailing list. Two authors listed a potentially relevant IPR:
- https://mailarchive.ietf.org/arch/msg/mboned/zARa_b8Dww0JiYs8mdi7CqcwkWw/
- https://mailarchive.ietf.org/arch/msg/mboned/41YsoioPnPpWdUxJpnge7UxRJ44/
- https://datatracker.ietf.org/ipr/search/?submit=draft&id=draft-ietf-mboned-multicast-telemetry

13. Has each author, editor, and contributor shown their willingness to be
  listed as such? If the total number of authors and editors on the front page
  is greater than five, please provide a justification.

Yes, all authors are listed. There are 6 authors. All 6 authors made significant contributions to this document. As per RFC7322, it might be appropriate to name one or two editors.


14. Document any remaining I-D nits in this document. Simply running the [idnits
  tool][8] is not enough; please review the ["Content Guidelines" on
  authors.ietf.org][15]. (Also note that the current idnits tool generates
  some incorrect warnings; a rewrite is underway.)

There are some outdated references that have already been acknowledged by the authors and will be fixed in the next version of the ID.

15. Should any informative references be normative or vice-versa? See the [IESG
  Statement on Normative and Informative References][16].

No.


16. List any normative references that are not freely available to anyone. Did
  the community have sufficient access to review any such normative
  references?

None.


17. Are there any normative downward references (see [RFC 3967][9] and [BCP
  97
][10]) that are not already listed in the [DOWNREF registry][17]? If so,
  list them.

None.


18. Are there normative references to documents that are not ready to be
  submitted to the IESG for publication or are otherwise in an unclear state?
  If so, what is the plan for their completion?

None.


19. Will publication of this document change the status of any existing RFCs? If
  so, does the Datatracker metadata correctly reflect this and are those RFCs
  listed on the title page, in the abstract, and discussed in the
  introduction? If not, explain why and point to the part of the document
  where the relationship of this document to these other RFCs is discussed.

No.


20. Describe the document shepherd's review of the IANA considerations section,
  especially with regard to its consistency with the body of the document.
  Confirm that all aspects of the document requiring IANA assignments are
  associated with the appropriate reservations in IANA registries. Confirm
  that any referenced IANA registries have been clearly identified. Confirm
  that each newly created IANA registry specifies its initial contents,
  allocations procedures, and a reasonable name (see [RFC 8126][11]).

There are 2 new extension flag registrations to the "IOAM DEX Extension-Flags" registry requested by the document. Both are reflected and specified in the text of the document. Their names are reasonable.


21. List any new IANA registries that require Designated Expert Review for
  future allocations. Are the instructions to the Designated Expert clear?
  Please include suggestions of designated experts, if appropriate.

There are no IANA considerations that require designated expert review.



[1]: https://www.ietf.org/about/groups/iesg/
[2]: https://www.rfc-editor.org/rfc/rfc4858.html
[3]: https://www.rfc-editor.org/rfc/rfc7942.html
[4]: https://trac.ietf.org/trac/ops/wiki/yang-review-tools
[5]: https://www.rfc-editor.org/rfc/rfc8342.html
[6]: https://trac.ietf.org/trac/iesg/wiki/ExpertTopics
[7]: https://www.rfc-editor.org/info/bcp79
[8]: https://www.ietf.org/tools/idnits/
[9]: https://www.rfc-editor.org/rfc/rfc3967.html
[10]: https://www.rfc-editor.org/info/bcp97
[11]: https://www.rfc-editor.org/rfc/rfc8126.html
[12]: https://www.rfc-editor.org/rfc/rfc2026.html#section-5
[13]: https://www.rfc-editor.org/rfc/rfc2026.html#section-4.1
[14]: https://www.rfc-editor.org/rfc/rfc2026.html#section-4.2
[15]: https://authors.ietf.org/en/content-guidelines-overview
[16]: https://www.ietf.org/about/groups/iesg/statements/normative-informative-references/
[17]: https://datatracker.ietf.org/doc/downref/
2024-04-05
09 Lenny Giuliano IETF WG state changed to Submitted to IESG for Publication from WG Consensus: Waiting for Write-Up
2024-04-05
09 Lenny Giuliano IESG state changed to Publication Requested from I-D Exists
2024-04-05
09 (System) Changed action holders to Warren Kumari (IESG state changed)
2024-04-05
09 Lenny Giuliano Responsible AD changed to Warren Kumari
2024-04-05
09 Lenny Giuliano Document is now in IESG state Publication Requested
2024-04-01
09 Haoyu Song New version available: draft-ietf-mboned-multicast-telemetry-09.txt
2024-04-01
09 Haoyu Song New version accepted (logged-in submitter: Haoyu Song)
2024-04-01
09 Haoyu Song Uploaded new revision
2023-12-04
08 Max Franke
# Document Shepherd Write-Up for Group Documents

*This version is dated 4 July 2022.*

Thank you for your service as a document shepherd. Among the …
# Document Shepherd Write-Up for Group Documents

*This version is dated 4 July 2022.*

Thank you for your service as a document shepherd. Among the responsibilities is
answering the questions in this write-up to give helpful context to Last Call
and Internet Engineering Steering Group ([IESG][1]) reviewers, and your
diligence in completing it is appreciated. The full role of the shepherd is
further described in [RFC 4858][2]. You will need the cooperation of the authors
and editors to complete these checks.

Note that some numbered items contain multiple related questions; please be sure
to answer all of them.


## Document History

1. Does the working group (WG) consensus represent the strong concurrence of a
  few individuals, with others being silent, or did it reach broad agreement?

The document reached strong consensus to advance, with contributors from both the MBONED WG as well as the IPPM WG voicing support.


2. Was there controversy about particular points, or were there decisions where
  the consensus was particularly rough?

No.


3. Has anyone threatened an appeal or otherwise indicated extreme discontent? If
  so, please summarize the areas of conflict in separate email messages to the
  responsible Area Director. (It should be in a separate email because this
  questionnaire is publicly available.)

No.


4. For protocol documents, are there existing implementations of the contents of
  the document? Have a significant number of potential implementers indicated
  plans to implement? Are any existing implementations reported somewhere,
  either in the document itself (as [RFC 7942][3] recommends) or elsewhere
  (where)?

This is not a protocol document.


## Additional Reviews

5. Do the contents of this document closely interact with technologies in other
  IETF working groups or external organizations, and would it therefore benefit
  from their review? Have those reviews occurred? If yes, describe which
  reviews took place.

This draft was originally presented at the IPPM WG but it was decided there that it should either go to the PIM or MBONED WG. At the end, MBONED was the most fitting and took on the document. During the development of the document, close coordination with IPPM occurred, which also included reviews.


6. Describe how the document meets any required formal expert review criteria,
  such as the MIB Doctor, YANG Doctor, media type, and URI type reviews.

No models or types are used in the document so no expert reviews are necessary.


7. If the document contains a YANG module, has the final version of the module
  been checked with any of the [recommended validation tools][4] for syntax and
  formatting validation? If there are any resulting errors or warnings, what is
  the justification for not fixing them at this time? Does the YANG module
  comply with the Network Management Datastore Architecture (NMDA) as specified
  in [RFC 8342][5]?

The document does not contain a YANG module.


8. Describe reviews and automated checks performed to validate sections of the
  final version of the document written in a formal language, such as XML code,
  BNF rules, MIB definitions, CBOR's CDDL, etc.

The document does not include any formal language sections.


## Document Shepherd Checks

9. Based on the shepherd's review of the document, is it their opinion that this
  document is needed, clearly written, complete, correctly designed, and ready
  to be handed off to the responsible Area Director?

Yes, the document is well written and complete. The necessity to modify IOAM to reduce data redundancy when used with multicast is obvious.


10. Several IETF Areas have assembled [lists of common issues that their
  reviewers encounter][6]. For which areas have such issues been identified
  and addressed? For which does this still need to happen in subsequent
  reviews?

The listed issues do not appear in the document:

    - DNS: The Document is not related to DNS.
    - Use of IPv6 packets with extension headers or fragments: There are no special considerations for IPv6, the Document references RFC 9486 for this topic.
    - Assumptions of how end-user networks connect to the Internet: The document makes no such assupmtions.
    - Use of QoS markings: The document does not have any QoS markings.
    - Use of MIBs and YANG modules: The document does not have any MIBs or YANG modules.


11. What type of RFC publication is being requested on the IETF stream ([Best
  Current Practice][12], [Proposed Standard, Internet Standard][13],
  [Informational, Experimental or Historic][14])? Why is this the proper type
  of RFC? Do all Datatracker state attributes correctly reflect this intent?

Proposed Standard. This is appropriate given that this document defines two
extensions to other proposed standards. The data tracker reflects this intent.


12. Have reasonable efforts been made to remind all authors of the intellectual
  property rights (IPR) disclosure obligations described in [BCP 79][7]? To
  the best of your knowledge, have all required disclosures been filed? If
  not, explain why. If yes, summarize any relevant discussion, including links
  to publicly-available messages when applicable.

Yes, the co-authors have filed their disclosures on the MBONED mailing list. Two authors listed a potentially relevant IPR:
- https://mailarchive.ietf.org/arch/msg/mboned/zARa_b8Dww0JiYs8mdi7CqcwkWw/
- https://mailarchive.ietf.org/arch/msg/mboned/41YsoioPnPpWdUxJpnge7UxRJ44/
- https://datatracker.ietf.org/ipr/search/?submit=draft&id=draft-ietf-mboned-multicast-telemetry

13. Has each author, editor, and contributor shown their willingness to be
  listed as such? If the total number of authors and editors on the front page
  is greater than five, please provide a justification.

Yes, all authors are listed. There are 6 authors. All 6 authors made significant contributions to this document. As per RFC7322, it might be appropriate to name one or two editors.


14. Document any remaining I-D nits in this document. Simply running the [idnits
  tool][8] is not enough; please review the ["Content Guidelines" on
  authors.ietf.org][15]. (Also note that the current idnits tool generates
  some incorrect warnings; a rewrite is underway.)

There are some outdated references that have already been acknowledged by the authors and will be fixed in the next version of the ID.

15. Should any informative references be normative or vice-versa? See the [IESG
  Statement on Normative and Informative References][16].

No.


16. List any normative references that are not freely available to anyone. Did
  the community have sufficient access to review any such normative
  references?

None.


17. Are there any normative downward references (see [RFC 3967][9] and [BCP
  97
][10]) that are not already listed in the [DOWNREF registry][17]? If so,
  list them.

None.


18. Are there normative references to documents that are not ready to be
  submitted to the IESG for publication or are otherwise in an unclear state?
  If so, what is the plan for their completion?

None.


19. Will publication of this document change the status of any existing RFCs? If
  so, does the Datatracker metadata correctly reflect this and are those RFCs
  listed on the title page, in the abstract, and discussed in the
  introduction? If not, explain why and point to the part of the document
  where the relationship of this document to these other RFCs is discussed.

No.


20. Describe the document shepherd's review of the IANA considerations section,
  especially with regard to its consistency with the body of the document.
  Confirm that all aspects of the document requiring IANA assignments are
  associated with the appropriate reservations in IANA registries. Confirm
  that any referenced IANA registries have been clearly identified. Confirm
  that each newly created IANA registry specifies its initial contents,
  allocations procedures, and a reasonable name (see [RFC 8126][11]).

There are 2 new extension flag registrations to the "IOAM DEX Extension-Flags" registry requested by the document. Both are reflected and specified in the text of the document. Their names are reasonable.


21. List any new IANA registries that require Designated Expert Review for
  future allocations. Are the instructions to the Designated Expert clear?
  Please include suggestions of designated experts, if appropriate.

There are no IANA considerations that require designated expert review.



[1]: https://www.ietf.org/about/groups/iesg/
[2]: https://www.rfc-editor.org/rfc/rfc4858.html
[3]: https://www.rfc-editor.org/rfc/rfc7942.html
[4]: https://trac.ietf.org/trac/ops/wiki/yang-review-tools
[5]: https://www.rfc-editor.org/rfc/rfc8342.html
[6]: https://trac.ietf.org/trac/iesg/wiki/ExpertTopics
[7]: https://www.rfc-editor.org/info/bcp79
[8]: https://www.ietf.org/tools/idnits/
[9]: https://www.rfc-editor.org/rfc/rfc3967.html
[10]: https://www.rfc-editor.org/info/bcp97
[11]: https://www.rfc-editor.org/rfc/rfc8126.html
[12]: https://www.rfc-editor.org/rfc/rfc2026.html#section-5
[13]: https://www.rfc-editor.org/rfc/rfc2026.html#section-4.1
[14]: https://www.rfc-editor.org/rfc/rfc2026.html#section-4.2
[15]: https://authors.ietf.org/en/content-guidelines-overview
[16]: https://www.ietf.org/about/groups/iesg/statements/normative-informative-references/
[17]: https://datatracker.ietf.org/doc/downref/
2023-12-04
08 Max Franke Changed consensus to Yes from Unknown
2023-12-04
08 Max Franke Intended Status changed to Proposed Standard from None
2023-11-16
08 Lenny Giuliano Notification list changed to mfranke@inet.tu-berlin.de because the document shepherd was set
2023-11-16
08 Lenny Giuliano Document shepherd changed to Max Franke
2023-11-16
08 Lenny Giuliano Tag Doc Shepherd Follow-up Underway set.
2023-11-16
08 Lenny Giuliano IETF WG state changed to WG Consensus: Waiting for Write-Up from WG Document
2023-10-06
08 Haoyu Song New version available: draft-ietf-mboned-multicast-telemetry-08.txt
2023-10-06
08 (System) New version approved
2023-10-06
08 (System) Request for posting confirmation emailed to previous authors: Greg Mirsky , Gyan Mishra , Haoyu Song , Hitoshi Asaeda , Mike McBride , Tianran Zhou
2023-10-06
08 Haoyu Song Uploaded new revision
2023-09-06
07 Haoyu Song New version available: draft-ietf-mboned-multicast-telemetry-07.txt
2023-09-06
07 Haoyu Song New version approved
2023-09-06
07 (System) Request for posting confirmation emailed to previous authors: Greg Mirsky , Gyan Mishra , Haoyu Song , Hitoshi Asaeda , Mike McBride , Tianran Zhou
2023-09-06
07 Haoyu Song Uploaded new revision
2023-03-10
06 Haoyu Song New version available: draft-ietf-mboned-multicast-telemetry-06.txt
2023-03-10
06 (System) New version approved
2023-03-10
06 (System) Request for posting confirmation emailed to previous authors: Greg Mirsky , Gyan Mishra , Haoyu Song , Hitoshi Asaeda , Mike McBride , Tianran Zhou
2023-03-10
06 Haoyu Song Uploaded new revision
2023-02-13
05 Haoyu Song New version available: draft-ietf-mboned-multicast-telemetry-05.txt
2023-02-13
05 (System) New version approved
2023-02-13
05 (System) Request for posting confirmation emailed to previous authors: Greg Mirsky , Gyan Mishra , Haoyu Song , Hitoshi Asaeda , Mike McBride , Tianran Zhou
2023-02-13
05 Haoyu Song Uploaded new revision
2023-02-12
04 (System) Document has expired
2022-08-11
04 Haoyu Song New version available: draft-ietf-mboned-multicast-telemetry-04.txt
2022-08-11
04 (System) New version approved
2022-08-11
04 (System) Request for posting confirmation emailed to previous authors: Greg Mirsky , Gyan Mishra , Haoyu Song , Hitoshi Asaeda , Mike McBride , Tianran Zhou
2022-08-11
04 Haoyu Song Uploaded new revision
2022-07-05
03 Haoyu Song New version available: draft-ietf-mboned-multicast-telemetry-03.txt
2022-07-05
03 Haoyu Song New version accepted (logged-in submitter: Haoyu Song)
2022-07-05
03 Haoyu Song Uploaded new revision
2022-01-04
02 Mike McBride New version available: draft-ietf-mboned-multicast-telemetry-02.txt
2022-01-04
02 (System) New version accepted (logged-in submitter: Mike McBride)
2022-01-04
02 Mike McBride Uploaded new revision
2021-07-06
01 Mike McBride New version available: draft-ietf-mboned-multicast-telemetry-01.txt
2021-07-06
01 (System) New version accepted (logged-in submitter: Mike McBride)
2021-07-06
01 Mike McBride Uploaded new revision
2021-02-22
00 Lenny Giuliano This document now replaces draft-song-multicast-telemetry instead of None
2021-02-22
00 Mike McBride New version available: draft-ietf-mboned-multicast-telemetry-00.txt
2021-02-22
00 (System) WG -00 approved
2021-02-22
00 Mike McBride Set submitter to "Mike McBride ", replaces to draft-song-multicast-telemetry and sent approval email to group chairs: mboned-chairs@ietf.org
2021-02-22
00 Mike McBride Uploaded new revision