# Benchmarking Methodology Working Group (bmwg) - IETF 118 Prague {#benchmarking-methodology-working-group-bmwg---ietf-118-prague} Date and Time: 06/11/2023 Monday 9:30-11:30 CET Location: Hilton Prague, Amsterdam Chairs: Sarah Banks, Giuseppe Fioccola AD: Warren Kumari Zulip Stream: https://zulip.ietf.org/login/#narrow/stream/bmwg ## Note taker(s), Jabber, IPR {#note-takers-jabber-ipr} Paolo Volpato volunteered. ## Administrative and WG Status (Chairs) {#administrative-and-wg-status-chairs} * WG Adoptions and Proposals * Welcome and self-introduction from Giuseppe Agenda discussed: * WGLC is coming for Stateful NATxy Gateways BM draft * Request to revive the EVPN draft has been declined by the WG (lack of support) * Segment Routing drafts will remain as separate drafts (mailing list discussion and feedback from authors) BMWG activity presented and milestone revised. ## WG Drafts {#wg-drafts} ### Multiple Loss Ratio Search and YANG Model, Maciek Konstantynowicz and Vratko Polak {#multiple-loss-ratio-search-and-yang-model-maciek-konstantynowicz-and-vratko-polak} [draft-ietf-bmwg-mlrsearch][1] * Maciek and Vratko presented (https://datatracker.ietf.org/meeting/118/materials/slides-118-bmwg-2-multiple-loss-ratio-search-and-yang-model). It was presented the update to the draft. They are not asking yet for WGLC but are looking for reviewers/co-authors and feedback from testers and developers. It is a big problem to have a reliable test in a virtualized environment. If time permits, slot to present the results of open source testing. Acknowledgment to Al Morton. Carsten Rossenhoevel: good work to continue. Software and virtual environment need longer test duration, too short test duration may give wrong results. This does not influence the draft itself, but it is suggested to add a note on that. Maciek Konstantynowicz: good comment, even the perfect SW in a shared environment is not isolated. It is about SUT and DUT that should help in isolation. MLRserach comes with parameters from where you can get result compatible with RFC 2544. This represents test duration, not the actual duration. Vratko Polak: it is all about splitting the noise from useful signals in the spectrum, if interested, please give input. Carsten Rossenhoevel: if the draft becomes RFC it aims to amend RFC 2544. So, industry argues we use RFC 2544. If you write something, then that result becomes normative. The particular test time may be good for one environment but not for another. Maciek Konstantynowicz: We want to standardize it, but we need feedback. We give value to what you say, but the time value of MLRsearch is configurable. Experts will customize it for the situation. We did not ask for WGLC, but we aim to get support from the WG. ### Benchmarking Methodology for Stateful NATxy Gateways using RFC 4814 Pseudorandom Port numbers, Gabor Lencse {#benchmarking-methodology-for-stateful-natxy-gateways-using-rfc-4814-pseudorandom-port-numbers-gabor-lencse} [draft-ietf-bmwg-benchmarking-stateful][2] * Gabor presented (https://datatracker.ietf.org/meeting/118/materials/slides-118-bmwg-3-benchmarking-methodology-for-stateful-natxy-gateways-using-rfc-4814-pseudorandom-port-numbers). Presented the summary of the proposal. Asking for WGLC. No questions. Giuseppe Fioccola: WGLC will be done on the mailing list. ## Proposals {#proposals} ### Recommendations for using Multiple IP addresses in Benchmarking Tests, Gabor Lencse {#recommendations-for-using-multiple-ip-addresses-in-benchmarking-tests-gabor-lencse} [draft-lencse-bmwg-multiple-ip-addresses][3] * Gabor presented (https://datatracker.ietf.org/meeting/118/materials/slides-118-bmwg-4-recommendations-for-using-multiple-ip-addresses-in-benchmarking-tests) Problem described and solution on IP address pseudoradomization introduced. It is important to have the hash function with random fields, all forwarding engines now are multi-core. Question to the audience to acknowledge if the problem exists. IPv4 pools are typically smaller than IPv6. Maciek Konstantynowicz: regarding the problem statement with the 1st RSS and 2nd RSS implementations, port numbers and protocols use that for entropy. In synthetic tests, they may not represent real systems. The problem of using address ranges is interesting. This is also for stateful cases and we should understand how prescriptive it should be to define the benchmark. In general, it is good advice to use bigger IP pools. It looks like a useful augmentation of RFC 2544. Vratko Polak: it is important to have repeatable results, but different environments may not permit it. Tests are related to encapsulated flows, with different technologies. Sometimes the tests use multiple encapsulations. Gabor Lencse: We need to improve the RSS part. Sarah Banks: it's a decision based on implementation. You design the test based on the SUT or tester. Gabor Lencse: software systems need to load all cores. Maciek Konstantynowicz: if we go ahead with this work, including RSS, repeatability might be impacted. Cores are open systems and are configurable. In SW you specify the resources, so I agree partially with Sarah. Sarah Banks: thanks for clarification. As a tester you figure out what you have. BMWG stayed in RFC 2544 era for long. Need to have a conversation. ### Considerations for Benchmarking Network Performance in Containerized Infrastructures, Tran Minh Ngoc {#considerations-for-benchmarking-network-performance-in-containerized-infrastructures-tran-minh-ngoc} [draft-dcn-bmwg-containerized-infra][4] * Tran presented (https://datatracker.ietf.org/meeting/118/materials/slides-118-bmwg-5-considerations-for-benchmarking-network-performance-in-containerized-infrastructures). Update from -11 to -13 presented. Giuseppe Fioccola: thanks for addressing the comments Maciek Konstantynowicz: thanks for presenting. Vratko provided the specific comments. We try to do containerized network functions test. We faced the repeatability of tests. Kubernetes open source setup containerization setup in a repeatable manner is an issue. Do you have a section on this? Gabor Lencse: support the work. Maciek Konstantynowicz: also support. Giuseppe Fioccola: let's do a poll to understand how many people have read the draft. Sarah Banks: thanks to 7 people who read the draft. Adoption should happen in the mailing list. ### Benchmarking methodology for MPLS Segment Routing, Paolo Volpato {#benchmarking-methodology-for-mpls-segment-routing-paolo-volpato} [draft-vfv-bmwg-srmpls-bench-meth][5] * Paolo presented (https://datatracker.ietf.org/meeting/118/materials/slides-118-bmwg-6-benchmarking-methodology-for-mpls-segment-routing). The last version incorporates almost all text from referenced RFCs. Gabor Lencse: concern on 2 testers, suggest to add logic to manage both testers. Suggested to revise the figure about the DUT test Paolo Volpato: Sure, it will be added to the next version. Maciek Konstantynowicz: asking for other SR functions to test and highlighted that the two drafts on SR may also be merged. Paolo Volpato: we considered the services for other documents and authors still think that 2 separate drafts are better. Qi Zhang: Do we have new metrics for the SR. Any test results? Paolo Volpato: yes, new parameters described in a dedicated section. 3rd party test results are expected. Carsten Rossenhoevel: insist on merging the draft because 95% is common. SR-MPLS tests have a small incremental value from MPLS BM tests. Maciek Konstantynowicz: support drafts merge. Asking to expand to functions. We are also testing SR-MPLS and we can share some results. Giuseppe Fioccola: discussion to continue on the list. ### Benchmarking methodology for IPv6 Segment Routing, Eduard Vasilenko {#benchmarking-methodology-for-ipv6-segment-routing-eduard-vasilenko} [draft-vfv-bmwg-srv6-bench-meth][6] * Eduard presented (https://datatracker.ietf.org/meeting/118/materials/slides-118-bmwg-7-benchmarking-methodology-for-ipv6-segment-routing). Overview and scope presented. Why to keep services out of the initial scope? Because in particular for SRv6 there are too many. It would become an unmanageable document. If you want more, please be specific on the services. As an opinion, the two drafts should be kept separated so that a tester can choose what they are interested into. Carsten Rossenhoevel: one example of test areas for SRv6 is compressed SIDs. No EVPN. Eduard Vasilenko: apart from it, what else? Sarah Banks: We asked the list to discuss whether to merge or not to merge SR drafts. Giuseppe Fioccola: let's continue on the list. ### Problems and requirements of evaluation methodology for integrated space and terrestrial networks, Zeqi Lai {#problems-and-requirements-of-evaluation-methodology-for-integrated-space-and-terrestrial-networks-zeqi-lai} [draft-lai-bmwg-istn-methodology][7] * Zeqi presented (https://datatracker.ietf.org/meeting/118/materials/slides-118-bmwg-8-benchmarking-methodology-for-reliable-transport-protocols-in-integrated-space-and-terrestrial-networks). Proposal presented. Sarah Banks: thanks for showing up. Please send questions to the list. ## Presentations {#presentations} ### FD.io CSIT Performance Dashboard, Maciek Konstantynowicz and Vratko Polak {#fdio-csit-performance-dashboard-maciek-konstantynowicz-and-vratko-polak} [draft-ietf-bmwg-network-tester-cfg][8] * Vratko/Maciek presented (https://datatracker.ietf.org/meeting/118/materials/slides-118-bmwg-9-fdio-csit-performance-dashboard). Giuseppe Fioccola showed the first slide of Maciek's presentation on FD.io CSIT performance dashboard. Maciek proposes a side meeting on this topic and possibly an interim meeting. Sarah Banks: let's further discuss about interim meetings and let's try to reach consensus on whether to merge the two SR-related drafts. Thanks for the interesting session. [1]: https://datatracker.ietf.org/doc/draft-ietf-bmwg-mlrsearch/ [2]: https://datatracker.ietf.org/doc/draft-ietf-bmwg-benchmarking-stateful/ [3]: https://datatracker.ietf.org/doc/draft-lencse-bmwg-multiple-ip-addresses/ [4]: https://datatracker.ietf.org/doc/draft-dcn-bmwg-containerized-infra/ [5]: https://datatracker.ietf.org/doc/draft-vfv-bmwg-srmpls-bench-meth/ [6]: https://datatracker.ietf.org/doc/draft-vfv-bmwg-srv6-bench-meth/ [7]: https://datatracker.ietf.org/doc/draft-lai-bmwg-istn-methodology/ [8]: https://datatracker.ietf.org/doc/draft-ietf-bmwg-network-tester-cfg/