RFC 9511: Attribution of Internet Probes
- É. Vyncke,
- B. Donnet,
- J. Iurman
Abstract
Active measurements over the public Internet can target either collaborating parties or non
This document suggests some simple techniques for a source to identify its probes. This allows any party or organization to understand what an unsolicited probe packet is, what its purpose is, and, most importantly, who to contact. The technique relies on offline analysis of the probe; therefore, it does not require any change in the data or control plane. It has been designed mainly for layer 3 measurements.¶
Status of This Memo
This document is not an Internet Standards Track specification; it is published for informational purposes.¶
This document is a product of the Internet Engineering Task Force (IETF). It represents the consensus of the IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Not all documents approved by the IESG are candidates for any level of Internet Standard; see Section 2 of RFC 7841.¶
Information about the current status of this document, any
errata, and how to provide feedback on it may be obtained at
https://
Copyright Notice
Copyright (c) 2023 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(https://
1. Introduction
Most measurement research (e.g., [LARGE_SCALE], [RFC7872], and [JAMES]) is about sending IP packets (sometimes with extension headers or layer 4 headers) over the public Internet, and those packets can be destined to either collaborating parties or non
Sending unsolicited probes should obviously be done at a rate low enough to not unduly impact the other parties' resources. But even at a low rate, those probes could trigger an alarm that will request some investigations by either the party receiving the probe (i.e., when the probe destination address is one address assigned to the receiving party) or a third party having some devices through which those probes are transiting (e.g., an Internet transit router). The investigation will be done offline by using packet captures; therefore, probe attribution does not require any change in the data or control planes.¶
This document suggests some simple techniques for a source to identify its probes. This allows any party or organization to understand:¶
It is expected that only researchers with good intentions will use these techniques, although anyone might use them. This is discussed in Section 7.¶
While the technique could be used to mark measurements done at any layer of the protocol stack, it is mainly designed to work for measurements done at layer 3 (and its associated options or extension headers).¶
2. Probe Description
This section provides a way for a source to describe (i.e., to identify) its probes.¶
2.1. Probe Description URI
This document defines a Probe Description URI as a URI pointing to one of the following:¶
2.2. Probe Description File
As defined in Section 8, the Probe Description File must be made available at "
A new field "Description" should also be included to describe the measurement. To match the format defined in Section 4 of [RFC9116], this field must be a one-line string with no line break.¶
3. Out-of-Band Probe Attribution
A possibility for probe attribution is to build a specific URI based on the source address of the probe packet, following [RFC8615]. For example, with a probe source address 2001
The built URI must be a reference to the Probe Description File (see Section 2.2).¶
As an example, the UK National Cyber Security Centre [NCSC] uses a similar attribution. They scan for vulnerabilities across Internet
4. In-Band Probe Attribution
Another possibility for probe attribution is to include a Probe Description URI in the probe itself. Here is a non-exhaustive list of examples:¶
The Probe Description URI must start at the first octet of the payload and must be terminated by an octet of 0x00, i.e., it must be null terminated. If the Probe Description URI cannot be placed at the beginning of the payload, then it must be preceded by an octet of 0x00. Inserting the Probe Description URI could obviously bias the measurement itself if the probe packet becomes larger than the path MTU. Some examples are given in Appendix A.¶
Using a magic string (i.e., a unique, special opaque marker) to signal the presence of the Probe Description URI is not recommended as some transit nodes could apply different processing for packets containing this magic string.¶
For the record, in-band probe attribution was used in [JAMES].¶
5. Operational and Technical Considerations
Using either the out-of-band or in-band technique, or even both combined, highly depends on intent or context. This section describes the upsides and downsides of each technique so that probe owners or probe makers can freely decide what works best for their cases.¶
The advantages of using the out-of-band technique are that the probing measurement is not impacted by probe attribution and that it is easy to set up, i.e., by running a web server on a probe device to describe the measurements. Unfortunately, there are some disadvantages too. In some cases, using the out-of-band technique might not be possible due to several conditions: the presence of a NAT, too many endpoints to run a web server on, the probe source IP address cannot be known (e.g., RIPE Atlas [RIPE_ATLAS] probes are sent from IP addresses not owned by the probe owner), dynamic source addresses, etc.¶
The primary advantage of using the in-band technique is that it covers the cases where the out-of-band technique is not feasible (as described above). The primary disadvantage is that it could potentially bias the measurements, since packets with the Probe Description URI might be discarded. For example, data is allowed in TCP segments with the SYN flag [RFC9293] but may change the way they are processed, i.e., TCP segments with the SYN flag containing the Probe Description URI might be discarded. Another example is the Probe Description URI included in a Hop-by-Hop or Destination Options header inside a PadN option. Section 2.1.9.5 of [RFC4942] (an Informational RFC) suggests that a PadN option should only contain 0s and be smaller than 8 octets, thus limiting its use for probe attribution. If a PadN option does not respect the recommendation, it is suggested that one may consider dropping such packets. For example, since version 3.5, the Linux Kernel follows these recommendations and discards such packets.¶
Having both the out-of-band and in-band techniques combined also has a big advantage, i.e., it could be used as an indirect means of "authenticating
6. Ethical Considerations
Executing measurement experiences over the global Internet obviously requires ethical consideration, which is discussed in [ANRW_PAPER], especially when unsolicited transit or destination parties are involved.¶
This document proposes a common way to identify the source and the purpose of active probing in order to reduce the potential burden on the unsolicited parties.¶
But there are other considerations to be taken into account, from the payload content (e.g., is the encoding valid?) to the transmission rate (see also [IPV6_TOPOLOGY] and [IPV4_TOPOLOGY] for some probing speed impacts). Those considerations are out of scope of this document.¶
7. Security Considerations
This document proposes simple techniques for probe attribution. It is expected that only ethical researchers would use them, which would simplify and reduce the time to identify probes across the Internet. In fact, these techniques could be used by anyone, malicious or not, which means that the information obtained cannot be blindly trusted. Using these techniques should not mean that a probe can be trusted. Instead, third parties should use this solution to potentially understand the origin and context of such probes. This solution is not perfect, but it provides a way for probe attribution, which is better than no solution at all.¶
Probe attribution is provided to identify the source and intent of specific probes, but there is no authentication possible for the inline information. Therefore, a malevolent actor could provide false information while conducting the probes or spoof them so that the action is attributed to a third party. In that case, not only would this third party be wrongly accused, but it might also be exposed to unwanted solicitations (e.g., angry emails or phone calls if the malevolent actor used someone else's email address or phone number). As a consequence, the recipient of this information cannot trust it without confirmation. If a recipient cannot confirm the information or does not wish to do so, it should treat the flows as if there were no probe attribution. Note that using probe attribution does not create a new DDoS vector since there is no expectation that third parties would automatically confirm the information obtained.¶
As the Probe Description URI is transmitted in the clear and as the Probe Description File is publicly readable, Personally Identifiable Information (PII) should not be used for an email address and a phone number; a generic or group email address and phone number should be preferred. Also, the Probe Description File could contain malicious data (e.g., links) and therefore should not be blindly trusted.¶
8. IANA Considerations
IANA has added the following URI suffix to the "Well-Known URIs" registry in accordance with [RFC8615]:¶
9. References
9.1. Normative References
- [RFC0768]
-
Postel, J., "User Datagram Protocol", STD 6, RFC 768, DOI 10
.17487 , , <https:///RFC0768 www >..rfc -editor .org /info /rfc768 - [RFC0792]
-
Postel, J., "Internet Control Message Protocol", STD 5, RFC 792, DOI 10
.17487 , , <https:///RFC0792 www >..rfc -editor .org /info /rfc792 - [RFC4443]
-
Conta, A., Deering, S., and M. Gupta, Ed., "Internet Control Message Protocol (ICMPv6) for the Internet Protocol Version 6 (IPv6) Specification", STD 89, RFC 4443, DOI 10
.17487 , , <https:///RFC4443 www >..rfc -editor .org /info /rfc4443 - [RFC8200]
-
Deering, S. and R. Hinden, "Internet Protocol, Version 6 (IPv6) Specification", STD 86, RFC 8200, DOI 10
.17487 , , <https:///RFC8200 www >..rfc -editor .org /info /rfc8200 - [RFC8615]
-
Nottingham, M., "Well-Known Uniform Resource Identifiers (URIs)", RFC 8615, DOI 10
.17487 , , <https:///RFC8615 www >..rfc -editor .org /info /rfc8615 - [RFC9116]
-
Foudil, E. and Y. Shafranovich, "A File Format to Aid in Security Vulnerability Disclosure", RFC 9116, DOI 10
.17487 , , <https:///RFC9116 www >..rfc -editor .org /info /rfc9116 - [RFC9293]
-
Eddy, W., Ed., "Transmission Control Protocol (TCP)", STD 7, RFC 9293, DOI 10
.17487 , , <https:///RFC9293 www >..rfc -editor .org /info /rfc9293
9.2. Informative References
- [ANRW_PAPER]
-
Fiebig, T., "Crisis, Ethics, Reliability & a measurement
.network - Reflections on Active Network Measurements in Academia" , DOI 10.1145 , , <https:///3606464 .3606483 pure >..mpg .de /rest /items /item _3517635 /component /file _3517636 /content - [IPV4_TOPOLOGY]
-
Beverly, R., "Yarrp'ing the Internet: Randomized High-Speed Active Topology Discovery", DOI 10
.1145 , , <http:///2987443 .2987479 www >..cmand .org /papers /yarrp -imc16 .pdf - [IPV6_TOPOLOGY]
-
Beverly, R., Durairajan, R., Plonka, D., and J. Rohrer, "In the IP of the Beholder: Strategies for Active IPv6 Topology Discovery", DOI 10
.1145 , , <http:///3278532 .3278559 www >..cmand .org /papers /beholder -imc18 .pdf - [JAMES]
-
Vyncke, É., Léas, R., and J. Iurman, "Just Another Measurement of Extension header Survivability (JAMES)", Work in Progress, Internet-Draft, draft
-vyncke , , <https://-v6ops -james -03 datatracker >..ietf .org /doc /html /draft -vyncke -v6ops -james -03 - [LARGE_SCALE]
-
Donnet, B., Raoult, P., Friedman, T., and M. Crovella, "Efficient Algorithms for Large-Scale Topology Discovery", DOI 10
.1145 , DOI 10/1071690 .1064256 .1145 , , <https:///1071690 .1064256 dl >..acm .org /doi /pdf /10 .1145 /1071690 .1064256 - [NCSC]
-
UK NCSC, "The National Cyber Security Centre", <https://
www >..ncsc .gov .uk / - [NCSC_SCAN_INFO]
-
UK NCSC, "NCSC Scanning information", <https://
www >..ncsc .gov .uk /information /ncsc -scanning -information - [RFC4942]
-
Davies, E., Krishnan, S., and P. Savola, "IPv6 Transition
/Co , RFC 4942, DOI 10-existence Security Considerations" .17487 , , <https:///RFC4942 www >..rfc -editor .org /info /rfc4942 - [RFC7872]
-
Gont, F., Linkova, J., Chown, T., and W. Liu, "Observations on the Dropping of Packets with IPv6 Extension Headers in the Real World", RFC 7872, DOI 10
.17487 , , <https:///RFC7872 www >..rfc -editor .org /info /rfc7872 - [RIPE_ATLAS]
-
RIPE Network Coordination Centre (RIPE NCC), "RIPE Atlas", <https://
atlas >..ripe .net / - [SCAPY]
-
"Scapy", <https://
scapy >..net /
Appendix A. Examples of In-Band Attribution
Here are several examples generated by [SCAPY] and displayed in the 'tcpdump' format:¶
IP packet with Probe Description URI inside a Destination Options extension header:¶
IP packet with the URI in the data payload of a TCP SYN:¶
IP echo request with another URI in the data part of the ICMP ECHO_REQUEST:¶
IPv4 echo request with a URI in the data part of the ICMP ECHO_REQUEST:¶
Acknowledgments
The authors would like to thank Alain Fiocco, Fernando Gont, Ted Hardie, Mehdi Kouhen, and Mark Townsley for helpful discussions as well as Raphael Leas for an early implementation.¶
The authors would also like to gracefully acknowledge useful reviews and comments received from Warren Kumari, Jen Linkova, Mark Nottingham, Prapanch Ramamoorthy, Tirumaleswar Reddy.K, Andrew Shaw, and Magnus Westerlund.¶