A survey on securing inter-domain routing: Part 1

By Geoff Huston on 8 Jul 2021

The Border Gateway Protocol (BGP) is the Internet’s inter-domain routing protocol. And after some thirty years of operation, it’s now one of the more venerable of the Internet’s ‘core’ protocols.

One of the major ongoing concerns related to BGP is its lack of effective security measures, and as a result, the routing infrastructure of the Internet continues to be vulnerable to various forms of attack.

In Part 1 of this study, I will look at the design of BGP, the threat model and the requirements from a security framework for BGP.

In Part 2, we will look at the various proposals to add security to the routing environment and also evaluate the effort of the Internet Engineering Task Force (IETF) to provide a standard specification of the elements of a secure BGP framework.

Part 1 – BGP: Design, threats and security requirements

The Internet is a decentralized collection of interconnected component networks (Autonomous Systems, ASes). These networks are composed of end hosts (who originate and/or receive IP packets and are identified by IP addresses) and active forwarding elements (routers) whose role is to direct IP packets as they pass through the network.

The routing system is responsible for propagating the relative location of IP addresses to each routing element so that routers can make consistent and optimal routing decisions to pass a packet from its source to its destination. Routing protocols are used to perform this information propagation.

The Internet’s routing system is divided into a two-level hierarchy. One level is intra-domain routing, used by the set of autonomous routing systems operating within each component network. The other level is a single inter-domain routing system that maintains the inter-network connectivity information that straddles these component networks.

As a single inter-domain routing protocol, the BGP (RFC 4271) has provided interdomain routing services for the Internet’s disparate component networks since the late 1980s. Given the central role of routing in the operation of the Internet, BGP is one of the critical protocols that provide essential coherence to the Internet.

BGP’s underlying distributed distance-vector computations rely heavily on informal trust models associated with information propagation to produce reliable and correct results. It can be likened to a hearsay network — information is flooded across a network as a series of point-to-point exchanges, with the information being incrementally modified each time it is exchanged between BGP speakers.

The design of BGP was undertaken in the relatively homogeneous and mutually trusting environment of the early Internet. Consequently, its approach to information exchange was not primarily designed for robustness in the face of various forms of negotiated trust or overt hostility on the part of some routing actors.

Hostile actors are a fact of life in today’s Internet. It’s quite reasonable to characterize today’s Internet environment as one where trust must be explicitly negotiated rather than assumed by default. This environment is no longer consistent with the inter-domain trust framework originally assumed by BGP. BGP’s mutual trust model involves no explicit presentation of credentials, no propagation of instruments of authority, nor any reliable means of verifying the authenticity of the information being propagated through the routing system. Hostile actors can attack the network by exploiting this trust model in inter-domain routing to their own ends.

An attacker can easily transform routing information in ways that are extremely difficult for any third party to detect. For example, false routing information may be injected, valid routing information removed, or information altered to cause traffic redirection. This approach can be used to prevent the correct operation of applications, to conduct fraudulent activities, and disrupt the operation of the part (or even all) of the network in various ways. The consequences range from relatively inconsequential (minor degradation of application performance due to sub-optimal forwarding paths) to catastrophic (major disruption to connectivity and comprehensive loss of any form of cohesive Internet).

Resisting this subversion of integrity of routing information requires that each BGP speaker has:

Sufficient information at hand to verify the authenticity and completeness of the information being provided to it via the inter-domain routing system
The ability to generate authoritative information such that other BGP speakers may verify the authenticity of information that this speaker is passing into the inter-domain routing system

A key question is whether further information can be added into the inter-domain routing environment such that attempts to pervert, remove, or withhold routing information may be readily and reliably detected. Any proposed scheme must also be evaluated for its impact on the scaling properties of BGP.

To ground any such evaluation of BGP, it’s useful to briefly review the design of the BGP protocol.

The design of BGP

BGP has undergone several refinements over its early operational life. BGP was originally described in RFC 1105 in June 1989, allowing the Internet’s inter-domain architecture to move on from a constrained architecture of a ‘core’ and attached ‘stub’ domains into a framework of peer routing domains without any central ‘core’.

A refinement to this protocol, BGP-2, was described in RFC 1163 in June 1990, and further refinement, BGP-3, was described in RFC 1267 in October 1991. The current version, BGP-4, was deployed in the Internet in 1993; the RFC describing it, RFC 1771, was published in March 1995 and was subsequently refined with the publication of RFC 4271 in January 2006. The core protocol has been stable for some years now, although further refinement has been undertaken through the use of negotiated capabilities undertaken at the BGP session startup.

BGP is an instance of what we commonly refer to today as a Bellman-Ford distance vector routing algorithm. This algorithm allows a collection of connected devices (BGP speakers) to each learn the relative topology of the connecting network. The basic approach of this algorithm is very simple: each BGP speaker tells all its other neighbors about what it has learned if the newly learned information alters the local view of the network. This is a lot like a social rumour network, where every individual who hears a new rumour immediately informs all their friends.

In the case of BGP, each time a neighbor informs a BGP speaker about reachability to an IP address prefix, the BGP speaker compares this new reachability information against its stored knowledge that was gained from previous announcements from other neighbors. If this new information provides a ‘better’ path to the prefix, then the local speaker moves this prefix and associated next-hop forwarding decision to the local forwarding table and informs all its immediate neighbors of a new path to a prefix, implicitly citing itself as the next hop.

BGP keeps a track of the propagation of route advertisements across the inter-domain space by recording the sequence of networks (ASes) that propagate the route in a route attribute called the ‘AS Path’. A ‘better’ route is one with a shorter AS Path and a loop is detected when a BGP speaker sees its own AS in the received AS Path (Figure 1).

Diagram showing the propagation of a route in BGP. — Figure 1 – The propagation of a route in BGP.

In addition, there is a withdrawal mechanism, where a BGP speaker determines that it no longer has a viable path to a given prefix, in which case it announces a ‘withdrawal’ to all its neighbors. When a BGP speaker receives a withdrawal, it stores the withdrawal against this neighbor. If the withdrawn neighbor happened to be the currently preferred next hop for this prefix, then the BGP speaker will examine its per-neighbor data sets to determine which stored announcement represents the best path from those that are still remaining. If it can find such an alternative path, it will copy this into its local forwarding table and announce this new preferred path to all its BGP neighbors. If there is no such alternative path, it will announce a withdrawal to its neighbors, indicating that it no longer can reach this prefix.

Across the deployment lifetime of BGP-4, the IPv4 Internet has grown from an average of 20,000 distinct routing entries in 1993 to almost 1 million routing entries in 2021. The growth of the size of the Internet’s IPv4 routing table over time is shown in Figure 2.

Graph showing Internet IPv4 routing table size. — *Figure 2 – Internet IPv4 routing table size.*

Read: BGP in 2020

BGP and TCP

BGP is not a link-level topology maintenance protocol. BGP assumes the existence of a relatively robust IP forwarding environment at the link level between BGP peers. This has allowed BGP to use the IP transport protocol, TCP, to support the protocol’s transactions across a BGP peer session.

TCP manages reliable message delivery and flow control between BGP peers and allows BGP to operate across end-to-end connections whether they reside on the same subnet, or across the Internet. There is no requirement for BGP speakers to be connected on a common media connection, and the choice of TCP allows this flexibility of connectivity by requiring only that a BGP peering session is supported by an IP network.

The TCP stream is divided into messages using BGP-defined markers, where each message is between 19 and 4,096 octets in length, extensible to 65,535 octets. The use of a reliable transport service implies that BGP itself need not explicitly confirm receipt of protocol messages. This removes much of the protocol overhead seen in other routing protocols that sit directly on top of a media-level connection. There are no message identifiers, no message number initiation protocol, no explicit acknowledgement of messages nor any provision to manage lost, reordered or duplicated messages. All this is handled by TCP.

The use of a reliable transport protocol also obviates the need for BGP to periodically refresh the routing state by automatically re-flooding the entire routing information set between BGP speakers. After the initial exchange of routing information, a pair of BGP routers exchanges only incremental changes to routing information.

BGP messages

As TCP is a stream protocol rather than a record-oriented protocol, BGP uses record marking within the TCP stream to delineate logical protocol units or messages with a 16-byte marker as the BGP message delimiter. The marker is followed by a 2-byte length and a 1-byte type field, making the minimum BGP message size 19 bytes. The repertoire of defined messages are:

An OPEN message to start a BGP session
An UPDATE message to exchange reachability information
A NOTIFICATION message, which is used to convey a reason code before termination of the BGP session
A KEEPALIVE message, used to confirm the continued availability of the BGP peer
A ROUTE-REFRESH request message to request a resend of the routing information

The common format of BGP messages is shown in Figure 3.

Diagram showing BGP common header message format. — *Figure 3 – BGP common header message format.*

BGP uses an explicit OPEN message to commence a BGP peering session. This message exchange confirms the identity of the BGP speakers and includes the option for a capability negotiation to understand what optional or extended capabilities are supported by each BGP speaker. A session is active only when both BGP speakers have sent their OPEN messages and neither has rejected the other offered capabilities through a NOTIFICATION response.

Once the session is active, BGP operates via the exchange of UPDATE messages. Each UPDATE message contains a set of unreachable address prefixes (withdrawals), followed by a set of common route object attributes, and a set of address prefixes that share this set of attributes (announcements).

The withdrawn prefixes are those prefixes where the local BGP speaker sees no reachability and now wants to withdraw a previous advertisement of reachability. No routing attributes are associated with these withdrawn prefixes.

The announced prefixes are those prefixes where the local BGP instance has an updated view of the reachability of a prefix that was previously withdrawn or unannounced or has an updated view of the routing attributes of the locally selected ‘best’ route for a prefix.

BGP may group multiple prefixes together in a single UPDATE message but can only do so if all the updated prefixes share a common set of attributes. Within an UPDATE message the withdrawn prefix set, or the announced prefix set, may be empty, but not both. The layout of the BGP UPDATE message is shown in Figure 4.

Diagram showing BGP UPDATE message format. — *Figure 4 – BGP UPDATE message format.*

AS Path attribute

BGP binds together the concept of network address blocks and ASes into a path vector-based routing technology. Every route object represented within a BGP-4 route database contains an address prefix and an associated path vector of AS values.

BGP does not indicate the precise path a packet should follow within an AS, nor does it maintain a complete map of the topology of the Internet at a link-by-link level. Instead, it uses a level of abstraction that views the Internet as a set of per-AS routing domains. As such, the role of BGP is to maintain a routing map of the network at this AS level, associating every reachable address prefix with an AS transit path from the current location to the address prefix’s originating AS.

One of the most important route object attributes in BGP is the AS Path attribute of UPDATE messages that contain announced routes. As address prefix reachability information traverses the Internet in the form of individual route objects in BGP, this BGP routing information is augmented by the list of ASes that have processed this route information thus far, forming the AS Path attribute of a route object. Each BGP speaker adds its own AS value to the route object’s AS Path attribute when passing the route object through an eBGP session.

This AS Path attribute allows straightforward suppression of the looping of routing information, using the simple algorithm that a local AS will reject any forwarded route object that already contains its own AS in the AS Path attribute. Also, the length of the AS Path vector forms the BGP route metric. A local BGP system, when attempting to select one from several potential route objects that refer to the same address prefix, will, in the absence of any local policy directive, prefer the route object with the shortest AS Path length.

In addition to undertaking the role of path metric and loop detector, the AS Path attribute serves as a versatile mechanism for policy-based routing, where a local AS can alter the default preferences for route selection based on local policy settings coupled with pattern matching rules to be performed on the AS Path.

Withdrawals have no associated AS Path.

BGP route selection process and routing policies

A BGP speaker may receive two or more announcements for the same address prefix from different peers. The ‘best’ announcement is selected as the locally used announcement, and this announcement is the one that is announced to its BGP peers.

BGP defines an ordered sequence of comparisons to determine which route object is selected by the local BGP speaker as the preferred route to use. The sequence follows:

Prefer the route object with the highest value for LOCALPREF attribute value
Prefer the route object shortest AS-PATH attribute length
Prefer the lowest origin value
Prefer the lowest MULTI EXIT DISCRIMINATOR attribute value
Prefer the minimum IGP cost to the NEXT HOP address given in the route object
Prefer eBGP over iBGP-learned routes
If using iBGP, Prefer the lowest BGP Identifier value.

Although network administrators usually employ routing policies depending on their needs, within the generic BGP route selection process the highest priority selection rule is that a route for a more specific address prefix is to be preferred over that of a covering prefix.

The BGP threat model

One approach to providing a taxonomy for threats in routing in general, and BGP in particular, is to view a BGP peer session as a conversation between two BGP speakers and pose several questions relating to this conversation. These questions are:

How do we talk?

The manner in which the BGP session between the BGP speakers is secured such that the conversation is not altered, disrupted or hijacked and is protected from unauthorized eavesdropping.

Who am I talking to?

Verifying the identity of the other party and verifying they are authorized to speak for the routing entity they purport to represent.

What are you saying?

Verifying the authenticity and completeness of the routing information being passed in the BGP session.

Why should I believe you?

Verifying the routing information represents the current state of the forwarding system.

How recent is your information and is it still valid?

Verifying for how long routing information is valid and whether the information is still current.

Each of these security questions can be further deconstructed to a set of specific objectives, as well as recognizing a set of specific threats.

Securing a BGP session

A BGP session between two BGP speakers is assumed to have some level of integrity at the session transport level.

BGP assumes the messages sent by one party are precisely the same as received by the other party, and assumes the messages have not been altered, reordered, have spurious messages added into the stream or have messages removed from the conversation stream in any way.

Given that BGP uses a TCP transport session, some of these assumptions are reasonable, but others less so. As with any long-held TCP session, a BGP peer session is vulnerable to eavesdropping, spurious session reset, session capture, message alternation and denial of service attacks, all via what we might think of as conventional TCP attack vectors. The threat at the BGP level is that a third party may attempt to break into the TCP session as an interception attack in the middle, and thereby alter the BGP message flow between the two endpoints.

One form of threat is by injection, where the attacker injects spurious messages into the BGP session. Direct on-the-wire interception allows the attacker to know the TCP sequence numbers, thereby making injection a trivial task. Even if the attacker is not able to intercept or eavesdrop on the BGP session, it is still possible to attempt to guess the current sequence number. While this is often impractical in the case of injecting data into the session, if all that is to be injected is a TCP reset, then the sequence number guess only has to sit within the current TCP window to be recognized as a valid reset TCP message.

Another form of threat is by active intermediation where the attacker sits on the connection between the two BGP speakers and intercepts all traffic in both directions. In this case, the attacker has complete control of the BGP message stream and can perform any form of message alteration. A variation of this form of threat is by session hijacking, where the third party intrudes upon an active BGP session and injects its own traffic into the message stream that allows the third party to take over the session and masquerade as one of the parties to the BGP session.

As timing is important in the overall performance of BGP another form of attack at the session-level is to delay messages. While the content of the messages is unaltered, the implicit timing signals within the message stream are altered by this form of intervention, potentially causing the local BGP speaker to behave differently and fall out of sync with its routing peers.

Another form of attack is a replay attack, where older BGP messages are replayed into a hijacked TCP session. One form of this replay attack could be to replay a pair of messages that withdraw and then announce the same address prefix.

Route Flap Damping (RFD) (RFC 2439, RIPE 580) is a widespread defensive BGP configuration that monitors the frequency of BGP updates for a given prefix from each peer, and if the update rate exceeds a locally set threshold, the peer’s advertisement of this prefix will be locally suppressed for a damping interval. The replay of updates could be used to trigger an RFD response in the remote BGP speaker. If a route is fully dampened through RFD, updates for this prefix will not be advertised by the BGP speaker for a damping interval (commonly 60 minutes), possibly causing a route to be disrupted within that time frame. Another form of replay attack is to replay a route advertisement for a previously withdrawn prefix, possibly in conjunction with some form of prefix hijack attack.

Another form of threat is by withholding traffic. BGP uses ‘keep-alive timers’ to determine remote end ‘liveness’. By intercepting and withholding all messages for the hold-down timer interval, a third party can force the BGP session to be terminated and reset. This causes the entire route set to be re-advertised upon session resumption so that repeated attacks of this form can be an effective form of denial of service for BGP.

It is also possible to undertake a saturation attack on a BGP speaker by sending it a rapid stream of invalid TCP packets. In this case, the processing capability of the BGP speaker is put under pressure, and the objective of the attack is to overwhelm the BGP speaker and cause the BGP session to fail and be reset. This is particularly problematic if the BGP session uses MD5 or IPSEC as session protection protocols, as the cryptographic function overhead also applies to the injected packets, increasing the processing overhead on these spurious injected packets.

The underlying aspect of the BGP protocol is that BGP itself has no enforced minimum level of message protection. BGP messages are, by default, placed into the TCP stream without encryption or additional message wrapping of message sequencing. Any threat that applies to long-held TCP sessions, applies to this default mode of BGP operation.

Verifying BGP identity

BGP sessions commence by passing the local AS to the remote end of the session in the BGP OPEN message and receiving the remote end’s AS in the received OPEN message. BGP itself does not verify these asserted AS identities, and it is theoretically possible for a remote party to masquerade itself as another AS and assert an identity in BGP that cannot be directly verified by the other party, or by any third party that subsequently receives this routing information.

Most BGP implementations provide a level of protection against this threat. They do so by applying a constraint that the local BGP speaker will only initiate a peer session with a configured remote IP address, and reject all other TCP connection attempts. Furthermore, the BGP speaker will not complete the BGP OPEN message exchange if the AS in the OPEN message does not match the AS number associated with the remote end IP address in the configuration.

This approach places a heavy reliance on the out-of-band process of BGP configuration, and if an attacker can compromise or take control over BGP equipment connected to the Internet or use social engineering to convince a network administrator to configure incorrect information into the BGP configuration, then it is possible to masquerade as a different party in BGP and potentially inject incorrect information into the routing system.

The real question here is: ‘Are you really who you claim to be?’ Here the BGP speaker must be able to confirm the validity of the peer’s claim to be speaking for an AS.

Verifying BGP information

The objective here is that of verifying the authenticity and completeness of the routing information being passed in the BGP session.

The intention of BGP is that a local BGP speaker provides to all its BGP peers a complete feed of its locally selected route objects. Once a session is opened with a remote BGP speaker, the local BGP instance believes everything it is told without further qualification.

The threat is that a BGP peer can deliberately feed false information to the local BGP instance, which BGP itself will be unable to detect as false. This could be in the form of suppression of routing information, or in the form of alteration of the route object that is being passed, or the invention of spurious route objects. The BGP speaker could be asserting that an AS Path is genuine when it reflects an artificial path, or that it has the authority to originate an advertisement for a prefix when, in fact, no such authority exists.

A BGP speaker may preserve all the attributes of a route object, but alter the prefix set to be the equivalent collection of more specific prefixes. The deliberate alteration of routing information can cause the local BGP instance to make an incorrect choice of a local best path and also cause the local BGP instance to propagate this incorrect information to its neighbors.

Not only could the BGP speaker be passing incorrect attributes for an address prefix to bias the local route selection process, but it could also be providing incorrect information regarding the prefix itself. The prefix that is the subject of the route object could be a prefix that has never been allocated and should not be legitimately routed, or the prefix is an aggregate address prefix that spans both allocated and unallocated address space.

Prefix hijacking is a major threat to the integrity of the BGP routing. The fundamental weakness here is that BGP provides no explicit means of verifying the authenticity of the address prefixes that are listed in a BGP UPDATE message, nor the authenticity of the attributes of the prefix, including the origination information and the AS Path vector. The threat here is that by deliberately altering this information the local BGP speaker can be induced to make incorrect route selection decisions and thereby make incorrect forwarding decisions for IP traffic.

A known common problem illustrative of exploiting this vulnerability is operational misconfiguration, which could result in propagating more specific routes, and other forms of route leakage or withholding that may impact on the routing decisions made by other BGP speakers. This form of verification of intentionality by a remote BGP speaker is far more challenging. While these forms of security mechanisms are intended to verify that the received information matches the original information that was passed into the routing system, they are incapable of verifying that such information is consistent with the true intent of the originator of the information.

Verifying forwarding paths

The overall intention of the BGP protocol is to distribute the current binding of an address to a location such that individual routers can make accurate judgements about how to populate their local forwarding tables and hence make optimal local decisions for each packet that passes along the shortest path to its ultimate destination.

BGP does not provide any ability for a local BGP speaker to validate that the route advertisements it receives from a BGP peer accurately represents the current state of the network’s forwarding system. The threat model here is that a bad actor in the routing system may make a different forwarding decision to that being advertised in the routing system.

This can represent a subversion of local policies, theft of carriage capacity, deliberate denial of service, the potential to eavesdrop on a conversation or to support the interception and alteration of application-level transactions. Even a completely secured control plane does not avert such vulnerabilities.

The consequences of attacks on the routing system

The ability to alter the routing system provides a broad array of potential consequences, which fall into several broad categories, as briefly described below.

The ability to eavesdrop. The forwarding system can be altered to pass all traffic to a class of destination addresses via a certain path. This allows the attacker to attempt to pass all such traffic through an eavesdropping location before conventional delivery. In such a case, the parties may not be aware that an eavesdropping attack is taking place.
Denial of service. The simplest form of a denial of service is where traffic to an address prefix is passed to a point where it is then discarded. Routing loops also are a form of denial of service, where not only will the traffic to a destination address prefix never reach its intended destination but the traffic will be held in the loop for the life of the packet TTL field. For sufficiently short loops the potential exists for the loop to act as a link load amplifier, where the traffic on the loop is several times the traffic load being addressed to the affected destination address prefix.
The potential to masquerade. Subversion of routing allows sites to masquerade as other sites, whereby the routing system will misdirect the traffic to the masquerading site. The consequences of such an attack can vary from the specific, where a particular site is targeted, to the more generic, where authoritative DNS servers are the subject of the masquerading attack and the DNS responses are believed as being authentic. In the latter case, if the masquerading occurs at the level of the root of the DNS hierarchy, incorrect information can be provided to any query, allowing for the attack to then be extended to any site.
The ability to steal addresses and obscure identities. Routing an unallocated address is subtly different to routing an already allocated address. Here the consequence is not the displacement of traffic forwarding to incorrect locations in the network, but the assertion of the existence of addresses and forwarding paths to those addresses that should not exist in the network in the first place. The consequence is the ability to use addresses on the network that have no allocation registration information associated with them, allowing the originator of the routing attack some degree of ease to mount an anonymous attack at the application level. Such forms of attack have been observed to be associated with SPAM and botnet controllers where the anonymity of the attack coordinator is desired.

Security requirements

The primary requirements for securing BGP are securing the transmission of the data payload of the BGP protocol and securing the semantics of that payload.

The security requirements for transmission are such that the data received by a BGP speaker can be cryptographically verified to have been sent by the BGP peer, the data is not a replay of previously transmitted data, and no data has been removed from the transmission.

There is no strict requirement for encryption of the BGP payload, as the routing information being exchanged is not intrinsically confidential to the two parties involved. The security requirements for the semantics of the payload concern specifically some selected fields (transitive attributes) of the BGP UPDATE message. The BGP speaker must be able to verify that the advertised prefix is valid and that the originating AS has been duly authorized by the legitimate right-of-use holder for that prefix. The BGP speaker should also be able to validate that the AS Path in the UPDATE represents a valid inter-AS transit path through the network in terms of inter-AS topology and AS transit policies and that the prefix reachability information has been propagated along the reverse inter-AS Path.

It is noted that route withdrawals and non-transitive announcement attributes are local in nature and therefore do not need to be transitively protected similarly to route origination and the AS Path attribute of announcements. Withdrawals and local attributes can be adequately protected by BGP peer session protection.

The associated requirements for a secure inter-domain routing system include the additional use of security credentials, and verification of routing information should not alter the temporal properties of the BGP protocol. Also, authentication of the security credentials should occur in the same time frame as the BGP message processing operation. It is also a requirement that piecemeal incremental deployment should be feasible. A secure operational mode should be a capability negotiation with each BGP peer, with the ability to support backward compatibility with those BGP peers that do not recognize such a capability. It seems to be a good idea to start deploying BGP security on the most connected nodes and incrementally deploy it towards the least connected nodes.

Additionally, it puts the question on how a party that uses security credentials deals with information arriving from a peer that does not use any security credentials. Having no security credentials does not necessarily mean the information is wrong. Importantly, in these piecemeal deployment scenarios, there should be some incremental benefit of piecemeal deployment to those actors who choose to supply such security credentials and to those who choose to validate routing information using these credentials.

A routing system, secure or otherwise, should never make route selections that include routing loops. It is preferred that in a fully secured environment, a secure routing system would be able to converge on best paths that are either identical to or no worse than an unsecured BGP speaker would select, assuming that such paths can be validated in a secure environment. In an environment of partial adoption of secure routing systems, it is recognized that a BGP speaker may use local preference settings that prefer sub-optimal paths that have preferred security credentials over unsecured paths.

The trust model of routing appears to involve two forms of trust. The first is a trust environment related to the public network and the legitimacy of the use of a public address and public AS number. It is necessary to be able to verify that a particular party has the right to use these number resources in a public context. The closest fit in the form of a trust model for verification of this assertion of the right of use is a public authority that can provide authoritative information on the distribution of these numbers. This approach leads to a rooted hierarchy model of trust, where the trust anchor is this public authority.

The second form is a trust environment in private contexts, where the use of an address or AS number is bounded by a specific context of use, and the trust in an assertion of a right of use is one made in the context of this bounded environment. In this environment, there is no clear ability to use public authorities as a trust anchor, and other means of trust that may involve reputation or web of trust concepts may be appropriate.

A general security approach to BGP should be able to encompass that diversity of deployment environments and the corresponding diversity of authority models.

Tools for securing BGP

The vulnerabilities of BGP arise from four fundamental weaknesses in the BGP and the inter-domain routing environment. These are:

No mechanism to protect the integrity, currency and source authenticity of BGP messages.
No mechanism to verify the authenticity of an address prefix and an AS origination of this prefix in the routing system.
No mechanism to verify the authenticity of the attributes of a BGP UPDATE message.
No mechanism to verify that the local cache RIB information is consistent with the current state of the forwarding table.

The other observation about BGP security is it appears the most straightforward form of attack is to obtain control and configuration access to a deployed router and use this compromised platform as the base for launching attacks on the routing system. In the face of such an encompassing attack on the control instruments of the routing system, BGP session-level security needs to be placed in some perspective. It is not possible to prevent routers from attempting to generate false information as long as routers themselves are in a position to be compromised.

The consequent vulnerability on the routing system, as distinct from a narrower view of BGP, is that no mechanism limits the extent to which a misbehaving routing element can make inaccurate claims about reachability in the routing system.

The security toolset for BGP session protection

The available tools for securing BGP start at the level of the BGP TCP session and encompass the tools that are used to protect TCP and the two ends of the TCP session.

The TCP protection mechanisms include the generalized TTL security mechanism (RFC 3682, RFC 5082), which is intended to limit the effective radius of a potential attack on the session to hosts that lie on or within the worst-case hop-count radius between the two BGP speakers, and host-level defences against TCP SYN attacks (RFC 4987). In many ways, this is an effective form of defence when using multi-hop BGP sessions, in that the attacker cannot subvert the defence. However, it still leaves the session vulnerable to any attacker that lies within the TTL radius.

Greater levels of session protection can be obtained by using cryptographic protection. Over time the IETF has worked on three approaches to protect the BGP TCP through cryptographic protection. These are:

The use of IPSEC. IPSEC has not been widely used for BGP sessions, and the reasons behind this relate to the complications for rekeying IKE/IPSEC sessions and the potential DDOS vector.
The TCP MD5 signature option. While the MD5 signature option has some potential weaknesses when compared with IPSEC, MD5 is considered preferable to no form of TCP protection at all, particularly for the TCP reset injection attack. However, there are issues with re-keying a long-held session, and the BGP speakers probably need to use graceful restart mechanisms in conjunction with MD5 to perform a re-key of the session.
The TCP Authentication Option, which the IETF has marked as a replacement to the earlier MD5 approach. The TCP Authentication Option supports stronger crypto algorithms compared to MD5. It uses a two-fold security approach that reduces the critical reliance on a user-configured key. This approach also allows the configuration of up to 64 keys for a session and provides a simple key coordination mechanism by giving the ability to change keys (move from one key to another) within the same connection without causing any TCP connection closure. By comparison, changing TCP MD5 keys during an established connection might cause a flap or restart in the connection, which in the content of BGP, may have operational implications.

From time to time the topic of BGP over Transport Level Security (TLS) is raised and it’s possible that sooner or later we might hear of BGP over QUIC. The salient question is one of balancing the additional burden of adding more transport choices to BGP implementations to the likely benefits that these additional choices may provide. As we’ve seen in the IPv6 transition and more recently in the increasing diversity of choices for encrypting DNS transactions adding more options can just add confusion and impede adoption instead of accelerating it.

However, the most important guideline in securing BGP sessions is to use multi-hop BGP and multi-access LAN sessions sparingly and prefer to use a direct 1:1 channel connection when such a choice is available.

The security toolset for BGP message protection – The number Resource Public Key Infrastructure

In addition to message integrity protection provided by transparent session-level protection mechanisms, the tools to protect the integrity of BGP messages relate to the use of digital signatures to provide a set of credentials that allow relying parties to verify the correctness of the information carried as the message payload in BGP.

The reason for the use of digital signatures, as opposed to an integrity check using some form of a shared secret, is due to the observation that the number and identities of all eventual recipients of the information are not known in advance, and non-repudiation is desirable. Verification of the contents of a message is not only a test of whether the message has been altered in any way during its transit between BGP speakers but a test of whether the message represents correct origination information and correct operation of the processing of the message during the process of message propagation (authenticity).

This requirement implies a need to establish a means of verification of information where the author of any security credentials relating to origination and propagation are not necessarily known to the relying party that is attempting to validate the information. This typically invokes a form of validation that relies upon third party transitive trust, where the relying party is attempting to build a testable chain of trust between its trust anchor and the party or action that is the subject of the verification operation.

Conventionally, this requirement implies the use of some form of Public Key Infrastructure (PKI). In this case, we are not looking to use such a PKI to validate claims of identity, authority to perform a particular function, or some form of verifiable attribution. We need some form of mechanism to associate a public key with an IP address prefix or an AS number in a sense of functional control, where the certification authorities in this PKI are attesting that the certified subject has functional control of a collection of IP number resources (AS numbers and IP address prefixes). The associated certificate issuance practices are intended to support transitive trust in such attestations of association.

We have adopted a structure using X.509 public key certificates and a certificate extension that uses a canonical list of IP address resources and AS numbers as the foundation for this Number Resource PKI (RPKI). Verification of a digital signature entails a test of the authenticity and current validity of the associated certificate that describes the public key of the address or AS number holder in the context of a structured set of signed relationships between certificate issuers and subjects. To put it another way, the holder of the matching private key is the current functional controller of those IP addresses and AS numbers and can digitally sign authorities and attestations about such number resources based on that functional control.

Given that the discourse of BGP messages is about address prefixes and AS numbers, the RPKI provides a solid foundation for digital signatures to be associated with various routing actions that are described in BGP messages. It does not attest in any way to the identity of these number resource holders.

Anchoring the model of authority and trust in the RPKI certificate structure has resulted in a framework where the issued certificates are aligned with the IP address and AS number allocation and assignment framework. If an Internet registry has issued a set of IP addresses and AS numbers to an entity, then this registry would be able to publish a public key certificate that associates a private key provided by the entity with the IP resource set. Further allocations from a registry to a registry address holder would result in reissuance of the certificate for the address holder with a larger resource set, while the reduction in this set would result in both reissuance and revocation of the previous certificate. This certificate framework would allow auditing of the certificate state by inspecting the registry contents of the Internet registries, as this PKI intends to mirror the overall state of the number registries with the set of issued certificates.

The RPKI is different from many other PKIs as the requirements related to adding digital signatures to the routing domain is different from many other PKI deployment environments. The common question that PKI’s attempt to answer is: ‘Is this data authentic?’

The data is signed with a digital signature, and the key used to generate that signature is described in a certificate. The validity of that certificate can be ascertained through the use of a collection of certificates and certificate revocation lists (CRLs), such that a relying party may validate the data by using their local trust anchor(s) and constructing a validation path of issuer-subject chained certificates from a trust point to the digital signature. If this collection of certificates is bundled with the digital signature and the data itself, then the only data item that needs to be distributed outside the data flow is the PKI trust anchors.

Distributing the RPKI data collection

When the RPKI is combined with a use case for the routing domain, we are looking at a design space that is somewhat atypical in the PKI world. For example, in the WebPKI, the certificates that are passed between server and client in the initial exchange of a TLS session are certificates that related to the particular domain name used in the TLS session being established.

The critical distinction here between the secure client/server transactions using the WebPKI and the promulgation of routing information in the routing system using RPKI is that the routing system presents the entire routing domain to each relying party on a continuous basis. Each relying party needs to have continual access to the entire RPKI certificate and CRL collection, rather than the TLS practice of processing individual signatures and certificates on an as-needed basis.

This requirement for all participating entities to have access to all the RPKI data at all times poses a design challenge in how to manage this RPKI and use it in a routing protocol such as BGP.

A basic approach here is for each Internet registry to publish their certificate products in their own publication point. This is analogous to the pre-CDN model of web content publication, where each element is independently published. Of course, in this case, while publication is easy, the onus is shifted to the relying party client or BGP validator, which has to assemble a local cache of all RPKI signed data. It becomes the task of clients of the RPKI to maintain a local cache of the entire RPKI by continuously sweeping across these publication points looking for, and retrieving, changes and validating all such signed objects as they are received.

At this point, BGP updates could be passed to this local RPKI engine and the data in the update can be compared against the validated information contained in the local RPKI cache. If RPKI validation was performed at the point of acceptance into the local cache (that is, discarding all RPKI products that cannot be validated within the framework of the RPKI validation procedures), then the route information could be verified against the assembled (and validated) crypto data without a high on-demand crypto processing overhead.

An alternative approach is expressing the validation outcomes from the local RPKI cache as a filter list. If this were maintained on a router then the overheads in passing route objects through such a filter would be little different to the many other routing policy maps used in operational configurations.

The drawback in this distributed approach is the need for these clients to constantly sweep all the RPKI publication points to ensure that their local cache is up to date. What is ‘up to date’ is relative here, but it is worth remembering that the average time to propagate a BGP update across the global Internet depends on the average AS Path length (around four to five ASes on average at present) and the interaction with BGP’s MRAI timers. While the worst case would be 300 seconds (assuming that the full MRAI delay would be applied on each eBGP session), the fastest case is well under a second.

So how quickly should the local cache be populated to keep up with the propagation of routing information in BGP?

Before leaping to a target time it’s also worth remembering the scaling question. With around 100,000 distinct ASes in the Internet’s routing system, today’s worst-case scenario is some 100,000 RPKI clients performing a sweep across 100,000 distinct RPKI publication points every few seconds (or even more frequently if the RPKI system is intended to be highly responsive).

In some ways, this is putting the load on the wrong side of the information distribution process. By making the relatively infrequent publication process one that involves a local action without any associated notification of a change then the burden is shifted to the client set, which has to poll every publication point continuously just to ascertain if anything has changed. To put it as plainly as possible, this particular information distribution design is completely broken! If the client set is known in advance (such as is the case in the DNS in synchronizing the information across primary and secondary authoritative services) one could use notification mechanisms. But in the case of the RPKI system, the publishers of authoritative information have no information as to who the clients are that need to be notified of a change in a CA’s part of the RPKI data collection. Hence, notification is not a viable option in this framework.

These relatively formidable scaling issues can be mitigated by changing the publication behaviour, in a manner analogous to how CDNs have improved web performance by shifting content publication models to various permutations of anycast-related models of content replication. In the context of the RPKI, this could entail the use of a smaller set of RPKI publication points that are shared by many RPKI certificate issuers, or the reduction in the number of independent Certification Authorities who each publish their own products through the extensive use of Registration Agents (RAs). The information being published is signed, there is no particular benefit to retrieve the data from any particular publication point. As long as the data can be validated by the client, then the client can be assured that the data that they have retrieved is most likely to be genuine, irrespective of the location used for the retrieval.

It is possible to use third-party aggregators in such a role, which would take on the task of continuous monitoring of all RPKI CA publication points and publish an aggregated data set of all current RPKI data. This could be taken further into a push model by having clients register their interest in updates from the intermediary and allow the intermediary to send them information updates as they are received from the primary CA publication point sources. Again, it has to be noted that the information is signed, so the potential of the intermediary to alter the RPKI information is limited. The design gap in such mediated distribution approaches is to provide a mechanism for clients of these aggregated intermediaries to be assured that the collection being provided by the intermediary is the entire collection of RPKI data, and any credible intermediary approach would need to explicitly address this issue of information completeness.

However, while these approaches reduce the load imposed on clients of the RPKI by increasing the load on information publication, such aggregated publication models also create critical points of concentration of routing data, and a sustained denial of service attack against such aggregate publication points could have a major impact on the routing system as the local RPKI caches lose currency and coherency.

These approaches have their own strengths and risks. Highly distributed publication models impose undue costs on clients as the clients need to maintain an aggregate and current data collection in their local cache. Aggregate data publishing models relieve the load from clients but have some unresolved issues in terms of assured completeness of the aggregated data collection, and also run the risk of creating new points of vulnerability in terms of the consequence of denial-of-service attacks launched against these aggregated publication points.

The current RPKI operational framework that is used in the Route Origination Validation (ROV) tool uses this approach of an out of band RPKI pull system together with some use of aggregated RPKI publication points. An RPKI client’s local cache currency performance level is phrased in units of minutes rather than seconds, and the overall system operates at a level of coherency that is at a time scale of hours rather than minutes.

The initial design of this RPKI distribution system is for each client to operate autonomously and maintains a local cache to keep synchronized with the current state of all the RPKI publication points using the rsync protocol together with the concept of a manifest that allows a client to ensure that they have retrieved the entirety of the data available at each RPKI publication point. The rsync protocol was subsequently found to be a poor choice for this role, and these days the RPKI Repository Delta Protocol (RRDP) is the preferred RPKI repository synchronization tool.

In terms of the application of the RPKI to the BGP environment, there is the obvious question to be asked here: If the intent of the flooding system is to provide a reliable and efficient way to flood current information to all clients, then why not just use BGP itself? BGP is an Internet-wide information flooding protocol using a push-based approach that is intended to ensure that all BGP speakers have a consistent and current collection of reachable route objects. If the set of clients who want to maintain an up-to-date synchronized local cache is isomorphic to the set of BGP speakers, then adding a BGP message payload type in the same manner that AFI/SAFI indicators are already used in Multi-Protocol BGP today seems only logical.

Part of the reason why the RPKI has had to reinvent this particular wheel of reliable flooding lies in the strictures imposed on the standardization effort in the IETF, where the SIDR Working Group was constrained from proposing changes to the BGP protocol itself. In retrospect, this appears to have been a rather suboptimal and, in hindsight, extremely poor piece of guidance from the Internet Engineering Steering Group at the time.

If one could contemplate changes to the BGP, then one approach to the RPKI distribution tasks is to maintain the association of the validation material with the data, and in the context of the routing environment, this would staple a collection of certificates (and CRLs) to each route object. In a sense, this would attempt to reproduce the TLS model in the BGP, where each prefix being updated would have a subset of the RPKI certificates stapled to the update that will permit an associated signed attestation to be validated within the framework of the RPKI.

This is not without additional impositions and costs that would be imposed on the operation of the BGP protocol and upon BGP speakers. Stapling crypto credentials to BGP updates would bloat both the volume of stapled data (through the use of long validation chained paths and long-term certificate issuance policies which, in turn, create extended CRL lists) and the amount of crypto processing of these stapled digital credentials. There would also be a significant level of retransmission of certificates on a pair-wise basis in such a system if the protocol was to bundle the entire RPKI validation chain data with every routing protocol update. The validation processing load would also likely be beyond the processing capabilities of most routers, and there are considerations of the maximum message size in the BGP protocol itself (which, until RFC 8654, published in October 2019, was 4,096 octets) that limited the amount of attached data that can be placed into the BGP.

None of these issues are intractable in nature, and there have been several proposals that attempt to optimize such additional loads and processing demands. We will look at some of these proposals in Part 2 of this survey.

Coming in Part 2

In Part 2, I will take these various requirements and tools and look at the various proposals that have been published for securing the BGP. I will also evaluate the current state of the effort in the IETF to standardize a secure BGP framework.

The views expressed by the authors of this blog are their own and do not necessarily reflect the views of APNIC. Please note a Code of Conduct applies to this blog.