Internet-Draft | EVPN AC-Aware Bundling Service Interface | September 2023 |
Sajassi, et al. | Expires 18 March 2024 | [Page] |
EVPN (Ethernet VPNs) provides an extensible and flexible multihoming VPN solution over an MPLS/IP network for intra-subnet connectivity among Tenant Systems and End Devices that can be physical or virtual.¶
EVPN multihoming with Integrated Routing and Bridging (IRB) is one of the common deployment scenarios. There are deployments which requires capability to have multiple subnets designated with multiple VLAN IDs in single broadcast domain.¶
EVPN technology defines three different types of service interface which serve different requirements but none of them address the requirement of supporting multiple subnets within single broadcast domain. In this draft we define new service interface type to support multiple subnets in single broadcast domain. Service interface proposed in this draft will be applicable to multihoming case only.¶
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119 [RFC2119] and RFC 8174 [RFC8174].¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."¶
This Internet-Draft will expire on 18 March 2024.¶
Copyright (c) 2023 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License.¶
EVPN based All-Active multihoming is becoming the basic building block for providing redundancy in next generation data center deployments as well as service provider access/aggregation network. For EVPN IRB mode, there are deployments which expect to be able to support multiple subnets within single broadcast domain. Each subnet would be differentiated by VLAN. Thus, single IRB interface can still serve multiple subnet.¶
Motivation behind such deployments are¶
[RFC7432] defines three types of service interface. None of them provide flexibility to achieve multiple subnets within single broadcast domain. The different types of service interface from [RFC7432] are:¶
From definition, it seems like VLAN Bundle Service Interface does provide flexibility to support multiple subnets within single broadcast domain. However, the requirement is to have multiple subnets from same ES on multihoming All-Active mode; that would not work. For example, lets take the case from Figure 1 where PE1 learns MAC of H1 on VLAN 1 (subnet S1). PE1 originates EVPN MAC route, as per [RFC7432], where the Ethernet Tag would be set to 0. Incoming packets from IRB interface, at PE2, are untagged packet. PE2 does not have any associated AC information from EVPN MAC routes advertised by PE1. PE2 can not forward traffic which is destined to H1.¶
This draft proposes an extension to existing service interface types defined in [RFC7432] and defines AC-aware Bundling service interface. AC-aware Bundling service interface would provide mechanism to have multiple subnets in single broadcast domain. This extension is applicable only for multihomed EVPN PEs.¶
EVPN topology with multihoming and non multihoming PE.¶
Figure 1 shows sample EVPN topology where PE1 and PE2 are multihomed PEs. PE3 is remote PE participating in the same EVPN instance (EVI-1). It illustrates four subnets S1, S2, S3 and S4 where numerical value provides associated VLAN information.¶
In Figure 1, BD-1 has multiple subnets where each subnet is distinguished by VLAN 1, 2 ,3 and 4. PE1 learns MAC address MAC-1 from AC associated with subnet S1. PE1 uses MAC route to advertise MAC-1 presence to PE PEs. As per [RFC7432] MAC route advertisement from PE1 does not carry any context providing information about MAC address association with AC. When PE2 receives MAC route with MAC-2 it can not determine which AC this MAC belongs to.¶
Since PE2 could not bind MAC-1 with correct AC, when it receives data traffic destined to MAC-1, it does not know the destination AC since multiple bridge ports have the same ESI assignment.¶
[RFC9251] defines mechanism to synchronize multicast routes between multihome PEs. In above case, if receiver behind S1 send IGMP membership request, CE could hash it to either of the PEs. When multicast route is originated, it does not contain any AC information. Once it reaches to PEing PE, it does not have any information about which subnet this IGMP membership request belong to. Similarly to unicast traffic problem, the incoming multicast traffic from IRB cannot be forearded to the proper AC.¶
In case of single subnet per broadcast domain, there is potential case of security issue. For example, PE1 has BD1 configured with VLAN-1 where as multihome PE PE2 has BD1 configured VLAN-2. Each of the IGMP membership requests on PE1 would be synchronized to PE2 and PE2 would process multicast routes and start forwarding multicast traffic on VLAN-2, which was not intended. Again, similar issue can potentially be seen with unicast traffic.¶
[RFC7432] section 9.1 describes different mechanism to learn Unicast MAC address locally. At those PEs where AC aware bundling is supported, MAC address is learnt along with VLAN associated with AC.¶
MAC/IP advertisement route construction follows mechanism defined in [RFC7432] section 9.2.1. An attachment circuit ID Extended Community (Section 6.1) MUST be attached to EVPN Route Type RT-2.¶
Presence of attachment circuit ID Extended Community (Section 6.1) MUST be ignored by non multihoming PEs. Remote PE (non multihome PE) MUST process MAC route as defined in [RFC7432]¶
Multihoming PE MUST process attachment circuit ID Extended Community (Section 6.1) to associate the remote MAC address to appropriate AC.¶
From Figure 1, PE2 receives MAC route for MAC-1. It MUST get attachment circuit ID from attachment circuit ID Extended Community (Section 6.1) in RT-2 and associate MAC address with specific subnet.¶
When a local multihomed PE in given broadcast domain receives IGMP membership request on local AC, it MUST synchronize multicast state by originating multicast route defined in [RFC9251]. When Service interface is AC aware it MUST attach attachment circuit ID Extended Community (Section 6.1) along with multicast route. For example in Figure 1 when H2 sends IGMP membership request for (S,G), CE hashed it to one of the PE. Lets say PE1 received IGMP membership request. PE1 MUST originate multicast route to synchronize multicast state with PE2. Multicast route MUST contain attachment circuit ID Extended Community (Section 6.1) along with multicast route.¶
PE1 MUST originate multicast route updates for any subsequent IGMP membership requests under same or different subnet attaching adequate Attachment Circuit ID Extended Community (Section 6.1).¶
If multihomed PE receives remote multicast route on broadcast domain for given ES, route MUST be programmed to correct subnet. Subnet information MUST be extracted from attachment circuit ID Extended Community. That value maps to the VLAN of a local AC where the multicast route is associated to.¶
Packet received from CE MUST follow same procedure as defined in [RFC7432] section 13.1¶
Unknown Unicast packets from a Remote PE MUST follow procedure as per [RFC7432] section 13.2.1.¶
Known unicast Received on a remote PE MUST follow procedure as per [RFC7432] section 13.2.2. In Figure 1, if PE3 receives known unicast packet for destination MAC MAC-1, it MUST follow procedure defined in [RFC7432] section 13.2.2.¶
If destination MAC lookup is performed on known unicast packet, destination MAC lookup MUST provide VLAN and local AC information. For example if PE2 receives unicast packet which is destined to MAC-1 (packet might be coming from IRB or remote PE with EVPN tunnel), destination MAC lookup on PE2 MUST provide outgoing port along with associated VLAN value.¶
Multicast traffic from CE and remote PE MUST follow procedure defined in [RFC7432]¶
Multicast traffic received from IRB interface or EVPN tunnel, route lookup would be performed based on IGMP snooping state and traffic would be forwarded to appropriate AC.¶
If there is mis-configuration of VLAN or VLAN range across multihoming PEs, same MAC address would be learnt with different VLAN per broadcast domain. In this case Error message MUST be thrown for operator to make configuration changes. Furthermore, the errored MAC route MUST be ignored.¶
This document defines one new BGP Extended Community for EVPN.¶
A new BGP Extended Community called attachment circuit ID Extended Community is introduced. This new extended community is a transitive extended community with the Type field of 0x06 (EVPN) and the Sub-Type of 0x0E. It is advertised along with EVPN MAC/IP Advertisement Route (Route Type 2) per [RFC7432] for AC-Aware Bundling Service Interface. It may also be advertised along with EVPN Multicast Route (Route Type 7 and 8) as per [RFC9251]. Generically speaking, the new extended community MUST be attached to any routes which require specific VLAN identification.¶
The attachment circuit ID Extended Community is encoded as an 8-octet value as follows:¶
0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Type=0x06 | Sub-Type=0x0E | Reserved (16 bits) | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | attachment circuit ID (32 bits) | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+¶
attachment circuit ID Extended Community¶
The attachment circuit ID plays the role of normalized VID. It is defined as per [I-D.ietf-bess-evpn-vpws-fxc].¶
The current proposal is entirely backward compabitible with [RFC7432] VLAN-aware bundling mode since the Ethernet-tag field remains intact. However, it has its own drawbacks. For instance with multicast, the same (S,G) maybe be used over different subnets. In that case, the same route MUST carry multiple AC ID Extended Community; one per attachment circuit ID / VLAN. It may happen that the number of VLAN is faily large. Multiple routes with different RD may be required to carry such amount of Extended Community. This approach is complexifying the overall solution and implementation.¶
To remedy to that situation, the attachment circuit ID MAY be set to 0xFFFF_FFFF. That value tells peer PE that the attachment circuit ID is carried has part of the Ethernet Tag field of the associated route. Since the key of the EVPN route is unique, multiple AC ID Extended Community per route is no longer required. There is drawback. It pose backward interoperability issue with PE expecting a zero Ethernet-TAG ID.¶
The same Security Considerations described in [RFC7432] are valid for this document.¶
A new transitive extended community Type of 0x06 and Sub-Type of 0x0E for EVPN attachment circuit Extended Community has been allocated by IANA.¶
We would like to thank Luc Andre Burdet, Tapraj Singh , Mei Zhang for providing valuable comments.¶
In addition to the authors listed on the front page, the following co-authors have also contributed to this document:¶
Patrice Brissette
Cisco Systems¶