REINFORCE
Achieving Efficient Failure Resiliency for Network Function Virtualization-Based Services
Article Ecrit par: Kulkarni, Sameer G. ; Fu, Xiaoming ; Ramakrishnan, K. K. ; Wood, Timothy ; Arumaithurai, Mayutan ; Liu, Guyue ;
Résumé: Ensuring high availability (HA) for software-based networks is a critical design feature that will help the adoption of software-based network functions (NFs) in production networks. It is important for NFs to avoid outages and maintain mission-critical operations. However, HA support for NFs on the critical data path can result in unacceptable performance degradation. We present REINFORCE, an integrated framework to support efficient resiliency for NF service chains. REINFORCE includes timely failure detection and consistent failover mechanisms. REINFORCE replicates state to standby NFs (local and remote) while enforcing correctness. It minimizes the number of state transfers by exploiting the concept of external synchrony, and leverages opportunistic batching and multi-buffering to optimize performance. Experimental results show that, even at line-rate packet processing (10 Gbps), REINFORCE achieves chain-level failover across servers in a LAN within 10ms, incurring less than 10% performance overhead, and adds average latency only ~400 ?s, with a worst-case latency of less than 1ms. REINFORCE also recovers from software failures within the same node in less than 100 ?s, incurring less than 1% performance overhead and adds less than 5 ?s latency during normal operation.
Langue:
Anglais