Responding to Network Failures at Data-plane Speeds with Network Programmability.

NOMS (2023)

Abstract
Measurement studies show that equipment failures are frequent and pose a challenge to reliable network operation. Quickly recovering from failures is critical to meeting service guarantees. Traditional routing protocols run in a distributed fashion across multiple devices and therefore need non-negligible time to recompute routes after a failure. SDN with OpenFlow simplifies route recomputation, but the time to compute and install alternative forwarding entries can still result in significant packet loss. Existing fast failover mechanisms cannot handle all types of failures and do not guarantee the use of the best paths. In this paper, we present FELIX, an approach to failure recovery that reroutes around failures at data-plane timescales. FELIX efficiently pre-computes tactics for handling failure scenarios; these tactics can be quickly activated in the data plane when a failure occurs. Our evaluation shows that FELIX can recover from failures up to three orders of magnitude faster than existing SDN approaches.
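The general idea of pre-computing responses to failure scenarios can be illustrated with a minimal sketch. This is not FELIX's algorithm, which the abstract does not detail. It assumes an undirected topology, single-link failures only, and hop-count shortest paths. The names `next_hops`, `precompute`, `forward`, and the four-node topology are hypothetical and chosen for illustration.

```python
from collections import deque

def next_hops(adj, dst, failed=None):
    """BFS outward from dst, skipping the failed link.
    Returns {node: next hop toward dst} for every node that can still reach dst."""
    hops = {}
    seen = {dst}
    queue = deque([dst])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if failed is not None and frozenset((u, v)) == failed:
                continue  # this link is down in the current scenario
            if v not in seen:
                seen.add(v)
                hops[v] = u  # v forwards to u to get one hop closer to dst
                queue.append(v)
    return hops

def precompute(adj, dst):
    """Offline step: one forwarding table per single-link failure scenario.
    The key None holds the failure-free table."""
    links = {frozenset((u, v)) for u in adj for v in adj[u]}
    table = {None: next_hops(adj, dst)}
    for link in links:
        table[link] = next_hops(adj, dst, failed=link)
    return table

def forward(table, failed_link, node):
    """Online step: a plain lookup, with no route recomputation."""
    return table[failed_link].get(node)

# Hypothetical square topology: A-B, A-C, B-D, C-D (assumption for illustration).
adj = {"A": ["B", "C"], "B": ["A", "D"], "C": ["A", "D"], "D": ["B", "C"]}
table = precompute(adj, "D")

normal = forward(table, None, "A")                     # "B"
rerouted = forward(table, frozenset(("B", "D")), "A")  # "C"
```

In this sketch all path computation happens ahead of time, so reacting to a failure reduces to selecting a different pre-built table. A real data-plane implementation would encode such tables in switch match-action rules rather than Python dictionaries.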
Keywords
alternative forwarding entries, data plane timescales, data-plane speeds, equipment failures, failure recovery, failure scenarios, FELIX, involving multiple devices, measurement studies, meeting service guarantees, network failures, network programmability, nonnegligible time, OpenFlow, pre-computing tactics, reliable network operation, route recomputation, SDN approaches, traditional routing protocols