TEAL: Learning-Accelerated Optimization of WAN Traffic Engineering

PROCEEDINGS OF THE 2023 ACM SIGCOMM 2023 CONFERENCE, SIGCOMM 2023(2023)

引用 0|浏览108
暂无评分
摘要
The rapid expansion of global cloud wide-area networks (WANs) has posed a challenge for commercial optimization engines to efficiently solve network traffic engineering (TE) problems at scale. Existing acceleration strategies decompose TE optimization into concurrent subproblems but realize limited parallelism due to an inherent tradeoff between run time and allocation performance. We present TEAL, a learning-based TE algorithm that leverages the parallel processing power of GPUs to accelerate TE control. First, TEAL designs a flow-centric graph neural network (GNN) to capture WAN connectivity and network flows, learning flow features as inputs to downstream allocation. Second, to reduce the problem scale and make learning tractable, TEAL employs a multi-agent reinforcement learning (RL) algorithm to independently allocate each traffic demand while optimizing a central TE objective. Finally, TEAL fine-tunes allocations with ADMM (Alternating Direction Method of Multipliers), a highly parallelizable optimization algorithm for reducing constraint violations such as overutilized links. We evaluate TEAL using traffic matrices from Microsoft's WAN. On a large WAN topology with >1,700 nodes, TEAL generates near-optimal flow allocations while running several orders of magnitude faster than the production optimization engine. Compared with other TE acceleration schemes, TEAL satisfies 6-32% more traffic demand and yields 197-625x speedups.
更多
查看译文
关键词
Traffic Engineering,Wide-Area Networks,Network Optimization,Machine Learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要