Sparse Discrete Communication Learning for Multi-Agent Cooperation Through Backpropagation.

IROS(2020)

引用 15|浏览14
暂无评分
摘要
Recent approaches to multi-agent reinforcement learning (MARL) with inter-agent communication have often overlooked important considerations of real-world communication networks, such as limits on bandwidth. In this paper, we propose an approach to learning sparse discrete communication through backpropagation in the context of MARL, in which agents are incentivized to communicate as little as possible while still achieving high reward. Building on top of our prior work on differentiable discrete communication learning, we develop a regularization-inspired message-length penalty term, that encourages agents to send shorter messages and avoid unnecessary communications. To this end, we introduce a variable-length message code that provides agents with a general means of modulating message length while keeping the overall learning objective differentiable. We present simulation results on a partially-observable robot navigation task, where we first show how our approach allows learning of sparse communication behavior while still solving the task. We finally demonstrate our approach can even learn an effective sparse communication behavior from demonstrations of an expert (potentially communication-free) policy.
更多
查看译文
关键词
sparse discrete communication learning,multiagent cooperation,backpropagation,multiagent reinforcement learning,MARL,inter-agent communication,communication networks,differentiable discrete communication learning,regularization-inspired message-length penalty term,variable-length message code,partially-observable robot navigation task
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要