One-Shot Averaging for Distributed TD(λ) under Markov Sampling

IEEE Control Systems Letters (2024)

Citations: 0 | Views: 12
Keywords
Vectors, Markov decision processes, Function approximation, Trajectory, Servers, Finite element analysis, Electrical engineering, Multi-agent system, Reinforcement learning, Temporal difference learning