Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces.

IEEE/ACM Transactions on Audio, Speech, and Language Processing(2018)

引用 83|浏览49
暂无评分
摘要
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue agents that can converse with humans. A part of this effort is the policy optimization task, which attempts to find a policy describing how to respond to humans, in the form of a function taking the current state of the dialogue and returning the response of the system. In this paper, we investigate de...
更多
查看译文
关键词
Optimization,Learning (artificial intelligence),Task analysis,Neural networks,Speech processing,Training,Markov processes
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要