Contribution of statistical learning to mitigating the curse of dimensionality in reinforcement learning

Aryan Yazdanpanah, Chong Wang,Ethan Trepka, Marissa Benz,Alireza Soltani

crossref(2024)

引用 0|浏览6
暂无评分
摘要
Natural environments are abundant with patterns and regularities. It has been demonstrated that learning these regularities, especially through statistical learning, can greatly influence perception, memory, and other cognitive functions. Using a novel experimental paradigm involving two orthogonal tasks, we investigated whether regularities in the environment can enhance reward learning. In one task, human participants predicted the next stimulus in a sequence by recognizing regularities in a feature. In a separate, multidimensional learning task, they learned the predictive value of a different set of stimuli, based on reward feedback received after choosing between pairs of stimuli. Using both model-free and model-based approaches, we found that participants used regularities about features from the sequence-prediction task to bias their behavior in the learning task, resulting in the values associated with the regular feature having a greater influence. Fitting of choice behavior revealed that these effects were more consistent with attentional modulations of learning, rather than decision making, due to regularity manipulation. Specifically, the learning rates for the feature with regularity were higher, especially when learning from the forgone option during unrewarded trials. This demonstrates that feature regularities can intensify the confirmation bias observed in reward learning. Our results suggest that by enhancing learning about certain features, detecting regularities in the environment can reduce dimensionality and thus mitigate the curse of dimensionality in reward learning. Such interactions between statistical and reward learning have important implications for learning in naturalistic settings.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要