Basic Information
Biography
My research focuses on large language models:
Challenges in scaling. Scaling has been the main driver of progress in machine learning for the past few years: I am interested in how we can keep that engine churning. Specifically, I am interested in challenges brought forth by ML becoming a so-called big science, with novel research directions at the crossroads of large-scale engineering and pure research.
Data scalability. What makes some pretraining datasets better than others? How can we build quality datasets with trillions of tokens? Is the human part in RLHF truly needed, or can models bootstrap themselves?
Philosophy of mind. I am interested in how LLMs can gain human-like functions. This ranges from deliberate reasoning and planning to the acquisition of a theory of mind and its relation to works such as Julian Jaynes' bicameral mind. I am also interested in tool use, and how LLMs can learn to interact with their environment.
Papers (20 in total)
Kilian Müller, Julien Launay, Iacopo Poli, Matthew Filipovich, Alessandro Capelli, Daniel Hesslow, Igor Carron, Laurent Daudet, Florent Krzakala, Sylvain Gigan
2023 Conference on Lasers and Electro-Optics Europe & European Quantum Electronics Conference (CLEO/Europe-EQEC), pp. 1-1 (2023)
Citations: 0; views: 0; indexed in EI, WOS
arXiv (2022)
Citations: 0; views: 0; indexed in EI
Semantic Scholar (2022)
Citations: 40; views: 0; indexed in EI
International Conference on Machine Learning (2022): 22964-22984
International Conference on Language Resources and Evaluation (LREC) (2022): 4275-4284
Citations: 3; views: 0; indexed in EI