基本信息
views: 88
![](https://originalfileserver.aminer.cn/sys/aminer/icon/show-trajectory.png)
Bio
I used to study representations in democracies (almost did a PhD in political science), and now I study representations of language and cognition. My current research focuses on two directions:
Instruction-tuned models. I co/first-authored various large language models (T0, Flan-T5/PaLM, and BLOOM), with a focus on zero-shot generalization to unseen tasks that go beyond statistical pattern matching. At the moment, I’m working on LM agents with recurrent and adaptive computation with reinforcement learning.
Finding where in pretraining and instruction-tuning corpora do models acquire zero-shot and few-shot abilities, understanding exactly how models generalize at test time, and explaining why models match human behaviors in profound ways in some settings (e.g., Dasgupta et al. 2022), while also exhibiting comically un-human-like behaviors in some other settings (e.g., Webson et al. 2023).
Instruction-tuned models. I co/first-authored various large language models (T0, Flan-T5/PaLM, and BLOOM), with a focus on zero-shot generalization to unseen tasks that go beyond statistical pattern matching. At the moment, I’m working on LM agents with recurrent and adaptive computation with reinforcement learning.
Finding where in pretraining and instruction-tuning corpora do models acquire zero-shot and few-shot abilities, understanding exactly how models generalize at test time, and explaining why models match human behaviors in profound ways in some settings (e.g., Dasgupta et al. 2022), while also exhibiting comically un-human-like behaviors in some other settings (e.g., Webson et al. 2023).
Research Interests
Papers共 21 篇Author StatisticsCo-AuthorSimilar Experts
By YearBy Citation主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
Khaled Saab,Tao Tu,Wei-Hung Weng,Ryutaro Tanno,David Stutz,Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves,Szu-Yeu Hu,
CoRR (2024)
Cited0Views0EIBibtex
0
0
Gemini Team, Petko Georgiev, Ving Ian Lei,Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding,
arxiv(2024)
Cited0Views0Bibtex
0
0
JOURNAL OF MACHINE LEARNING RESEARCH (2024)
CoRR (2024)
Cited0Views0EIBibtex
0
0
Mary Phuong,Matthew Aitchison,Elliot Catt, Sarah Cogan, Alexandre Kaskasoli,Victoria Krakovna,David Lindner,Matthew Rahtz,Yannis Assael, Sarah Hodkinson, Heidi Howard,Tom Lieberum,
CoRR (2024)
Cited0Views0EIBibtex
0
0
arXiv (Cornell University) (2023)
arXiv (Cornell University) (2023): 7662-7686
arXiv (Cornell University) (2023)
Load More
Author Statistics
Co-Author
Co-Institution
D-Core
- 合作者
- 学生
- 导师
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn