Language models enable zero-shot prediction of the effects of mutations on protein function.

Meier J,Rao R,Verkuil R,Liu J, Sercu T,Rives A

Annual Conference on Neural Information Processing Systems(2021)

引用 318|浏览118
暂无评分
摘要
Modeling the effect of sequence variation on function is a fundamental problem for understanding and designing proteins. Since evolution encodes information about function into patterns in protein sequences, unsupervised models of variant effects can be learned from sequence data. The approach to date has been to fit a model to a family of related sequences. The conventional setting is limited, since a new model must be trained for each prediction task. We show that using only zero-shot inference, without any supervision from experimental data or additional training, protein language models capture the functional effects of sequence variation, performing at state-of-the-art.
更多
查看译文
关键词
Language model,Inference,Function (mathematics),Machine learning,Task (project management),Experimental data,Computer science,Zero (linguistics),Artificial intelligence,Protein function,Sequence variation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要