Top-down design of protein nanomaterials with reinforcement learning

biorxiv(2022)

引用 3|浏览36
暂无评分
摘要
The multisubunit protein assemblies that play critical roles in biology are the result of evolutionary selection for function of the entire assembly, and hence the subunits in structures such as icosahedral viral capsids often fit together with remarkable shape complementarity[1][1],[2][2]. In contrast, the large multisubunit assemblies that have been created by de novo protein design, notably the icosahedral nanocages used in a new generation of potent vaccines[3][3]–[7][4], have been built by first designing symmetric oligomers with cyclic symmetry and then assembling these into nanocages while keeping the internal structure fixed[8][5]–[14][6], which results in more porous structures with less extensive shape matching between the components. Such hierarchical “bottom-up” design approaches have the advantage that one interface can be designed and validated in the context of the cyclic oligomer building block[15][7],[16][8], but the disadvantage that the structural and functional features of the assemblies are limited by the properties of the predesigned building blocks. To overcome this limitation, we set out to develop a “top-down” reinforcement learning based approach to protein nanomaterial design in which both the structures of the subunits and the interactions between them are built up coordinately in the context of the entire assembly. We developed a Monte Carlo tree search (MCTS) method[17][9],[18][10] which assembles protein monomer structures in the context of an overall architecture guided by a loss function which enables specification of any desired overall structural properties such as shape and porosity. We demonstrate the power of the approach by designing hyperstable icosahedral assemblies more compact than any previously observed protein icosahedral structure (designed or naturally occurring), that have very low porosity and are robust to fusion and display of proteins as complex as influenza hemagglutinin. CryoEM structures of two designs are very close to the computational design models. Our top-down reinforcement learning approach should enable the design of a wide variety of complex protein nanomaterials by direct optimization of overall system properties. ### Competing Interest Statement DB, SW, IL, CN, AD, NK and AB are inventors on a provisional patent application submitted by the University of Washington for the design, composition and function of the proteins created in this study. [1]: #ref-1 [2]: #ref-2 [3]: #ref-3 [4]: #ref-7 [5]: #ref-8 [6]: #ref-14 [7]: #ref-15 [8]: #ref-16 [9]: #ref-17 [10]: #ref-18
更多
查看译文
关键词
protein nanomaterials,learning,top-down
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要