Ribonanza: deep learning of RNA structure through dual crowdsourcing.

Shujun He, Rui Huang, Jill Townley, Rachael C Kretsch, Thomas G Karagianes, David B T Cox, Hamish Blair,Dmitry Penzar, Valeriy Vyaltsev, Elizaveta Aristova,Arsenii Zinkevich, Artemy Bakulin, Hoyeol Sohn, Daniel Krstevski, Takaaki Fukui, Fumiya Tatematsu, Yusuke Uchida, Donghoon Jang, Jun Seong Lee, Roger Shieh, Tom Ma, Eduard Martynov,Maxim V Shugaev, Habib S T Bukhari,Kazuki Fujikawa, Kazuki Onodera, Christof Henkel, Shlomo Ron,Jonathan Romano,John J Nicol, Grace P Nye, Yuan Wu, Christian Choe, Walter Reade, Eterna Participants,Rhiju Das

bioRxiv : the preprint server for biology(2024)

引用 0|浏览1
暂无评分
摘要
Prediction of RNA structure from sequence remains an unsolved problem, and progress has been slowed by a paucity of experimental data. Here, we present Ribonanza, a dataset of chemical mapping measurements on two million diverse RNA sequences collected through Eterna and other crowdsourced initiatives. Ribonanza measurements enabled solicitation, training, and prospective evaluation of diverse deep neural networks through a Kaggle challenge, followed by distillation into a single, self-contained model called RibonanzaNet. When fine tuned on auxiliary datasets, RibonanzaNet achieves state-of-the-art performance in modeling experimental sequence dropout, RNA hydrolytic degradation, and RNA secondary structure, with implications for modeling RNA tertiary structure.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要