Inferring protein from mRNA concentrations using convolutional neural networks

Patrick Maximilian Schwehn,Pascal Falter-Braun

bioRxiv (Cold Spring Harbor Laboratory)(2023)

引用 0|浏览4
暂无评分
摘要
Transcript abundance is a widely used but poor predictor of protein abundance. As proteins are the actual agents executing biological functions, and because signaling outcome depends in a non-linear manner on the concentration of the network components, we aimed to develop a convolutional neural network-(CNN-) based predictor for Homo sapiens and the reference plant Arabidopsis thaliana . After hyperparameter optimization and initial analysis of the training data, we employed a distinct training module for value and sequence data, respectively, predicting 40% of the variance in protein levels in Homo sapiens , respectively 48% in Arabidopsis thaliana . Codon counts and peptides had the greatest predictive power. Extracting the learned weight revealed generally similar trends but also some intriguing differences between human and Arabidopsis. Many learned motifs in the 5’ and 3’ UTRs correspond to previously described regulatory features demonstrating that the model can learn ab initio mechanistically relevant features. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
关键词
mrna concentrations,convolutional neural networks,protein,neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要