Predicting Alu exonization in the human genome with a deep learning model.

bioRxiv : the preprint server for biology(2024)

引用 0|浏览8
暂无评分
摘要
Alu exonization, or the recruitment of intronic Alu elements into gene sequences, has contributed to functional diversification; however, its extent and the ways in which it influences gene regulation are not fully understood. We developed an unbiased approach to predict Alu exonization events from genomic sequences implemented in a deep learning model, eXAlu, that overcomes the limitations of tissue or condition specificity and the computational burden of RNA-seq analysis. The model captures previously reported characteristics of exonized Alu sequences and can predict sequence elements important for Alu exonization. Using eXAlu, we estimate the number of Alu elements in the human genome undergoing exonization to be between 55-110K, 11-21 fold more than represented in the GENCODE gene database. Using RT-PCR we were able to validate selected predicted Alu exonization events, supporting the accuracy of our method. Lastly, we highlight a potential application of our method to identify polymorphic Alu insertion exonizations in individuals and in the population from whole genome sequencing data.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要