Long-range RNA structures in the human transcriptome beyond evolutionarily conserved regions

Sergey Margasyuk, Lev Zavileyskiy,Changchang Cao,Dmitri Pervouchine

PeerJ(2023)

引用 0|浏览0
暂无评分
摘要
RNA structure has been increasingly recognized as a critical player in the biogenesis and turnover of many transcripts classes. In eukaryotes, the prediction of RNA structure by thermodynamic modeling meets fundamental limitations due to the large sizes and complex, discontinuous organization of eukaryotic genes. Signatures of functional RNA structures can be found by detecting compensatory substitutions in homologous sequences, but a comparative approach is applicable only within conserved sequence blocks. Here, we developed a computational pipeline called PHRIC, which is not limited to conserved regions and relies on RNA contacts derived from RNA in situ conformation sequencing (RIC-seq) experiments. It extracts pairs of short RNA fragments surrounded by nested clusters of RNA contacts and predicts long, nearly perfect complementary base pairings formed between these fragments. In application to a panel of RIC-seq experiments in seven human cell lines, PHRIC predicted similar to 12,000 stable long-range RNA structures with equilibrium free energy below -15 kcal/mol, the vast majority of which fall outside of regions annotated as conserved among vertebrates. These structures, nevertheless, show some level of sequence conservation and remarkable compensatory substitution patterns in other clades. Furthermore, we found that introns have a higher propensity to form stable long-range RNA structures between each other, and moreover that RNA structures tend to concentrate within the same intron rather than connect adjacent introns. These results for the first time extend the application of proximity ligation assays to RNA structure prediction beyond conserved regions.
更多
查看译文
关键词
RNA structure,Alternative splicing,Proximity ligation,RIC-seq,RNAcontacts,Long-range
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要