An efficient circRNA-miRNA interaction prediction model by combining biological text mining and wavelet diffusion-based sparse network structure embedding

Computers in Biology and Medicine (2023)

Abstract
Motivation: Accumulating clinical evidence shows that circular RNA (circRNA) plays an important regulatory role in the occurrence and development of human diseases and is expected to provide a new perspective for the diagnosis and treatment of related diseases. Computational methods can preselect high-probability candidates for wet-lab experiments, saving resources. However, because sparse biological networks lack neighborhood structure, models based on network embedding and graph embedding struggle to achieve ideal results. Results: In this paper, we propose BioDGW-CMI, which combines biological text mining and wavelet diffusion-based sparse network structure embedding to predict circRNA-miRNA interactions (CMIs). In detail, BioDGW-CMI first uses Bidirectional Encoder Representations from Transformers (BERT) for biological text mining to extract hidden features from RNA sequences. It then constructs a CMI network and obtains the topological structure embedding of each node through heat wavelet diffusion patterns. Next, a denoising autoencoder combines the structural features with Gaussian kernel similarity. Finally, the resulting features are passed to LightGBM for training and prediction. BioDGW-CMI achieves the highest prediction performance on all three datasets in the field of CMI prediction. In the case study, all 8 CMI pairs based on circ-ITCH were successfully predicted. Availability: The data and source code can be found at https://github.com/1axin/BioDGW-CMI-model.
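To make the structural steps in the abstract more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a GraphWave-style construction, in which each node's heat wavelet is summarized by its empirical characteristic function, and the commonly used Gaussian interaction profile form of the Gaussian kernel similarity. The toy interaction matrix, the function names, the diffusion scale, and the sample points are all hypothetical choices made for illustration; the abstract does not specify any of them.

```python
import numpy as np

def heat_wavelets(adj, scale=1.0):
    """Heat kernel exp(-scale * L) for the combinatorial Laplacian L = D - A.

    Column a of the result is the heat wavelet centred on node a.
    """
    lap = np.diag(adj.sum(axis=1)) - adj
    eigvals, eigvecs = np.linalg.eigh(lap)  # L is symmetric, so eigh applies
    return eigvecs @ np.diag(np.exp(-scale * eigvals)) @ eigvecs.T

def structural_embedding(wavelets, sample_points):
    """Embed each node by the empirical characteristic function of its
    wavelet coefficients, sampled at the given points (assumed design)."""
    # phases[t, m, a] = exp(i * sample_points[t] * wavelets[m, a])
    phases = np.exp(1j * sample_points[:, None, None] * wavelets[None, :, :])
    phi = phases.mean(axis=1)  # average over coefficients m, shape (T, N)
    return np.concatenate([phi.real.T, phi.imag.T], axis=1)  # shape (N, 2T)

def gip_kernel(interactions):
    """Gaussian interaction profile kernel between the rows of a 0/1 matrix."""
    sq_norms = (interactions ** 2).sum(axis=1)
    gamma = 1.0 / sq_norms.mean()  # bandwidth normalised by mean profile norm
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2 * interactions @ interactions.T
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))

# Hypothetical toy data: 3 circRNAs (rows) x 4 miRNAs (columns).
interactions = np.array([
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
], dtype=float)

# Bipartite CMI network: circRNA nodes first, then miRNA nodes.
n_circ, n_mi = interactions.shape
adj = np.block([
    [np.zeros((n_circ, n_circ)), interactions],
    [interactions.T, np.zeros((n_mi, n_mi))],
])

psi = heat_wavelets(adj, scale=1.0)
emb = structural_embedding(psi, sample_points=np.array([0.5, 1.0, 2.0]))
sim = gip_kernel(interactions)  # circRNA-circRNA similarity
```

In this toy network the first two circRNAs have identical interaction profiles, so they receive the same structural embedding and a kernel similarity of 1. In a pipeline like the one described, such structural and similarity features would then be fused (the abstract names a denoising autoencoder) before classification.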
Keywords
circRNA-miRNA interaction,Structure embedding,Biological text mining,Biomarkers,Structural role discovery