Towards Hierarchical Causal Representation Learning for Nonstationary Multi-Omics Data

biorxiv(2022)

引用 0|浏览7
暂无评分
摘要
As the available sequencing data modalities increase, so does the potential biological insight that they are able to provide. Most existing methods to integrate co-profiled single-cell multi-omics data focus only on learning representations that capture stationary and shared information among these modalities. Current methods do not account for time-dependent and modality-specific information delineating cell states and subtypes, nor do they consider dynamics resulting from causal relations among modalities. For example, open chromatin may cause active transcription; however, it is also possible that gene expression responses lag behind changes in chromatin accessibility. To account for this time lag, the epigenome and transcriptome relationship can be characterized as "coupled" (changing dependently) or "decoupled" (changing independently). We propose the framework HALO ( H ierarchical c A usal representation L earning for O mics data), which adopts a causal approach to model these non-stationary causal relations using independent changing mechanisms in co-profiled single-cell ATAC- and RNA-seq data. Our model factorizes these two modalities into both coupled and decoupled latent representations, allowing us to identify the dynamic interplay between chromatin accessibility and transcription through temporal modulations. In blood lineage and developing mouse brain data, where the balance between proliferation and differentiation is tightly regulated, HALO distinguishes between coupled and decoupled genes and links them with disparate processes that constitute these two complementary states. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
关键词
hierarchical causal representation learning,data,multi-omics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要