FISHFactor: A Probabilistic Factor Model for Spatial Transcriptomics Data with Subcellular Resolution

biorxiv(2022)

引用 7|浏览11
暂无评分
摘要
Motivation Factor analysis is a widely used tool for unsupervised dimensionality reduction of high-throughput data sets in molecular biology, with recently proposed extensions designed specifically for spatial transcriptomics data. However, these methods expect (count) matrices as data input and are therefore not directly applicable to single molecule resolution data, which are in the form of coordinate lists annotated with genes and provide insight into subcellular spatial expression patterns. To address this, we here propose FISHFactor, a probabilistic factor model that combines the benefits of spatial, non-negative factor analysis with a Poisson point process likelihood to explicitly model and account for the nature of single molecule resolution data. In addition, FISHFactor shares information across a potentially large number of cells in a common weight matrix, allowing consistent interpretation of factors across cells and yielding improved latent variable estimates. Results We compare FISHFactor to existing methods that rely on aggregating information through spatial binning and cannot combine information from multiple cells, and show that our method leads to more accurate results on simulated data. We demonstrate on a real data set that FISHFactor is able to identify major subcellular expression patterns and spatial gene clusters in a data-driven manner. Availability and Implementation The model implementation, data simulation and experiment scripts are available under . Contact b.velten{at}dkfz.de ### Competing Interest Statement O.S. is a payed consultant of Insitro Inc.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要