A Streamlined Scrna-Seq Data Analysis Framework Based On Improved Sparse Subspace Clustering

IEEE ACCESS(2021)

Cited 21|Views12
No score
Abstract
One advantage of single-cell RNA sequencing is its ability in revealing cell heterogeneity by cell clustering. However, cell clustering based on single-cell RNA sequencing is challenging due to the high transcript amplification noise, sparsity and outlier cell populations. In this study, we propose a novel sparse subspace clustering method called Structured Sparse Subspace Clustering and Completion for single-cell RNA sequencing analysis by assuming the cells related together are in the same subspace, and so the relationships among cells can be described within a subspace instead of between cell pairs. The proposed optimization model is solved by the Linearized Alternating Direction Method of Multipliers, in which data completion and spectral clustering are combined as a whole by mutual constraint. It is worth noting that random walk is used in the model to make the coefficient matrix more diagonal in the optimum iterative procedure, and the effect is significant. Our model is applied and compared with 5 state-of-the-art clustering methods on 6 public single cell datasets and a simulated data set with cell numbers varying from 56 to over 3000. As a result, our model outperforms the other clustering methods in clustering accuracy as evaluated by Adjusted Rand Index, Normalized Mutual Information, Homogeneity and Completeness, especially compared with the other improved sparse subspace clustering methods.
More
Translated text
Key words
Clustering methods, Optimization, Clustering algorithms, Sparse matrices, Sequential analysis, RNA, Robustness, Linearized alternating direction method of multipliers, low-rank representation, single-cell RNA sequencing, sparse subspace clustering
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined