SS-DBSCAN: Epsilon Estimation with Stratified Sampling for Density-Based Spatial Clustering of Applications with Noise
2023 International Conference on Automation, Control and Electronics Engineering (CACEE)(2023)
摘要
Clustering algorithms are crucial in uncovering hidden patterns and structures within datasets. Among the density-based clustering algorithms, DBSCAN (Density-Based Spatial Clustering of Applications with Noise) has gained considerable attention for its effectiveness in various applications. However, determining appropriate parameter values for this algorithm remains a challenging task. This paper presents a novel methodology for eps parameter estimation for an improved DBSCAN, namely SS-DBSCAN. The experimental results across nine datasets demonstrate the efficacy of our proposed method in accurately determining clusters with eps value from SS-DBSCAN algorithm. The clusters identified using estimated eps values by SS-DBSCAN align well with the inherent structure of the datasets, yielding better cluster results than the manually set parameters and other methods used for automatic estimations of the eps for DBSCAN. Our approach adapted well to the peculiarities of each dataset, whether dealing with different scales, dimensions, or densities; it proved the versatility and robustness across various datasets, thereby emphasizing its generalizability and potential for broader applications.
更多查看译文
关键词
SS-DBSCAN,eps estimation,density based clustering,stratified sampling,diverse dataset
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要