KNN Classifier with Self Adjusting Memory for Heterogeneous Concept Drift

2016 IEEE 16th International Conference on Data Mining (ICDM)(2016)

引用 212|浏览67
暂无评分
摘要
Data Mining in non-stationary data streams is gaining more attentionrecently, especially in the context of Internet of Things and Big Data. It is a highly challenging task, since the fundamentally different typesof possibly occurring drift undermine classical assumptions such asi.i.d. data or stationary distributions. Available algorithms are either struggling with certain forms of drift or require a priori knowledge in terms of a task specific setting. We propose the Self Adjusting Memory (SAM) model for the k Nearest Neighbor (kNN) algorithm since kNN constitutes a proven classifier within the streaming setting. SAM-kNN can deal with heterogeneous concept drift, i.e different drift types and rates, using biologically inspiredmemory models and their coordination. It can be easilyapplied in practice since an optimization of the meta parameters is not necessary. The basic idea is to construct dedicated models for thecurrent and former concepts and apply them according tothe demands of the given situation. An extensive evaluation on various benchmarks, consisting of artificial streamswith known drift characteristics as well as real world datasets is conducted. Thereby, we explicitly add new benchmarks enabling a precise performance evaluation on multiple types of drift. The highly competitive results throughout all experiments underline the robustness of SAM-kNN as well as its capabilityto handle heterogeneous concept drift.
更多
查看译文
关键词
Data streams,concept drift,kNN,data mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要