Explainable Correlation of Categorical Data and Bar Charts

Imre J. Rudas, I.M. Batyrshin

Studies in fuzziness and soft computing(2023)

引用 0|浏览0
暂无评分
摘要
We propose a method of calculating the correlation between frequency distributions defined on a finite set of categories. The method is based on the general approach to constructing an invertible correlation function on a set with involutive operation. This correlation function is constructed using a suitable similarity (or dissimilarity) function defined on such a set. For constructing a correlation function on the set of frequency distributions, we use the recently introduced involutive negation of probability distributions and a suitable dissimilarity function between them. Surprisingly, the obtained correlation function coincides with the Pearson correlation coefficient. The proposed approach is illustrated in an example of calculating the correlation between categories of a categorical variable corresponding to strings of a contingency table. The traditional graphical interpretation of the Pearson correlation by linear regression gives an explanation of obtained correlations.
更多
查看译文
关键词
explainable correlation,categorical data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要