谷歌Chrome浏览器插件
订阅小程序
在清言上使用

Calibrated imputation for multivariate categorical data

Ton de Waal, Jacco Daalmans

ASTA-ADVANCES IN STATISTICAL ANALYSIS(2023)

引用 0|浏览1
暂无评分
摘要
Non-response is a major problem for anyone collecting and processing data. A commonly used technique to deal with missing data is imputation, where missing values are estimated and filled in into the dataset. Imputation can become challenging if the variable to be imputed has to comply with a known total. Even more challenging is the case where several variables in the same dataset need to be imputed and, in addition to known totals, logical restrictions between variables have to be satisfied. In our paper, we develop an approach for a broad class of imputation methods for multivariate categorical data such that previously published totals are preserved while logical restrictions on the data are satisfied. The developed approach can be used in combination with any imputation model that estimates imputation probabilities, i.e. the probability that imputation of a certain category for a variable in a certain unit leads to the correct value for this variable and unit.
更多
查看译文
关键词
Non-response,Edit rules,Fully conditional specification,Mass imputation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要