A comparison of missing value imputation methods applied to daily precipitation in a semi-arid and a humid region of Mexico

ATMOSFERA(2023)

引用 0|浏览4
暂无评分
摘要
Climatological data with unreliable or missing values is an important area of research, and multiple methods are available to fill in missing data and evaluate data quality. Our study aims to compare the performance of different methods for estimating missing values explicitly designed for precipitation and multipurpose hydrological data. The climate variable used for the analysis was daily precipitation. We considered two different climate and orographic regions to evaluate the effects of altitude, precipitation regime, and percentage of missing data on the Mean Absolute Error of imputed values and performed a homogeneity evaluation of meteorological stations. We excluded meteorological stations with more than 25% missing data from the analysis. In the semi-arid region, ReddPrec (optimal for nine stations) and GCIDW (optimal for eight stations) were the best-performing methods for the 23 stations, with average MAE values of 1.63 mm/day and 1.46 mm/day, respectively. In the humid region, GCIDW was optimal in similar to 59% of stations, EM in similar to 24%, and ReddPrec in similar to 17%, with average MAE values of similar to 6.0 mm/day, 6.5 mm/day, and similar to 9.8 mm/day, respectively. This research makes a valuable contribution to identifying the most appropriate methods to impute daily precipitation in different climatic regions of Mexico based on efficiency indicators and homogeneity evaluation.
更多
查看译文
关键词
homogeneity,imputation,missing precipitation data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要