Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan

SOCIO-ECONOMIC PLANNING SCIENCES(2022)

引用 8|浏览7
暂无评分
摘要
Aims: We aimed at giving a preliminary analysis of the weakness of a current test strategy, and proposing a data driven strategy that was self-adaptive to the dynamic change of pandemic. The effect of driven-data selection over time and space was also within the deep concern.Methods: A mathematical definition of the test strategy were given. With the real COVID-19 test data from March to July collected in Lahore, a significance analysis of the possible features was conducted. A machine learning method based on logistic regression and priority ranking were proposed for the data-driven test strategy. With performance assessed by the area under the receiver operating characteristic curve (AUC), time series analysis and spatial cross-test were conducted.Results: The transition of risk factors accounted for the failure of the current test strategy. The proposed data driven strategy could enhance the positive detection rate from 2.54% to 28.18%, and the recall rate from 8.05% to 89.35% under strictly limited test capacity. Much more optimal utilization of test resources could be realized where 89.35% of total positive cases could be detected with merely 48.17% of the original test amount. The strategy showed self-adaptability with the development of pandemic, while the strategy driven by local data was proved to be optimal.Conclusions: We recommended a generalization of such a data-driven test strategy for a better response to the global developing pandemic. Besides, the construction of the COVID-19 data system should be more refined on space for local applications.
更多
查看译文
关键词
COVID-19, Test strategy, Policy making, Machine learning, Logistic regression, Time series analysis, Spatial analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要