Chrome Extension
WeChat Mini Program
Use on ChatGLM

Parallel integrative learning for large-scale multi-response regression with incomplete outcomes

Ruipeng Dong, Daoji Li, Zemin Zheng

Computational Statistics & Data Analysis(2021)

Cited 2|Views6
No score
Abstract
Multi-task learning is increasingly used to investigate the association structure between multiple responses and a single set of predictor variables in many applications. In the era of big data, the coexistence of incomplete outcomes, large number of responses, and high dimensionality in predictors poses unprecedented challenges in estimation, prediction and computation. In this paper, we propose a scalable and computationally efficient procedure, called PEER, for large-scale multi-response regression with incomplete outcomes, where both the numbers of responses and predictors can be high-dimensional. Motivated by sparse factor regression, we convert the multi-response regression into a set of univariate-response regressions, which can be efficiently implemented in parallel. Under some mild regularity conditions, we show that PEER enjoys nice sampling properties including consistency in estimation, prediction, and variable selection. Extensive simulation studies show that our proposal compares favorably with several existing methods in estimation accuracy, variable selection, and computation efficiency. (C) 2021 Elsevier B.V. All rights reserved.
More
Translated text
Key words
High dimensionality,Incomplete data,Latent factors,Multi-task learning,Singular value decomposition
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined