Research Output Details

Publication status: Published
Title: Variable Selection for Distributed Sparse Regression Under Memory Constraints
Authors: Wang, Haofeng; Jiang, Xuejun; Zhou, Min; Jiang, Jiancheng
Publication date: 2024-06-01
Journal: Communications in Mathematics and Statistics
ISSN/eISSN: 2194-6701
Volume: 12; Issue: 2; Pages: 307-338
Abstract

This paper studies variable selection using the penalized likelihood method for distributed sparse regression with a large sample size n under a limited memory constraint, a pressing research problem in the big data era. A naive divide-and-conquer approach splits the whole data set into N parts, fits each part on one of N machines, aggregates the results from all machines by averaging, and then obtains the selected variables. However, this approach tends to select more noise variables, and its false discovery rate may not be well controlled. We improve on it with a specially designed weighted average in the aggregation step. Although the alternating direction method of multipliers can also be used to handle massive data, our proposed method substantially reduces the computational burden and performs better in terms of mean squared error in most cases. Theoretically, we establish asymptotic properties of the resulting estimators for likelihood models with a diverging number of parameters. Under some regularity conditions, we establish oracle properties in the sense that our distributed estimator shares the same asymptotic efficiency as the estimator based on the full sample. Computationally, a distributed penalized likelihood algorithm is proposed to refine the results in the context of general likelihoods. Finally, the proposed method is evaluated through simulations and a real data example.
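
The naive divide-and-conquer baseline described in the abstract can be sketched as follows. This is a minimal illustration and not the authors' algorithm: it assumes a linear model with an L1 (lasso) penalty fitted by coordinate descent, and the names `lasso_cd`, `naive_dc_select`, `n_machines`, and `lam` are hypothetical. The weighted aggregation proposed in the paper is not specified in the abstract and is not reproduced here.

```python
import numpy as np

def soft_threshold(z, t):
    """Soft-thresholding operator used by the lasso update."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_cd(X, y, lam, n_iter=200):
    """Lasso by cyclic coordinate descent.

    Minimizes (1 / (2n)) * ||y - X b||^2 + lam * ||b||_1 (assumed penalty).
    """
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n      # (1/n) * x_j' x_j for each column
    resid = y.astype(float)                # residual y - X beta, with beta = 0
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # (1/n) * x_j' (partial residual excluding coordinate j)
            rho = X[:, j] @ resid / n + col_sq[j] * beta[j]
            new = soft_threshold(rho, lam) / col_sq[j]
            resid -= X[:, j] * (new - beta[j])
            beta[j] = new
    return beta

def naive_dc_select(X, y, n_machines, lam):
    """Split rows into n_machines parts, fit each part, average, then select."""
    parts = np.array_split(np.arange(X.shape[0]), n_machines)
    local = np.array([lasso_cd(X[idx], y[idx], lam) for idx in parts])
    beta_avg = local.mean(axis=0)          # simple (unweighted) aggregation
    selected = np.flatnonzero(beta_avg != 0)
    return beta_avg, selected, local

# Synthetic example (assumed setup): 3 true signals among 20 variables.
rng = np.random.default_rng(0)
n, p = 600, 20
beta_true = np.zeros(p)
beta_true[:3] = [3.0, -2.0, 1.5]
X = rng.standard_normal((n, p))
y = X @ beta_true + rng.standard_normal(n)

beta_avg, selected, local = naive_dc_select(X, y, n_machines=4, lam=0.1)
print("selected variables:", selected)
print("local support sizes:", (local != 0).sum(axis=1))
```

Because an averaged coefficient is nonzero whenever at least one machine's estimate is nonzero (barring exact cancellation), the selected set is essentially the union of the local supports. This is one way to see why plain averaging tends to admit noise variables.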

Keywords: 62H12; 62J12; Distributed penalized likelihood algorithm; Distributed sparse regression; Memory constraints; Variable selection
DOI: 10.1007/s40304-022-00291-w
Indexed in: SCIE
Language: English
WOS research area: Mathematics
WOS category: Mathematics
WOS accession number: WOS:000921784400001
Scopus ID: 2-s2.0-85147185498
Document type: Journal article
Item identifier: https://repository.uic.edu.cn/handle/39GCC9TT/11774
Collection: Faculty of Science and Technology
Corresponding author: Jiang, Xuejun
Affiliations
1.Department of Mathematics, Harbin Institute of Technology, Harbin, China
2.Department of Statistics and Data Science, Southern University of Science and Technology, Shenzhen, China
3.Beijing Normal University-Hong Kong Baptist University United International College, Zhuhai, China
4.Department of Mathematics and Statistics, University of North Carolina at Charlotte, Charlotte, United States
Recommended citation
GB/T 7714: Wang, Haofeng, Jiang, Xuejun, Zhou, Min, et al. Variable Selection for Distributed Sparse Regression Under Memory Constraints[J]. Communications in Mathematics and Statistics, 2024, 12(2): 307-338.
APA: Wang, Haofeng, Jiang, Xuejun, Zhou, Min, & Jiang, Jiancheng. (2024). Variable Selection for Distributed Sparse Regression Under Memory Constraints. Communications in Mathematics and Statistics, 12(2), 307-338.
MLA: Wang, Haofeng, et al. "Variable Selection for Distributed Sparse Regression Under Memory Constraints." Communications in Mathematics and Statistics 12.2 (2024): 307-338.