科研成果详情

发表状态已发表Published
题名LASSO-based false-positive selection for class-imbalanced data in metabolomics
作者
发表日期2019
发表期刊Journal of Chemometrics
ISSN/eISSN0886-9383
卷号33期号:10
摘要

Feature selection and rebalancing can be seen as two preprocessing ways in class-imbalanced learning. Recently, there have been many research achievements and applications on LASSO-type feature selection, whereas most of them are not directly designed for addressing class-imbalanced data. In this study, we proposed a LASSO-based stable feature selection algorithm for class-imbalanced data analysis, and false-positive selection (FPS) under balanced and imbalanced situations was calculated via selection frequency of each predictor in doing stable selection. The results on simulation studies and real data examples show that class imbalance contributes to avoid overselection caused by LASSO when the data are highly correlated and a lower FPS can be obtained with class-imbalanced data than balanced one in most of cases in the same settings. A statistical explanation was given for this phenomenon. In addition, it does not need to rebalance the class-imbalanced data for performing such LASSO-based feature selection with a stable strategy, and to some degree, intentionally disequilibrating the balanced data could be an alternative strategy to weaken overselection and to perform biomarker identification for finding a few of most important biomarkers. © 2019 John Wiley & Sons, Ltd.

关键词class imbalance false-positive selection LASSO-based feature selection rebalance
DOI10.1002/cem.3177
URL查看来源
收录类别SCIE
语种英语English
WOS研究方向Automation & Control Systems ; Chemistry ; Computer Science ; Instruments & Instrumentation ; Mathematics
WOS类目Automation & Control Systems ; Chemistry, Analytical ; Computer Science, Artificial Intelligence ; Instruments & Instrumentation ; Mathematics, Interdisciplinary Applications ; Statistics & Probability
WOS记录号WOS:000483611900001
引用统计
被引频次:10[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/5055
专题个人在本单位外知识产出
作者单位
1.School of Science, Kunming University of Science and Technology, Kunming, China
2.Faculty of Agriculture and Food, Kunming University of Science and Technology, Kunming, China
3.School of Mathematics, The University of Manchester, Manchester, United Kingdom
推荐引用方式
GB/T 7714
Fu, Guang-Hui,Yi, Lun-Zhao,Pan, Jianxin. LASSO-based false-positive selection for class-imbalanced data in metabolomics[J]. Journal of Chemometrics, 2019, 33(10).
APA Fu, Guang-Hui, Yi, Lun-Zhao, & Pan, Jianxin. (2019). LASSO-based false-positive selection for class-imbalanced data in metabolomics. Journal of Chemometrics, 33(10).
MLA Fu, Guang-Hui,et al."LASSO-based false-positive selection for class-imbalanced data in metabolomics". Journal of Chemometrics 33.10(2019).
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Fu, Guang-Hui]的文章
[Yi, Lun-Zhao]的文章
[Pan, Jianxin]的文章
百度学术
百度学术中相似的文章
[Fu, Guang-Hui]的文章
[Yi, Lun-Zhao]的文章
[Pan, Jianxin]的文章
必应学术
必应学术中相似的文章
[Fu, Guang-Hui]的文章
[Yi, Lun-Zhao]的文章
[Pan, Jianxin]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。