发表状态 | 已发表Published |
题名 | LASSO-based false-positive selection for class-imbalanced data in metabolomics |
作者 | |
发表日期 | 2019 |
发表期刊 | Journal of Chemometrics
![]() |
ISSN/eISSN | 0886-9383 |
卷号 | 33期号:10 |
摘要 | Feature selection and rebalancing can be seen as two preprocessing ways in class-imbalanced learning. Recently, there have been many research achievements and applications on LASSO-type feature selection, whereas most of them are not directly designed for addressing class-imbalanced data. In this study, we proposed a LASSO-based stable feature selection algorithm for class-imbalanced data analysis, and false-positive selection (FPS) under balanced and imbalanced situations was calculated via selection frequency of each predictor in doing stable selection. The results on simulation studies and real data examples show that class imbalance contributes to avoid overselection caused by LASSO when the data are highly correlated and a lower FPS can be obtained with class-imbalanced data than balanced one in most of cases in the same settings. A statistical explanation was given for this phenomenon. In addition, it does not need to rebalance the class-imbalanced data for performing such LASSO-based feature selection with a stable strategy, and to some degree, intentionally disequilibrating the balanced data could be an alternative strategy to weaken overselection and to perform biomarker identification for finding a few of most important biomarkers. © 2019 John Wiley & Sons, Ltd. |
关键词 | class imbalance false-positive selection LASSO-based feature selection rebalance |
DOI | 10.1002/cem.3177 |
URL | 查看来源 |
收录类别 | SCIE |
语种 | 英语English |
WOS研究方向 | Automation & Control Systems ; Chemistry ; Computer Science ; Instruments & Instrumentation ; Mathematics |
WOS类目 | Automation & Control Systems ; Chemistry, Analytical ; Computer Science, Artificial Intelligence ; Instruments & Instrumentation ; Mathematics, Interdisciplinary Applications ; Statistics & Probability |
WOS记录号 | WOS:000483611900001 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | https://repository.uic.edu.cn/handle/39GCC9TT/5055 |
专题 | 个人在本单位外知识产出 |
作者单位 | 1.School of Science, Kunming University of Science and Technology, Kunming, China 2.Faculty of Agriculture and Food, Kunming University of Science and Technology, Kunming, China 3.School of Mathematics, The University of Manchester, Manchester, United Kingdom |
推荐引用方式 GB/T 7714 | Fu, Guang-Hui,Yi, Lun-Zhao,Pan, Jianxin. LASSO-based false-positive selection for class-imbalanced data in metabolomics[J]. Journal of Chemometrics, 2019, 33(10). |
APA | Fu, Guang-Hui, Yi, Lun-Zhao, & Pan, Jianxin. (2019). LASSO-based false-positive selection for class-imbalanced data in metabolomics. Journal of Chemometrics, 33(10). |
MLA | Fu, Guang-Hui,et al."LASSO-based false-positive selection for class-imbalanced data in metabolomics". Journal of Chemometrics 33.10(2019). |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论