Details of Research Outputs

Status已发表Published
TitleLASSO-based false-positive selection for class-imbalanced data in metabolomics
Creator
Date Issued2019
Source PublicationJournal of Chemometrics
ISSN0886-9383
Volume33Issue:10
Abstract

Feature selection and rebalancing can be seen as two preprocessing ways in class-imbalanced learning. Recently, there have been many research achievements and applications on LASSO-type feature selection, whereas most of them are not directly designed for addressing class-imbalanced data. In this study, we proposed a LASSO-based stable feature selection algorithm for class-imbalanced data analysis, and false-positive selection (FPS) under balanced and imbalanced situations was calculated via selection frequency of each predictor in doing stable selection. The results on simulation studies and real data examples show that class imbalance contributes to avoid overselection caused by LASSO when the data are highly correlated and a lower FPS can be obtained with class-imbalanced data than balanced one in most of cases in the same settings. A statistical explanation was given for this phenomenon. In addition, it does not need to rebalance the class-imbalanced data for performing such LASSO-based feature selection with a stable strategy, and to some degree, intentionally disequilibrating the balanced data could be an alternative strategy to weaken overselection and to perform biomarker identification for finding a few of most important biomarkers. © 2019 John Wiley & Sons, Ltd.

Keywordclass imbalance false-positive selection LASSO-based feature selection rebalance
DOI10.1002/cem.3177
URLView source
Indexed BySCIE
Language英语English
WOS Research AreaAutomation & Control Systems ; Chemistry ; Computer Science ; Instruments & Instrumentation ; Mathematics
WOS SubjectAutomation & Control Systems ; Chemistry, Analytical ; Computer Science, Artificial Intelligence ; Instruments & Instrumentation ; Mathematics, Interdisciplinary Applications ; Statistics & Probability
WOS IDWOS:000483611900001
Citation statistics
Cited Times:10[WOS]   [WOS Record]     [Related Records in WOS]
Document TypeJournal article
Identifierhttp://repository.uic.edu.cn/handle/39GCC9TT/5055
CollectionResearch outside affiliated institution
Affiliation
1.School of Science, Kunming University of Science and Technology, Kunming, China
2.Faculty of Agriculture and Food, Kunming University of Science and Technology, Kunming, China
3.School of Mathematics, The University of Manchester, Manchester, United Kingdom
Recommended Citation
GB/T 7714
Fu, Guang-Hui,Yi, Lun-Zhao,Pan, Jianxin. LASSO-based false-positive selection for class-imbalanced data in metabolomics[J]. Journal of Chemometrics, 2019, 33(10).
APA Fu, Guang-Hui, Yi, Lun-Zhao, & Pan, Jianxin. (2019). LASSO-based false-positive selection for class-imbalanced data in metabolomics. Journal of Chemometrics, 33(10).
MLA Fu, Guang-Hui,et al."LASSO-based false-positive selection for class-imbalanced data in metabolomics". Journal of Chemometrics 33.10(2019).
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Fu, Guang-Hui]'s Articles
[Yi, Lun-Zhao]'s Articles
[Pan, Jianxin]'s Articles
Baidu academic
Similar articles in Baidu academic
[Fu, Guang-Hui]'s Articles
[Yi, Lun-Zhao]'s Articles
[Pan, Jianxin]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Fu, Guang-Hui]'s Articles
[Yi, Lun-Zhao]'s Articles
[Pan, Jianxin]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.