Paper
13 January 2023 Predictors selection strategy based on stepwise random forests and logistic regression model
Chaozhi Li
Author Affiliations +
Proceedings Volume 12510, International Conference on Statistics, Data Science, and Computational Intelligence (CSDSCI 2022); 1251012 (2023) https://doi.org/10.1117/12.2656859
Event: International Conference on Statistics, Data Science, and Computational Intelligence (CSDSCI 2022), 2022, Qingdao, China
Abstract
Random Forests (RF) are a popular machine learning method for developing variables selection models. However, it may suffer from a lack of better model interpretability compared with traditional models, such as Logistic Regression (LR). In this paper, we propose a predictors selection strategy based on Stepwise Random Forests and Logistic Regression model (SRFLR) and validate it using HCV data provided by UCI machine learning repository platform. The Synthetic Minority Oversampling Technique (SMOTE) algorithm is adopted to deal with the problem of imbalance class in the dataset. The results demonstrate that the proposed SRFLR can obtain more predictors while preserving better prediction ability than LR alone, which will offer some references for clinical researchers to select relevant disease predictors.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Chaozhi Li "Predictors selection strategy based on stepwise random forests and logistic regression model", Proc. SPIE 12510, International Conference on Statistics, Data Science, and Computational Intelligence (CSDSCI 2022), 1251012 (13 January 2023); https://doi.org/10.1117/12.2656859
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Machine learning

Performance modeling

Mathematical modeling

Binary data

Error analysis

Statistical analysis

Back to Top