Paper
28 March 2023 FL-Lightgbm prediction method of unbalanced small sample anti-breast cancer drugs
Chenxiao Zhou, Lianying Zou, Chuang Liu, Ziwei Song
Author Affiliations +
Proceedings Volume 12566, Fifth International Conference on Computer Information Science and Artificial Intelligence (CISAI 2022); 125661T (2023) https://doi.org/10.1117/12.2667385
Event: Fifth International Conference on Computer Information Science and Artificial Intelligence (CISAI 2022), 2022, Chongqing, China
Abstract
The problem of small amount data and sample imbalance exists in the machine learning prediction of the molecular properties of anti breast cancer candidate drugs. Proposing a FL-Lightgbm prediction model based on WGAN-GP data enchance model in order to solve this problem. Firstly, WGAN-GP model is used for data enhancement to increase the sample size of the training data set. Considering the small difference between positive and negative samples, the enhanced data of positive and negative samples are generated respectively, and then combined them according to the original order to ensure that the generated data and the original data maintain the same distribution; Then the Focal Loss function is introduced into the Lightgbm model to increase learning ability for unbalanced samples, the model constructed is called FL-Lightgbm prediction model. After the training of the enhanced data set, the proposed model shows excellent prediction accuracy for 178 randomly selected validation samples in the experiment, and its highest accuracy, AUC and F1 values reach 0.882, 0.851 and 0.7272 respectively. In these three indexes, the proposed model has better prediction ability than the original Lightgbm model with over sampling algorithms such as BorderlineSMOTE and ADASYN.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Chenxiao Zhou, Lianying Zou, Chuang Liu, and Ziwei Song "FL-Lightgbm prediction method of unbalanced small sample anti-breast cancer drugs", Proc. SPIE 12566, Fifth International Conference on Computer Information Science and Artificial Intelligence (CISAI 2022), 125661T (28 March 2023); https://doi.org/10.1117/12.2667385
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Biological samples

Breast cancer

Data modeling

Machine learning

Performance modeling

Statistical modeling

Tumor growth modeling

Back to Top