Computer and Modernization ›› 2023, Vol. 0 ›› Issue (11): 36-43.doi: 10.3969/j.issn.1006-2475.2023.11.006

Previous Articles     Next Articles

Prediction of Diabetes Mellitus Using LightGBM Classifier with RF-RFECV

  

  1. (Basic Medical Science Academy, Air Force Military Medical University, Xi’an 710032, China)
  • Online:2023-11-29 Published:2023-11-29

Abstract: Abstract: In order to find the high-risk population of diabetes in China as early as possible and provide targeted intervention measures, the data set of China Health and Retirement Longitudinal Study (CHARLS), which represents the Chinese population, was selected as the research object, and a hybrid algorithm based on RF-RFECV and LightGBM (RF-RFECV-LightGBM) was proposed, and compared with five other algorithms through experiments. The results show that RF-RFECV- LightGBM has the best overall performance, the accuracy, precision, recall, F1 value and AUC value are 0.9772, 0.9952, 0.8178, 0.8978, and 0.9357, respectively. The prediction time is 0.0428 s, which is 0.0549 s shorter than the prediction time of LightGBM before feature selection (increased by 56.19%), indicating the effectiveness of RF-RFECV algorithm. Finally, the same prediction process was tested on the Pima Indian dataset, and the results achieved an accuracy of 0.9415, further verifying the excellent performance of the proposed algorithm RF-RFECV-LightGBM, which can assist in clinical diagnosis and treatment of diabetes.

Key words: Key words: LightGBM, RF-RFECV, prediction of diabetes, CHARLS, Pima

CLC Number: