计算机与现代化 ›› 2021, Vol. 0 ›› Issue (03): 88-93.

• 数据库与数据挖掘 • 上一篇    下一篇

一种为辅助诊断筛选机器学习模型的方法

  

  1. (1.长沙商贸旅游职业技术学院经济湘商学院,湖南长沙410116;2.湖南大学国家超级计算长沙中心,湖南长沙410082)
  • 出版日期:2020-03-30 发布日期:2021-03-24
  • 作者简介:邓子云(1979—),男,湖南双峰人,教授,博士,研究方向:大数据,产业经济学,高等职业教育,E-mail: dengziyun@126.com。
  • 基金资助:
    教育部“天诚汇智”基金资助项目(2018A01010); 湖南省自然科学基金资助项目(2020JJ7091)

A Screening Method of Machine Learning Model for Auxiliary Diagnosis

  1. (1. Faculty of Hunan Business, Changsha Commerce & Tourism College, Changsha 410116, China; 
     2. National Supercomputing Center in Changsha, Hunan University, Changsha 410082, China)
  • Online:2020-03-30 Published:2021-03-24

摘要: 为自动向医生推荐用于疾病辅助诊断的机器学习模型,提出一种筛选机器学习模型的方法。该筛选方法分为3个步骤:用训练准确度和测试准确度筛选机器学习模型;用查准率、召回率和F1成绩筛选机器学习模型;用带权值的总成绩计算公式推荐最优的机器学习模型。以乳腺癌辅助诊断为例,最终从8个机器学习模型中筛选并训练出高斯核心函数向量机模型(γ=0.5)推荐给医生使用,因为这个模型除满足筛选方法的3个条件外,总成绩最高,达到了0.985。

关键词: 机器学习模型, 辅助诊断, 筛选方法, 学习曲线, 模型成绩

Abstract: To automatically recommend machine learning models for auxiliary diagnosis of diseases to doctors, a screening method of machine learning model is proposed. The screening method consists of three steps including screening machine learning models with training accuracy and testing accuracy, screening machine learning models with precision rates, recall rates and F1 scores, recommending the optimal machine learning model using the total score formula with weights. Taking the auxiliary diagnosis of breast cancer as an example, the support vector machine model (γ=0.5) based on Gaussian RBF (Radial Basis Function) is finally selected among 8 machine learning models and recommended for doctors to use, because in addition to meeting the three conditions of the screening method, the model achieves the highest total score of 0.985.

Key words: machine learning model, auxiliary diagnosis, screening method, learning curve, model score