计算机与现代化 ›› 2024, Vol. 0 ›› Issue (01): 67-73.doi: 10.3969/j.issn.1006-2475.2024.01.011

• 人工智能 • 上一篇    下一篇

基于异质属性融合的危重疾病二阶段预测模型

  

  1. (1.广东工业大学计算机学院,广东 广州 510006; 2.广东省第二人民医院关节骨科科室,广东  广州 510317;
    3.广东工业大学自动化学院,广东 广州 510006)
  • 出版日期:2024-01-23 发布日期:2024-02-23
  • 作者简介:詹少强(1997—),男,广东汕尾人,硕士研究生,研究方向:机器学习,数据挖掘,E-mail: 1078370943@qq.com; 通信作者:曾安(1978—),女,湖南新化人,教授,博士,研究方向:人工智能,图像处理,E-mail: zengan@gdut.edu.cn; 张逸群(1991—),男,江苏滨海人,副教授,博士,研究方向:数据挖掘,无监督学习,E-mail: yqzhang@gdut.edu.cn; 孙鸿涛(1966—),男,河南洛阳人,主任医师,博士,研究方向:关节骨科,E-mail: 13808849918@163.com; 张小波(1977—),男,湖北随州人,讲师,博士,研究方向:智能控制,E-mail: zxb_leng@gdut.edu.cn。
  • 基金资助:
    广东省自然科学基金资助项目(2022A1515011592)

Two-stage Critical Disease Prediction Model Based on Heterogeneous Attribute Fusion

  1. (1. School of Computers, Guangdong University of Technology, Guangzhou 510006, China;
    2. Department of Orthopedics, The Second People’s Hospital of Guangdong Province, Guangzhou 510317, China;
    3. School of Automation, Guangdong University of Technology, Guangzhou 510006, China)
  • Online:2024-01-23 Published:2024-02-23

摘要: 摘要:随着电子健康记录(Electronic Health Record, EHR)的出现与广泛应用,基于EHR数据的预测模型可以起到早期检测和干预疾病的作用。异质属性在EHR数据中普遍存在,但是难以做到深度利用,因此可通过对数据样本进行异质属性融合的方法,为后续模型训练提供信息丰富的数据表征基础。本文设计一种高效的二阶段预测模型,用于解决重疾预测中存在的时效与成本等问题。该模型的第一阶段对病例样本进行粗粒度预测,将危重程度低的病例进行疾病初筛,起到提前分流病人的作用;第二阶段模型则基于第一阶段的粗滤结果,对潜在的危重病例进行更细粒度的预测。通过实验验证,经过异质属性融合处理后,在选择前6个时间点构造非时序模型时,二阶段模型可以较好地兼具疾病初筛以及疾病预测的效果。

关键词: 关键词:异质属性融合, 疾病初筛, 二阶段模型

Abstract: Abstract: With the emergence and wide application of Electronic Health Record (EHR), the prediction model based on EHR data can be used for early detection and intervention of diseases. Heterogeneous attributes are ubiquitous in EHR data, but it is difficult to thoroughly exploit their information. Therefore, the method of heterogeneous attribute fusion provides an informative data representation basis for subsequent model training. This paper designs an efficient two-stage prediction model for solving the problems of time and cost in predicting critical illness. In the first stage of the model, coarse-grained prediction is performed on patient samples. Patients with low severity are initially screened out, which plays a key role in patient diversion. The second stage makes more fine-grained predictions of potentially critical patients based on the coarse filtering results of the first stage. The experimental verified that, after heterogeneous attribute fusion, when we select the first 6 time points to construct a non-temporal model, the two-stage model has better performance in both initial disease screening and disease prediction.

Key words: Key words: heterogeneous attribute fusion, disease screening, two-stage model

中图分类号: