计算机与现代化 ›› 2023, Vol. 0 ›› Issue (10): 17-22.doi: 10.3969/j.issn.1006-2475.2023.10.003

• 人工智能 • 上一篇    下一篇

基于多模态特征融合的抑郁症识别

  

  1. (华南师范大学计算机学院,广东 广州 510631)
  • 出版日期:2023-10-26 发布日期:2023-10-26
  • 作者简介:谷明轩(1997—),男,广东广州人,硕士研究生,研究方向:多模态数据融合,E-mail: 1015851746@qq.com; 通信作者:范冰冰(1962—),男,江苏启东人,教授,博士,研究方向:云应用和云工程,移动互联网,大数据管理和应用,E-mail: fanbb1962@qq.com。
  • 基金资助:
    广东省重大科技专项(2016B030305003)

Feature-level Multimodal Fusion for Depression Recognition

  1. (School of Computer Science, South China Normal University, Guangzhou 510631, China)
  • Online:2023-10-26 Published:2023-10-26

摘要: 抑郁症是一种常见的精神疾病,现有的抑郁症诊断主要依赖于抑郁量表和精神科医生的访谈,具有较强的主观性。近年来,越来越多的研究者致力于通过脑电特征或音频特征识别抑郁症患者,但并未有研究将脑电信息与音频信息有效地结合起来,忽略了音频和脑电数据之间的相关性。因此本文提出一种基于全连接神经网络的多模态特征融合模型,通过对音频模态和脑电模态信息的特征融合提升抑郁症识别的准确率,为抑郁症的识别提供新的角度和方法。实验表明,多模态特征融合在MODMA数据集上的抑郁症识别准确率达到了81.58%且高于单模态抑郁症识别方法的准确率。这表明,相比于单模态识别,多模态特征融合模型能够提高抑郁症识别的准确率。

关键词: 关键词:多模态数据融合, 抑郁症识别, 特征融合, 全连接神经网络

Abstract: Abstract: Depression is a common psychiatric disorder. However, the existing diagnostic methods for depression mainly rely on scales and interviews with psychiatrists, which are highly subjective. In recent years, researchers have devoted themselves to identifying depressed patients by EEG features or audio features, but no study has effectively combined EEG information with audio information, ignoring the correlation between audio and EEG data. Therefore, this study proposes a feature-level multimodal fusion model to improve the accuracy of depression recognition. We combine the audio and EEG modality information based on a fully connected neural network. Our experiments show that the accuracy of depression recognition using feature-level multimodal fusion model on the MODMA dataset reaches 81.58%, which is higher than that of using single-modality. The results indicate that the feature-level multimodal fusion model can improve the accuracy of depression recognition compared to single-modality. Our research provides a new perspective and method for depression recognition.

Key words: Key words: multimodal data fusion, depression detection, feature-level fusion, fully-connected neural networks

中图分类号: