A Multi-Modal Stacked Ensemble Model for Bipolar Disorder Classification

IEEE Transactions on Affective Computing(2023)

引用 17|浏览47
暂无评分
摘要
We propose an automatic ternary classification model for Bipolar Disorder (BD) states. As input information, the model uses speech signals from patients’ audio-visual recordings of structured interviews. The model classifies the patient's clinical state as Mania, Hypo-Mania, or Remission. We capture Mel-Frequency Cepstral Coefficients (MFCCs) and Geneva Minimalistic Acoustic Parameter Set (GeMAPS) as audio features. We compute linguistic and sentiment features for each subject's transcript. We present a stacked ensemble classifier to classify all fused features after feature selection. A set of three homogeneous Convolutional Neural Networks (CNNs) and a Multi Layer Perceptron (MLP) construct the first-level and second-level of the stacked ensemble classifier respectively. Moreover, we use the Neural Architecture Search (NAS) reinforcement learning strategy to optimize the networks and their hyperparameters. We show that our stacked ensemble framework outperforms existing models on the BD Turkish corpus with a $ 59.3\%$ Unweighted Average Unit (UAR) on the test set. To the best of our knowledge, this is the highest UAR achieved on this dataset.
更多
查看译文
关键词
Affective computing,bipolar disorder,stacked ensemble learning,audio and textual processing,automated learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要