...
首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Cell-Coupled Long Short-Term Memory With L -Skip Fusion Mechanism for Mood Disorder Detection Through Elicited Audiovisual Features
【24h】

Cell-Coupled Long Short-Term Memory With L -Skip Fusion Mechanism for Mood Disorder Detection Through Elicited Audiovisual Features

机译:具有L-Skip融合机制的细胞耦合长时短期记忆,可通过有效的视听功能检测情绪障碍

获取原文
获取原文并翻译 | 示例
           

摘要

In early stages, patients with bipolar disorder are often diagnosed as having unipolar depression in mood disorder diagnosis. Because the long-term monitoring is limited by the delayed detection of mood disorder, an accurate and one-time diagnosis is desirable to avoid delay in appropriate treatment due to misdiagnosis. In this paper, an elicitation-based approach is proposed for realizing a one-time diagnosis by using responses elicited from patients by having them watch six emotion-eliciting videos. After watching each video clip, the conversations, including patient facial expressions and speech responses, between the participant and the clinician conducting the interview were recorded. Next, the hierarchical spectral clustering algorithm was employed to adapt the facial expression and speech response features by using the extended Cohn-Kanade and eNTERFACE databases. A denoizing autoencoder was further applied to extract the bottleneck features of the adapted data. Then, the facial and speech bottleneck features were input into support vector machines to obtain speech emotion profiles (EPs) and the modulation spectrum (MS) of the facial action unit sequence for each elicited response. Finally, a cell-coupled long short-term memory (LSTM) network with an $L$ -skip fusion mechanism was proposed to model the temporal information of all elicited responses and to loosely fuse the EPs and the MS for conducting mood disorder detection. The experimental results revealed that the cell-coupled LSTM with the $L$ -skip fusion mechanism has promising advantages and efficacy for mood disorder detection.
机译:在早期阶段,双相情感障碍患者在情绪障碍诊断中通常被诊断为患有单相抑郁症。由于长期监测受到情绪障碍延迟检测的限制,因此需要一种准确且一次性的诊断来避免由于误诊导致的适当治疗延迟。在本文中,提出了一种基于启发的方法,通过使用患者观看六个情感视频所引起的反应来实现一次性诊断。观看每个视频剪辑后,记录参与者和进行访谈的临床医生之间的对话,包括患者的面部表情和语音响应。接下来,通过使用扩展的Cohn-Kanade和eNTERFACE数据库,采用了层次谱聚类算法来适应面部表情和语音响应特征。去噪自动编码器进一步应用于提取自适应数据的瓶颈特征。然后,将面部和语音瓶颈特征输入到支持向量机中,以针对每个引发的响应获得语音情感配置文件(EP)和面部动作单元序列的调制谱(MS)。最后,提出了一种具有$ L $跳跃融合机制的细胞耦合长短期记忆(LSTM)网络,以对所有引发的反应的时间信息进行建模,并松散融合EP和MS以进行情绪障碍检测。实验结果表明,具有$ L $ -skip融合机制的细胞偶联LSTM在检测情绪障碍方面具有有前途的优势和功效。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号