【24h】

Detecting Deceptive Language in Crime Interrogation

机译:在犯罪讯问中检测欺骗性语言

获取原文

摘要

Deception detection is a vital research problem studied by fields as diverse as psychology, forensic science, sociology. With cooperation with National Investigation Bureau, we have 496 transcript files, each of which contains a conversation of an interrogator and a subject of a real-world polygraph test during interrogation. Researchers have explored the possibility of natural language process techniques in gaming, news articles, interviews, and criminal narratives. In this paper, we explore the effect of the frontier natural language process technique to detect deceptiveness in these conversations. We regard this task as a binary classification problem. We utilize four different methods, inclusive of part-of-speech extraction, one-hot-encoding, means of embedding vectors, and BERT pre-trained model, to capture hidden information of transcript files into vectors. After that, we take these vectors as training samples of a hierarchy neural network, which is constructed by a fully-connected layer and/or an LSTM layer. After training, our system can take a transcript file as its input and classify whether the subject is deceptive or not. An Fl score 0.733 is achieved from our system.
机译:欺骗检测是一个至关重要的研究问题,其研究领域涉及心理学,法医学和社会学。在与国家调查局的合作下,我们拥有496个成绩单文件,每个文件都包含一个审讯员的谈话和一个在审问期间进行的现实测谎仪测试的主题。研究人员探索了自然语言处理技术在游戏,新闻报道,访谈和犯罪叙事中的可能性。在本文中,我们探索了前沿自然语言处理技术对检测这些对话中的欺骗性的影响。我们将此任务视为二进制分类问题。我们利用四种不同的方法将词条文件的隐藏信息捕获到向量中,其中包括词性提取,单次热编码,嵌入向量的手段以及BERT预训练模型。之后,我们将这些向量作为层次神经网络的训练样本,该层次神经网络由完全连接的层和/或LSTM层构成。经过培训后,我们的系统可以将成绩单文件作为输入,并对受试者是否具有欺骗性进行分类。从我们的系统中获得F1分数0.733。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号