...
首页> 外文期刊>Digital investigation >SankeyVis: Visualizing active relationship from emails based on multiple dimensions and topic classification methods
【24h】

SankeyVis: Visualizing active relationship from emails based on multiple dimensions and topic classification methods

机译:SankeyVis:根据多维尺寸和主题分类方法从电子邮件中可视化主动关系

获取原文
获取原文并翻译 | 示例
           

摘要

The explosive growth of email has led to the rapid development of e-mail-based forensics, and at the same time, it has brought enormous challenges. As a special kind of data, email consists of structured metadata and unstructured email Body. Most attention is paid to visualization forensics of metadata at present, which is more focused on the mining of social network relationships between senders and recipients. These methods limit the range of email visualization forensics. Visual forensics of semantic analysis of the email body is relatively rare and difficult to connect semantic analysis with visualization. In recent years, the booming development of machine learning has extended the focus of forensics to the email body. This paper proposed SankeyVis: a visualization model for email forensics of active relation based on multiple dimensions and LDA topic classification methods, focusing on mining social relationships and semantic patterns in emails. SankeyVis conducts forensic work from the four data attributes of the email, "From," "To," "Date," and "Body," and the data is divided into two parts, the email header and email body according to the structure. The email header is used to get address pair with the working relationship after selecting and for the email body. Then introduced the Latent Dirichlet Allocation model to classify the topic of the email body discussed and adopt the adaption Sankey diagram to conduct forensic work from the topic semantic. It is proved that tested well by adapting to the Enron corpus. SankeyVis integrates structured and unstructured data in the visualization of email forensics, achieving visual forensics of email content. It breaks the limitations of dimensions and supports adding more than four attributes for forensics, extending the breadth of email forensics. SankeyVis reveals the topics of email senders and recipients with active relationships discussed at different time units and supports for forensics of email content to varying levels of relationships, extending the depth of email forensics. (C) 2020 Elsevier Ltd. All rights reserved.
机译:电子邮件的爆炸性增长导致了基于电子邮件的快速发展,同时,它带来了巨大的挑战。作为一种特殊的数据,电子邮件由结构化元数据和非结构化电子邮件机身组成。目前,最关注的是Metadata的可视化取证,这更为集中在发件人和收件人之间的社交网络关系的挖掘。这些方法限制了电子邮件可视化取证的范围。通过可视化进行语义分析,对电子邮件主体的语义分析的视觉取证是相对罕见的。近年来,机器学习的蓬勃发展将取证焦点扩展到电子邮件机构。本文提出了SankeyVis:基于多维和LDA主题分类方法的主动关系电子邮件取证的可视化模型,专注于在电子邮件中采矿社会关系和语义模式。 SankeyVis从电子邮件的四个数据属性进行法医工作,“来自”,“”,“”日期“和”body“,并且数据分为两个部分,电子邮件标题和电子邮件主体。电子邮件标题用于在选择和电子邮件主体后使用工作关系进行地址对。然后引入了潜在的Dirichlet分配模型,以对讨论的电子邮件机构的主题进行分类,并采用自适应SANKey图来从主题中进行法医工作。证明通过适应安龙语料库来测试。 SankeyVis在电子邮件取证可视化中集成了结构化和非结构化数据,实现了电子邮件内容的视觉取证。它破坏了维度的局限性,并支持对本的四分之一属性,扩展了电子邮件取证的广度。 SankeyVis在不同时间单位讨论的电子邮件发件人和收件人的主题,并支持电子邮件内容的质量级别与不同的关系级别,扩展了电子邮件取证的深度。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号