首页> 外文学位 >Adaptive Graph Walk Based Similarity Measures in Entity-Relation Graphs.
【24h】

Adaptive Graph Walk Based Similarity Measures in Entity-Relation Graphs.

机译:实体-关系图中基于自适应图行走的相似性度量。

获取原文
获取原文并翻译 | 示例

摘要

Relational or semi-structured data is naturally represented by a graph schema, where nodes denote entities and directed typed edges represent the relations between them. Such graphs are heterogeneous in the sense that they describe different types of objects and multiple types of links. For example, email data can be described in a graph that includes messages, persons, dates and other objects; in this graph, a message may be associated with a person with different relations, such as "sent-to", "sent-from" and so on. In the past, researchers have suggested to apply random graph walks in order to elicit a measure of similarity between entities that are not directly connected in a graph. In this thesis, we suggest a general framework, in which different arbitrary queries (for instance, "what persons are most related to this email message?") are addressed using random walks. Naturally, there are many types of queries possible that correspond to various flavors of inter-entity similarity; several learning techniques are therefore suggested and evaluated that adapt the graph-walk based search to a query type.;The framework is applied in the thesis to two different domains. The first domain is personal information management, where it is shown how seemingly different tasks like alias finding, intelligent message threading and person name disambiguation, can be addressed uniformly as search queries using the adaptive graph-walk based similarity measure. The second domain evaluated is the processing of parsed text, where a graph represents corpora of structured parsed text, and adaptive graph walks are applied to induce inter-word similarity measures for tasks such as coordinate term extraction.;Finally, design and scalability considerations are discussed.
机译:关系或半结构化数据自然地由图模式表示,其中节点表示实体,有向类型的边表示实体之间的关系。这些图在描述不同类型的对象和多种类型的链接的意义上是异构的。例如,电子邮件数据可以用包含消息,人员,日期和其他对象的图形描述。在此图中,消息可能与具有不同关系的人相关联,例如“发送给”,“发送自”等。过去,研究人员建议应用随机图遍历,以得出未直接连接到图中的实体之间的相似性度量。在本文中,我们提出了一个通用框架,其中使用随机游标处理不同的任意查询(例如,“与该电子邮件消息最相关的人是谁?”)。自然地,有许多类型的查询可能与实体间相似性的各种风格相对应。因此,提出了几种学习技术,并对其进行了评估,使基于图行走的搜索适合于查询类型。;该框架在本文中应用于两个不同的领域。第一个领域是个人信息管理,其中显示了如何使用基于自适应图行走的相似性度量将表面上看起来不同的任务(如别名查找,智能消息线程和人名歧义消除)作为搜索查询统一解决。评估的第二个域是解析文本的处理,其中图形表示结构化解析文本的语料库,并应用自适应图形漫游来诱导词间相似性度量以完成诸如坐标项提取之类的任务。最后,设计和可伸缩性考虑因素是讨论过。

著录项

  • 作者

    Minkov, Einat.;

  • 作者单位

    Carnegie Mellon University.;

  • 授予单位 Carnegie Mellon University.;
  • 学科 Artificial Intelligence.;Computer Science.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 176 p.
  • 总页数 176
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号