...
首页> 外文期刊>Journal of web semantics: >Entity summarization: State of the art and future challenges
【24h】

Entity summarization: State of the art and future challenges

机译:实体摘要:艺术状态和未来挑战

获取原文
获取原文并翻译 | 示例
           

摘要

The increasing availability of semantic data has substantially enhanced Web applications. Semantic data such as RDF data is commonly represented as entity-property-value triples. The magnitude of semantic data, in particular the large number of triples describing an entity, could overload users with excessive amounts of information. This has motivated fruitful research on automated generation of summaries for entity descriptions to satisfy users & rsquo; information needs efficiently and effectively. We focus on this prominent topic of entity summarization, and our research objective is to present the first comprehensive survey of entity summarization research. Rather than separately reviewing each method, our contributions include (1) identifying and classifying technical features of existing methods to form a high-level overview, (2) identifying and classifying frameworks for combining multiple technical features adopted by existing methods, (3) collecting known benchmarks for intrinsic evaluation and efforts for extrinsic evaluation, and (4) suggesting research directions for future work. By investigating the literature, we synthesized two hierarchies of techniques. The first hierarchy categories generic technical features into several perspectives: frequency and centrality, informativeness, and diversity and coverage. In the second hierarchy we present domain-specific and task-specific technical features, including the use of domain knowledge, context awareness, and personalization. Our review demonstrated that existing methods are mainly unsupervised and they combine multiple technical features using various frameworks: random surfer models, similarity-based grouping, MMR-like re-ranking, or combinatorial optimization. We also found a few deep learning based methods in recent research. Current evaluation results and our case study showed that the problem of entity summarization is still far from being solved. Based on the limitations of existing methods revealed in the review, we identified several future directions: the use of semantics, human factors, machine and deep learning, non-extractive methods, and interactive methods. (c) 2021 Elsevier B.V. All rights reserved.commentSuperscript/Subscript Available/comment
机译:增加语义数据的可用性大大增强了Web应用程序。诸如RDF数据之类的语义数据通常表示为实体属性值三元符。语义数据的大小,特别是描述实体的大量三元组,可以过度具有过多的信息的用户。这有促进了对满足用户和rsquo的实体描述的自动化生成的富有成果研究;信息有效且有效地需求。我们专注于这个实体摘要的突出主题,我们的研究目标是展示对实体摘要研究的第一个综合调查。而不是单独审查每个方法,我们的贡献包括(1)识别和分类现有方法的技术特征,以形成高级概述,(2)识别和分类框架,用于组合现有方法采用的多种技术功能(3)收集的框架内在评估和外在评估的努力的已知基准,以及(4)建议未来工作的研究方向。通过调查文献,我们合成了两种技艺的技术。第一层级类别通用技术功能分为几个视角:频率和中心,信息性和多样性和覆盖范围。在第二层次结构中,我们呈现特定于域的特定于任务特定的技术功能,包括使用域知识,上下文意识和个性化。我们的审核证明现有方法主要是无监督,并使用各种框架组合多种技术特征:随机冲浪模型,基于相似性的分组,MMR样重新排序或组合优化。我们还在最近的研究中发现了一些基于深度学习的方法。目前的评估结果和我们的案例研究表明,实体摘要问题仍未解决。根据审查中现有方法的局限性,我们确定了几个未来的方向:使用语义,人类因素,机器和深度学习,非提取方法和交互方法。 (c)2021 elestvier b.v.保留所有权利。&注释&上标/下标可用& /评论

著录项

  • 来源
    《Journal of web semantics:》 |2021年第5期|100647.1-100647.16|共16页
  • 作者单位

    Nanjing Univ State Key Lab Novel Software Technol Nanjing Peoples R China;

    Nanjing Univ State Key Lab Novel Software Technol Nanjing Peoples R China;

    Samsung Res Amer Mountain View CA USA|Wright State Univ Knoesis Ctr Dayton OH 45435 USA;

    Nanjing Univ State Key Lab Novel Software Technol Nanjing Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Entity summarization; Triple ranking; Semantic data;

    机译:实体摘要;三倍排名;语义数据;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号