...
首页> 外文期刊>World Wide Web >No-but-semantic-match: computing semantically matched xml keyword search results
【24h】

No-but-semantic-match: computing semantically matched xml keyword search results

机译:No-but-semantic-match:计算语义匹配的xml关键字搜索结果

获取原文
获取原文并翻译 | 示例
           

摘要

Users are rarely familiar with the content of a data source they are querying, and therefore cannot avoid using keywords that do not exist in the data source. Traditional systems may respond with an empty result, causing dissatisfaction, while the data source in effect holds semantically related content. In this paper we study this no-but-semantic-match problem on XML keyword search and propose a solution which enables us to present the top-k semantically related results to the user. Our solution involves two steps: (a) extracting semantically related candidate queries from the original query and (b) processing candidate queries and retrieving the top-k semantically related results. Candidate queries are generated by replacement of non-mapped keywords with candidate keywords obtained from an ontological knowledge base. Candidate results are scored using their cohesiveness and their similarity to the original query. Since the number of queries to process can be large, with each result having to be analyzed, we propose pruning techniques to retrieve the top-k results efficiently. We develop two query processing algorithms based on our pruning techniques. Further, we exploit a property of the candidate queries to propose a technique for processing multiple queries in batch, which improves the performance substantially. Extensive experiments on two real datasets verify the effectiveness and efficiency of the proposed approaches.
机译:用户很少熟悉他们要查询的数据源的内容,因此无法避免使用数据源中不存在的关键字。传统系统可能会以空结果进行响应,从而引起不满,而数据源实际上保留了语义相关的内容。在本文中,我们研究了XML关键字搜索中的这种非语义匹配问题,并提出了一种解决方案,使我们能够向用户展示与前k个语义相关的结果。我们的解决方案包括两个步骤:(a)从原始查询中提取与语义相关的候选查询,以及(b)处理候选查询并检索与前k个语义相关的结果。通过用从本体论知识库获得的候选关键字替换未映射的关键字来生成候选查询。使用候选结果的内聚性和与原始查询的相似性对候选结果进行评分。由于要处理的查询数量可能很大,因此必须分析每个结果,因此我们提出了修剪技术以有效检索前k个结果。我们基于修剪技术开发了两种查询处理算法。此外,我们利用候选查询的属性来提出一种批量处理多个查询的技术,从而大大提高了性能。在两个真实数据集上的大量实验验证了所提出方法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号