首页> 外文会议>第十七届国际万维网大会(the 17th International World Wide Web Conference)(WWW08)论文集 >RACE: Finding and Ranking Compact Connected Trees for Keyword Proximity Search over XML Documents
【24h】

RACE: Finding and Ranking Compact Connected Trees for Keyword Proximity Search over XML Documents

机译:RACE:查找和排名紧凑的连接树,以便通过XML文档进行关键字邻近搜索

获取原文

摘要

In this paper, we study the problem of keyword proximity search over XML documents and leverage the efficiency and effectiveness. We take the disjunctive semantics among input keywords into consideration and identify meaningful compact connected trees as the answers of keyword proximity queries. We introduce the notions of Compact Lowest Common Ancestor (CLCA) and Maximal CLCA (MCLCA) and propose Compact Connected Trees (CCTrees) and Maximal CCTrees (MCCTrees) to efficiently and effectively answer keyword queries. We propose a novel ranking mechanism, RACE, to Rank compAct Connected trEes, by taking into consideration both the structural similarity and the textual similarity. Our extensive experimental study shows that our method achieves both high search effciency and effectiveness, and outperforms existing approaches significantly.
机译:在本文中,我们研究了基于XML文档的关键字邻近搜索问题,并利用了效率和有效性。我们考虑了输入关键字之间的析取语义,并确定了有意义的紧凑连接树作为关键字邻近查询的答案。我们介绍了紧凑型最低共同祖先(CLCA)和最大CLCA(MCLCA)的概念,并提出了紧凑连接树(CCTrees)和最大CCTree(MCCTrees)来有效地回答关键字查询。我们考虑结构上的相似性和文本上的相似性,提出了一种新颖的排名机制RACE来对compact关联的域名进行排名。我们广泛的实验研究表明,我们的方法既能达到较高的搜索效率,又能有效地胜过现有方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号