首页> 外文学位 >Topic Sensitive SourceRank: Extending SourceRank for Performing Context-Sensitive Search over Deep Web.
【24h】

Topic Sensitive SourceRank: Extending SourceRank for Performing Context-Sensitive Search over Deep Web.

机译:主题敏感SourceRank:扩展SourceRank以在Deep Web上执行上下文相关搜索。

获取原文
获取原文并翻译 | 示例

摘要

Source selection is one of the foremost challenges for searching deep-web. For a user query, source selection involves selecting a subset of deep-web sources expected to provide relevant answers to the user query. Existing source selection models employ query-similarity based local measures for assessing source quality. These local measures are necessary but not sufficient as they are agnostic to source trustworthiness and result importance, which, given the autonomous and uncurated nature of deep-web, have become indispensible for searching deep-web. SourceRank provides a global measure for assessing source quality based on source trustworthiness and result importance. SourceRank's effectiveness has been evaluated in single-topic deep-web environments. The goal of the thesis is to extend sourcerank to a multi-topic deep-web environment. Topic-sensitive sourcerank is introduced as an effective way of extending sourcerank to a deep-web environment containing a set of representative topics. In topic-sensitive sourcerank, multiple sourcerank vectors are created, each biased towards a representative topic. At query time, using the topic of query keywords, a query-topic sensitive, composite sourcerank vector is computed as a linear combination of these pre-computed biased sourcerank vectors. Extensive experiments on more than a thousand sources in multiple domains show 18-85% improvements in result quality over Google Product Search and other existing methods.
机译:来源选择是搜索深度网络的首要挑战之一。对于用户查询,源选择涉及选择希望为用户查询提供相关答案的深层网络源的子集。现有的源选择模型采用基于查询相似性的本地度量来评估源质量。这些本地措施是必要的,但还不足以使它们与获得信任度和结果的重要性无关,鉴于深层网络的自主性和未经整理的性质,这些措施已成为搜索深层网络必不可少的手段。 SourceRank提供了一种基于源可信度和结果重要性来评估源质量的全局度量。 SourceRank的有效性已在单主题深度网络环境中进行了评估。本文的目的是将SourceRank扩展到多主题深网环境。引入主题敏感的sourcerank是将Sourcerank扩展到包含一组代表性主题的深层网络环境的有效方法。在对主题敏感的sourcerank中,创建了多个sourcerank向量,每个向量都偏向代表主题。在查询时,使用查询关键字的主题,将查询主题敏感的复合sourcerank向量计算为这些预先计算的有偏的sourcerank向量的线性组合。在多个域中的上千个源上进行的广泛实验表明,与Google购物和其他现有方法相比,结果质量提高了18-85%。

著录项

  • 作者

    Jha, Manishkumar.;

  • 作者单位

    Arizona State University.;

  • 授予单位 Arizona State University.;
  • 学科 Computer Science.;Information Science.
  • 学位 M.S.
  • 年度 2011
  • 页码 44 p.
  • 总页数 44
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号