System support for keyword-based search in structured peer-to-peer systems.

Abstract

In this dissertation, we present protocols for building a distributed search infrastructure over structured Peer-to-Peer systems. Unlike existing search engines, which consist of large server farms managed by a centralized authority, our approach makes use of a distributed set of end-hosts built out of commodity hardware. These end-hosts cooperatively construct and maintain the search infrastructure.

The main challenges with distributing such a system include node failures, churn, and data migration. Localities inherent in query patterns also cause load imbalances and hot spots that severely impair performance. Users of search systems want their results returned quickly and in ranked order. Our main contribution is to show that a scalable, robust, and distributed search infrastructure can be built over existing Peer-to-Peer systems through the use of techniques that address these problems. We present a decentralized scheme for ranking search results without prohibitive network or storage overhead. We show that caching allows for efficient query evaluation and present a distributed data structure, called the View Tree, that enables efficient storage and retrieval of cached results. We also present a lightweight adaptive replication protocol, called LAR, that can adapt to different kinds of query streams and is extremely effective at eliminating hot spots. Finally, we present techniques for storing indexes reliably. Our approach is to use an adaptive partitioning protocol to store large indexes and employ efficient redundancy techniques to handle failures. Through detailed analysis and experiments, we show that our techniques are efficient and scalable, and that they make distributed search feasible.
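
As background for the setting the abstract describes, the following is a minimal sketch of keyword search over a structured overlay, assuming a consistent-hashing ring in which each keyword's posting list lives on the node that owns the keyword's hash. It is not the dissertation's protocol: the View Tree, LAR, ranking, and failure handling are not modeled, and every name here (`ToyKeywordDHT`, `ring_hash`, `publish`, `search`) is hypothetical.

```python
import bisect
import hashlib


def ring_hash(key: str) -> int:
    """Map a string to a point on a 2**32 identifier ring (illustrative choice)."""
    return int.from_bytes(hashlib.sha1(key.encode("utf-8")).digest()[:4], "big")


class ToyKeywordDHT:
    """In-process stand-in for a structured overlay; no networking or churn."""

    def __init__(self, node_names):
        # Place nodes on the ring, sorted by their hashed position.
        self.ring = sorted((ring_hash(name), name) for name in node_names)
        self.points = [point for point, _ in self.ring]
        # Per-node inverted index: node -> keyword -> set of document ids.
        self.store = {name: {} for name in node_names}

    def owner(self, keyword: str) -> str:
        """Return the first node at or after the keyword's ring position."""
        i = bisect.bisect_left(self.points, ring_hash(keyword)) % len(self.ring)
        return self.ring[i][1]

    def publish(self, doc_id: str, keywords) -> None:
        """Add doc_id to the posting list of each keyword on its owner node."""
        for kw in keywords:
            self.store[self.owner(kw)].setdefault(kw, set()).add(doc_id)

    def search(self, keywords) -> set:
        """Conjunctive query: intersect the posting lists of all keywords."""
        postings = [self.store[self.owner(kw)].get(kw, set()) for kw in keywords]
        if not postings:
            return set()
        postings.sort(key=len)  # start from the smallest list
        return set.intersection(*postings)


if __name__ == "__main__":
    dht = ToyKeywordDHT(["node-a", "node-b", "node-c"])
    dht.publish("doc1", ["peer", "search"])
    dht.publish("doc2", ["peer", "cache"])
    print(sorted(dht.search(["peer", "search"])))
```

In this toy layout a popular keyword concentrates its whole posting list and all of its queries on a single node, which illustrates the kind of hot spot and large-index problem that the abstract says caching, adaptive replication, and adaptive partitioning are meant to address.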

Bibliographic record

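  • Author

    Gopalakrishnan, Vijay

  • Author affiliation

    University of Maryland, College Park

  • Degree-granting institution: University of Maryland, College Park
  • Subject: Computer Science
  • Degree: Ph.D.
  • Year: 2006
  • Pages: 198 p.
  • Total pages: 198
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification: Automation technology, computer technology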
