首页> 外文会议>2010 International Conference on Information Science and Applications (ICISA 2010) >Analyzing the Web Crawler as a Feed Forward Engine for an Efficient Solution to the Search Problem in the Minimum Amount of Time through a Distributed Framework
【24h】

Analyzing the Web Crawler as a Feed Forward Engine for an Efficient Solution to the Search Problem in the Minimum Amount of Time through a Distributed Framework

机译:将Web爬网程序作为前馈引擎进行分析,以通过分布式框架在最短时间内有效解决搜索问题

获取原文
获取原文并翻译 | 示例

摘要

A web crawler forms the backbone of a search engine and this backbone needs a careful re- assessment that could enhance the efficiency of search engines. This paper conducts such a re- assessment from the perspective of systems and this is achieved through implementation and analysis of a web crawler "VisionerBOT" as a feed forward engine for search engines using the MapReduce distributed programming model. Our crawler implementations revisit the classical OS debate of threads vs. events, with a significant contribution from our work which concludes that events is the ideal way forward for web crawlers. Furthermore, in implementing the feed forward mechanisms within the web crawler, we came up with some important design considerations for the operating system research community which can lead to a whole new class of operating systems.
机译:Web搜寻器构成了搜索引擎的骨干,并且需要对该骨干进行仔细的重新评估,以提高搜索引擎的效率。本文从系统的角度进行了这种重新评估,这是通过使用MapReduce分布式编程模型对作为搜寻引擎前馈引擎的网络爬虫“ VisionerBOT”的实施和分析来实现的。我们的搜寻器实现重新审视了关于线程与事件的经典OS辩论,我们的工作做出了重大贡献,结论是事件是Web搜寻器前进的理想方式。此外,在Web爬网程序中实施前馈机制时,我们为操作系统研究界提出了一些重要的设计注意事项,这可能会导致全新的操作系统类别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号