首页> 外文会议>International Conference on Computational Intelligence and Communication Networks >Information Retrieval from the Web and Application of Migrating Crawler
【24h】

Information Retrieval from the Web and Application of Migrating Crawler

机译:从网页检索和迁移履带的应用程序

获取原文

摘要

Study reports that about 40% of current internet traffic and bandwidth consumption is due to the web crawlers that retrieve pages for indexing by the different search engines. As the size of the web continues to grow, searching it for useful information has become increasingly difficult. The centralized crawling techniques are unable to cope up with constantly growing web. In this paper it is presented that distributed crawling methods based on migrating crawlers are an essential tool for allowing such access that minimizes network utilization and also keeps up with document changes.
机译:研究报告称,当前Internet流量和带宽消耗的约40%是由于Web爬虫,可以检索由不同的搜索引擎索引的页面。随着Web的大小继续增长,搜索有用的信息已经变得越来越困难。集中式爬行技术无法应对不断增长的网络。在本文中,介绍了基于迁移爬虫的分布式爬网方法是允许这种访问最小化网络利用率并跟上文档变化的必要工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号