首页> 外文期刊>Computational Intelligence >WEB DISCOVERY AND FILTERING BASED ON TEXTUAL RELEVANCE FEEDBACK LEARNING
【24h】

WEB DISCOVERY AND FILTERING BASED ON TEXTUAL RELEVANCE FEEDBACK LEARNING

机译:基于文本相关反馈学习的Web发现和过滤

获取原文
获取原文并翻译 | 示例
           

摘要

We develop a new approach for Web information discovery and filtering. Our system, called WID, allows the user to specify long-term information needs by means of various topic profile specifications. An entire example page or an index page can be accepted as input for the discovery. It makes use of a simulated annealing algorithm to automatically explore new Web pages. Simulated annealing algorithms possess some favorable properties to fulfill the discovery objectives. Information retrieval techniques are adopted to evaluate the content-based relevance of each page being explored. The hyperlink information, in addition to the textual context, is considered in the relevance score evaluation of a Web page. WID allows users to provide three forms of the relevance feedback model, namely, the positive page feedback, the negative page feedback, and the positive keyword feedback. The system is domain independent and does not rely on any prior knowledge or information about the Web content. Extensive experiments have been conducted to demonstrate the effectiveness of the discovery performance achieved by WID.
机译:我们开发了一种用于Web信息发现和过滤的新方法。我们的系统称为WID,它允许用户通过各种主题资料规范来指定长期信息需求。可以接受整个示例页面或索引页面作为发现的输入。它利用模拟退火算法来自动浏览新网页。模拟退火算法具有一些有利的属性来满足发现目标。采用信息检索技术来评估正在浏览的每个页面的基于内容的相关性。在网页的相关性分数评估中,将考虑文本上下文之外的超链接信息。 WID允许用户提供三种形式的相关性反馈模型,即肯定页面反馈,否定页面反馈和肯定关键字反馈。该系统是域独立的,并且不依赖于有关Web内容的任何现有知识或信息。已经进行了广泛的实验,以证明WID实现的发现性能的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号