首页> 外国专利> Web document keyword and phrase extraction

Web document keyword and phrase extraction

机译:Web文档关键词和短语提取

摘要

Extraction analysis techniques biased, in part, by query frequency information from a query log file and/or search engine cache are employed along with machine learning processes to determine candidate keywords and/or phrases of web documents. Web oriented features associated with the candidate keywords and/or phrases are also utilized to analyze the web documents. A keyword and/or phrase extraction mechanism can be utilized to score keywords and/or phrases in a web document and estimate a likelihood that the keywords and/or phrases are relevant, for example, in an advertising system and the like.
机译:提取分析技术部分地受到来自查询日志文件和/或搜索引擎缓存的查询频率信息的偏见,并与机器学习过程一起使用,以确定Web文档的候选关键字和/或短语。与候选关键字和/或短语相关联的面向Web的功能也可用于分析Web文档。关键字和/或短语提取机制可以用于对网络文档中的关键字和/或短语进行评分,并估计关键字和/或短语相关的可能性,例如在广告系统等中。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号