首页> 外文期刊>SIGKDD explorations >A Deep Generative Approach to Search Extrapolation and Recommendation
【24h】

A Deep Generative Approach to Search Extrapolation and Recommendation

机译:一种搜索外推和推荐的深度生成方法

获取原文
获取原文并翻译 | 示例
           

摘要

Related search query recommendation is a standard feature in many modern search engines. Interesting and relevant queries often increase the active time of users and improve the overall search experience. However, conventional approaches based on tag extraction, keywords matching or click graph link analysis suffer from the common problem of limited coverage and generalizability, which means the system could only make suggestions for a small portion of well-formed search queries. In this work, we propose a deep generative approach to construct a related search query for recommendation in a word-by-word fashion, given either an input query or the title of a document. We propose a novel two-stage learning framework that partitions the task into two simpler sub-problems, namely, relevant context words discovery and context dependent query generation. We carefully design a Relevant Words Generator (RWG) based on recurrent neural networks and a Dual- Vocabulary Sequence-to-Sequence (DV-Seq2Seq) model to address these problems. We also propose automated strategies that have retrieved three large datasets with 500K to 1 million instances, from a search click graph constructed based on 8 days of search histories in Tencent QQ Browser, for model training. By leveraging the dynamically discovered context words, our proposed framework outperforms other Seq2Seq generative baselines on a wide range of BLEU, ROUGE and Exact Match (EM) metrics.
机译:相关搜索查询推荐是许多现代搜索引擎中的标准功能。有趣和相关查询通常会增加用户的活动时间并提高整体搜索体验。然而,基于标签提取的传统方法,关键词匹配或点击图形链路分析遭受了有限覆盖范围和概括性的常见问题,这意味着系统只能为一小部分进行良好成本的搜索查询提出建议。在这项工作中,我们提出了一种深深的生成方法,以构建相关搜索查询以以字词的方式,给定输入查询或文档的标题。我们提出了一种新颖的两阶段学习框架,将任务分区为两个更简单的子问题,即相关的上下文单词发现和上下文相关的查询生成。我们根据经常性神经网络和双词汇序列到序列(DV-SEQ2SEQ)模型来仔细设计相关的单词生成器(RWG),以解决这些问题。我们还提出了从根据腾讯QQ浏览器中8天的搜索历史的搜索历史,为模型培训,从搜索点击图中检索了三个大型数据集的自动化策略。通过利用动态发现的上下文词语,我们提出的框架在各种Bleu,Rouge和精确匹配(EM)指标上表现出其他SEQ2Seq生成基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号