An Improved Retrievability-Based Cluster-Resampling Approach for Pseudo Relevance Feedback

Shariq Bashir

首页> 外文期刊>Computers >An Improved Retrievability-Based Cluster-Resampling Approach for Pseudo Relevance Feedback

【24h】

An Improved Retrievability-Based Cluster-Resampling Approach for Pseudo Relevance Feedback

机译：一种改进的基于可检索性的伪相关反馈聚类重采样方法

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Cluster-based pseudo-relevance feedback (PRF) is an effective approach for searching relevant documents for relevance feedback. Standard approach constructs clusters for PRF only on the basis of high similarity between retrieved documents. The standard approach works quite well if the retrieval bias of the retrieval model does not create any effect on the retrievability of documents. In our experiments we observed when a collection contains retrieval bias, then high retrievable documents of clusters are frequently retrieved at top positions for most of the queries, and these drift the relevance feedback away from relevant documents. For reducing (retrieval bias) noise, we enhance the standard cluster construction approach by constructing clusters on the basis of high similarity and retrievability. We call this retrievability and cluster-based PRF. This enhanced approach keeps only those documents in the clusters that are not frequently retrieve due to retrieval bias. Although this approach improves the effectiveness, however, it penalizes high retrievable documents even if these documents are most relevant to the clusters. To handle this problem, in a second approach, we extend the basic retrievability concept by mining frequent neighbors of the clusters. The frequent neighbors approach keeps only those documents in the clusters that are frequently retrieved with other neighbors of clusters and infrequently retrieved with those documents that are not part of the clusters. Experimental results show that two proposed extensions are helpful for identifying relevant documents for relevance feedback and increasing the effectiveness of queries.

机译：基于聚类的伪相关反馈（PRF）是一种用于搜索相关文档以获取相关反馈的有效方法。标准方法仅基于检索到的文档之间的高度相似性为PRF构建聚类。如果检索模型的检索偏差不会对文档的可检索性产生任何影响，则标准方法会很好地工作。在我们的实验中，我们观察到当集合包含检索偏向时，对于大多数查询，经常在顶部位置检索簇的高可检索文档，这些文档会使相关性反馈偏离相关文档。为了减少（检索偏差）噪声，我们通过在高度相似性和可检索性的基础上构造聚类来增强标准聚类构建方法。我们称之为可检索性和基于集群的PRF。这种增强的方法仅将那些由于检索偏差而不会经常检索的文档保留在群集中。尽管此方法提高了有效性，但是，即使这些文档与群集最相关，它也会对高可检索文档造成不利影响。为了解决这个问题，在第二种方法中，我们通过挖掘群集的频繁邻居来扩展基本可检索性概念。频繁邻居方法仅将那些与簇的其他邻居经常检索的文档和不属于簇的那些文档很少检索的文档保留在簇中。实验结果表明，提出的两个扩展名有助于识别相关文档以进行相关反馈并提高查询的有效性。

著录项

来源
《Computers》 |2016年第4期|共页
作者
Shariq Bashir;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类数学;
关键词

相似文献

外文文献
中文文献
专利

1. Improving pseudo relevance feedback based query expansion using genetic fuzzy approach and semantic similarity notion [J] . Pragati Bhatnagar, Narendra Pareek Journal of Information Science . 2014,第4期

机译：利用遗传模糊方法和语义相似度概念改进基于伪相关反馈的查询扩展
2. Experiments with dictionary-based CLIR using graded relevance assessments: Improving effectiveness by pseudo-relevance feedback [J] . Raija Lehtokangas, Heikki Keskustalo, Kalervo Jaervelin Information retrieval . 2006,第4期

机译：使用分级相关性评估进行基于字典的CLIR的实验：通过伪相关性反馈提高有效性
3. Improving retrievability with improved cluster-based pseudo-relevance feedback selection [J] . Shariq Bashir Expert systems with applications . 2012,第8期

机译：通过改进的基于聚类的伪相关反馈选择来提高可检索性
4. A Clustering Approach to Improving Pseudo-Relevance Feedback: Improving Retrieval Effetiveness by Removing Noisy Documents [C] . Li Changchun, Wang Jun-yi 2012 Fourth International Symposium on Information Science and Engineering. . 2012

机译：一种改进伪相关反馈的聚类方法：通过删除嘈杂的文档来提高检索效果
5. Relational information retrieval: Using relevance feedback and parallelism to improve accuracy and performance. [D] . Lundquist, Carol. 1997

机译：关系信息检索：使用相关反馈和并行性来提高准确性和性能。
6. Improved biomedical term selection in pseudo relevance feedback [O] . Muhammad Nabeel Asim, Muhammad Wasim, Muhammad Usman Ghani Khan, 2018

机译：伪相关反馈中改进的生物医学术语选择
7. An Improved Retrievability-Based Cluster-Resampling Approach for Pseudo Relevance Feedback [O] . Shariq Bashir 2016

机译：一种改进的基于可恢复性的伪相关反馈聚类重采样方法
8. Web-based Pseudo Relevance Feedback for Microblog Retrieval. [R] . A. S. El Din W. Magdy 2012

机译：基于Web的微相关检索伪相关反馈。

An Improved Retrievability-Based Cluster-Resampling Approach for Pseudo Relevance Feedback

摘要

著录项

相似文献

相关主题

期刊订阅