Beyond Bags of Words: Effectively Modeling Dependence and Features in Information Retrieval

Donald Metzler

首页> 外文期刊>ACM SIGIR FORUM >Beyond Bags of Words: Effectively Modeling Dependence and Features in Information Retrieval

【24h】

Beyond Bags of Words: Effectively Modeling Dependence and Features in Information Retrieval

机译：胜于千言万语：有效建模信息检索中的依存关系和特征

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Current state of the art information retrieval models treat documents and queries as bags of words. There have been many attempts to go beyond this simple representation. Unfortunately, few have shown consistent improvements in retrieval effectiveness across a wide range of tasks and data sets. Here, we propose a new statistical model for information retrieval based on Markov random fields. The proposed model goes beyond the bag of words assumption by allowing dependencies between terms to be incorporated into the model. This allows for a variety of textual and non-textual features to be easily combined under the umbrella of a single model. Within this framework, we explore the theoretical issues involved, parameter estimation, feature selection, and query expansion. We give experimental results from a number of information retrieval tasks, such as ad hoc retrieval and web search.

机译：当前最先进的信息检索模型将文档和查询视为单词袋。已经进行了超出这种简单表示的许多尝试。不幸的是，很少有人在跨各种任务和数据集的检索效率方面显示出持续改进。在此，我们提出了一种基于马尔可夫随机场的信息检索统计模型。所提出的模型通过允许将术语之间的依赖性合并到模型中，从而超出了单词假设的范围。这允许在单个模型的保护下轻松组合各种文本和非文本功能。在此框架内，我们探讨了涉及的理论问题，参数估计，特征选择和查询扩展。我们从许多信息检索任务（例如临时检索和Web搜索）中给出实验结果。

著录项

来源
《ACM SIGIR FORUM》 |2008年第1期|p.77|共1页
作者
Donald Metzler;
展开▼
作者单位

Computer Science Building 140 Governors Drive University of Massachusetts Amherst, MA 01003-9264;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Image moment invariants as local features for content based image retrieval using the Bag-of-Visual-Words model [J] . Karakasis E. G., Amanatiadis A., Gasteratos A., Pattern recognition letters . 2015,第apra1期

机译：图像矩不变性是使用“视觉袋”模型基于内容的图像检索的局部特征
2. Entropy Optimized Feature-Based Bag-of-Words Representation for Information Retrieval [J] . Nikolaos Passalis, Anastasios Tefas IEEE Transactions on Knowledge and Data Engineering . 2016,第7期

机译：熵优化的基于特征的词袋表示用于信息检索
3. Combining SURF and MSER along with Color Features for Image Retrieval System Based on Bag of Visual Words [J] . Heba A. Elnemr Journal of computer sciences . 2016,第4期

机译：基于视觉词袋的SURF，MSER与色彩特征相结合的图像检索系统
4. MODELING MULTIPLE VISUAL WORDS ASSIGNMENT FOR BAG-OF-FEATURES BASED MEDICAL IMAGE RETRIEVAL [C] . Jingyan Wang, Islam Almasri Signal processing, pattern recognition and applications ; Computer graphics and imaging . 2012

机译：基于特征包的医学图像检索对多个视觉单词分配建模
5. Beyond bags of words: Effectively modeling dependence and features in information retrieval. [D] . Metzler, Donald A., Jr. 2007

机译：字里行外：有效地建模信息检索中的依存关系和特征。
6. An effective content-based image retrieval technique for image visuals representation based on the bag-of-visual-words model [O] . Safia Jabeen, Zahid Mehmood, Toqeer Mahmood, -1

机译：基于视觉袋模型的基于内容的有效图像检索技术
7. Flower Image Retrieval Based on Bag-of-words Model and Multi-Features Fusion [O] . 苏秀英 2010

机译：基于词袋模型和多特征融合的花卉图像检索

Beyond Bags of Words: Effectively Modeling Dependence and Features in Information Retrieval

摘要

著录项

相似文献

相关主题

期刊订阅