Improving Web Pages Retrieval Using Combined Fields


Abstract

This article describes the participation of the REINA Research Group of the University of Salamanca in WebCLEF 2006. This year we participated in the Monolingual Mixed Task in Spanish. The entire EuroGOV collection was processed to select all the pages in Spanish. All the pages with domain .es were also pre-selected. Our objective this year was to try pre-retrieval techniques of combining information fields or elements from web pages as well as the retrieval capability of these fields. In vector-based retrieval systems, the combining of terms coming from different sources can be achieved by operating on the frequency of the terms in the document using a weight scheme of tf × idf. The BODY field is, of course, the most useful from the retrieval perspective, but the text of the backlinks brings considerable improvement. META fields or tags, however, contribute little to retrieval improvement.
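The field-combination idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the field names (`body`, `anchor`, `meta`), the numeric weights, the whitespace tokenizer, and the plain `log(N/df)` form of idf are all assumptions made for illustration, since the abstract gives no such details. The sketch scales each field's term counts by a per-field weight, sums them into one term frequency per document, and then applies tf × idf.

```python
import math
from collections import Counter

# Hypothetical field weights; the abstract reports no numeric values.
FIELD_WEIGHTS = {"body": 1.0, "anchor": 2.0, "meta": 0.5}


def combined_tf(doc_fields, weights=FIELD_WEIGHTS):
    """Sum per-field term counts, scaling each field by its weight."""
    tf = Counter()
    for field, text in doc_fields.items():
        w = weights.get(field, 0.0)  # unknown fields contribute nothing
        for term in text.lower().split():  # naive whitespace tokenizer
            tf[term] += w
    return tf


def tfidf_vectors(docs, weights=FIELD_WEIGHTS):
    """Build one tf x idf vector (term -> weight) per document."""
    tfs = [combined_tf(d, weights) for d in docs]
    n = len(docs)
    # Document frequency: number of documents where the term has positive tf.
    df = Counter()
    for tf in tfs:
        df.update(t for t, f in tf.items() if f > 0)
    return [
        {t: f * math.log(n / df[t]) for t, f in tf.items() if f > 0}
        for tf in tfs
    ]


# Toy collection: the first page has backlink (anchor) text, the second does not.
docs = [
    {"body": "tax office madrid", "anchor": "tax"},
    {"body": "madrid museum"},
]
vectors = tfidf_vectors(docs)
```

In this toy collection the term "tax" receives a combined frequency of 3.0 in the first document (1.0 from the body plus 2.0 from the anchor text), so raising or lowering a field's weight directly changes how much that field influences the final vector.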


Bibliographic details

Source: Workshop of the Cross-Language Evaluation Forum, 2007, 6 pages
Authors: Carlos G. Figuerola; Jose L. Alonso Berrocal; Angel F. Zazo Rodriguez; Emilio Rodriguez
Original format: PDF
Chinese Library Classification: Language and writing

