Conference: European Conference on Research and Advanced Technology for Digital Libraries

Term similarity-based query expansion for cross-language information retrieval


Abstract

We propose a query expansion technique, based on a statistical similarity measure among terms, to improve the effectiveness of the dictionary-based cross-language information retrieval (CLIR) method. We employ a term similarity-based sense disambiguation technique proposed in our earlier work to enhance the accuracy of the dictionary-based query translation method. The query expansion technique is then applied to the translated queries to further improve their retrieval performance. We demonstrate the effectiveness of the two techniques combined using queries in three languages, namely German, Spanish, and Indonesian, to retrieve English documents from a standard TREC (Text Retrieval Conference) collection. The results of our experiments indicate that the term similarity-based techniques work better when there are more phrases in the queries. In addition, our results re-emphasize other researchers' finding that phrase recognition and translation are critical to CLIR's effectiveness.
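The abstract does not say which statistical similarity measure is used, so the following is only a minimal sketch of the general idea of similarity-based query expansion. It assumes the Dice coefficient over document-level co-occurrence as the similarity measure, and the names `build_cooccurrence`, `dice`, `expand_query`, and `TOY_DOCS` are hypothetical and chosen for illustration. It operates on an already-translated English query and does not cover the dictionary lookup or sense disambiguation steps.

```python
from collections import defaultdict
from itertools import combinations

# Toy English collection (an assumption; the paper uses a TREC collection).
TOY_DOCS = [
    "information retrieval query expansion",
    "query expansion term similarity",
    "cross language retrieval dictionary",
    "cooking recipes pasta",
]


def build_cooccurrence(docs):
    """Count document frequency for each term and each unordered term pair."""
    df = defaultdict(int)
    pair_df = defaultdict(int)
    for doc in docs:
        terms = sorted(set(doc.lower().split()))
        for term in terms:
            df[term] += 1
        # combinations() over a sorted list yields pairs with a < b.
        for a, b in combinations(terms, 2):
            pair_df[(a, b)] += 1
    return df, pair_df


def dice(a, b, df, pair_df):
    """Dice coefficient: 2 * df(a, b) / (df(a) + df(b)). Assumed measure."""
    if a == b:
        return 1.0
    key = (a, b) if a < b else (b, a)
    denom = df.get(a, 0) + df.get(b, 0)
    return 2.0 * pair_df.get(key, 0) / denom if denom else 0.0


def expand_query(query_terms, docs, k=2):
    """Append the k collection terms most similar to the query as a whole."""
    df, pair_df = build_cooccurrence(docs)
    query = list(query_terms)
    candidates = set(df) - set(query)
    # Score each candidate by its summed similarity to every query term.
    scored = [(sum(dice(c, q, df, pair_df) for q in query), c) for c in candidates]
    scored = [(s, c) for s, c in scored if s > 0]
    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda sc: (-sc[0], sc[1]))
    return query + [c for _, c in scored[:k]]


if __name__ == "__main__":
    print(expand_query(["query"], TOY_DOCS, k=1))
```

Scoring candidates against the whole query, rather than one query term at a time, is a design choice made here to keep unrelated terms out of the expansion; whether the paper does the same is not stated in the abstract.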