Conference: International conference on computational linguistics

How to Get the Same News from Different Language News Papers

Abstract

This paper presents ongoing work on identifying similar documents across newspapers in different languages. Our aim is to take a news item or event as a query, identify similar documents across languages, and so make cross-lingual search easier and more accurate. For example, given an event or news item in English, the system retrieves all related English news documents as well as related documents in other languages such as Hindi, Bengali, Tamil, Telugu, Malayalam, and Spanish. We use the Vector Space Model (VSM), a well-known method for similarity calculation; the novelty lies in how the terms for the VSM calculation are identified. No robust translation system is used to translate the documents. The system achieves good recall and precision.
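As a rough illustration of the similarity step, the following is a minimal sketch of a generic Vector Space Model ranking: TF-IDF term weights and cosine similarity. It is not the paper's method. The abstract does not describe how terms are identified across languages, so the sketch assumes each document has already been reduced to a list of tokens in some shared term space. The function names (`tf_idf_vectors`, `cosine`, `rank_by_similarity`), the smoothed IDF formula, and the choice to include the query in the document-frequency counts are all assumptions made for illustration.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Build sparse TF-IDF vectors (dicts) for a list of token lists.

    Assumption: tokens are already comparable across documents; the
    cross-lingual term identification step is outside this sketch.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF (an illustrative choice) keeps every weight positive.
        vectors.append({
            term: count * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
            for term, count in tf.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_by_similarity(query_tokens, docs):
    """Return (doc_index, score) pairs, most similar document first."""
    # The query is weighted together with the documents so that all
    # vectors share the same IDF statistics.
    vectors = tf_idf_vectors([query_tokens] + docs)
    query_vec, doc_vecs = vectors[0], vectors[1:]
    scores = [(i, cosine(query_vec, vec)) for i, vec in enumerate(doc_vecs)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy data: two tokenized news documents and a query.
docs = [
    "flood relief chennai government".split(),
    "cricket final score mumbai".split(),
]
ranking = rank_by_similarity("chennai flood".split(), docs)
```

In this toy run, `ranking` places the flood document first, and the cricket document scores zero because it shares no terms with the query. In a cross-lingual setting the quality of such a ranking would depend almost entirely on how tokens from different languages are mapped to common terms, which is the part the abstract names as its contribution.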
