首页> 外文会议>Workshop on biomedical natural language processing >PubTermVariants: biomedical term variants and their use for PubMed search
【24h】

PubTermVariants: biomedical term variants and their use for PubMed search

机译:Pubtermvariants:生物医学术语变体及其用于PubMed搜索的用途

获取原文

摘要

Term normalization is frequently used in information retrieval task to reduce variant word forms to a common form. The most general term normalization technique used in practice is stemming, however it has been found to not be completely reliable. Here we present PubTermVariants, a high-quality data-driven resource of term variant pairs that can improve search results in PubMed. For a given pair, we consider two terms to be variants if they stem to the same form, pass the hypergeometric test, and pass the morpho-semantic test. We perform manual evaluation of a subset of PubTermVariants that confirms the high quality of the candidate pairs. We further present experiments that demonstrate their usefulness for PubMed search.
机译:术语归一化经常用于信息检索任务,以将变体单词形式减少到常见形式。实践中使用的最通用术语正常化技术是肿胀的,但已发现它不完全可靠。在这里,我们呈现Pubtermvariants,一种高质量的数据驱动资源,可以改善Pubmed的搜索结果。对于给定对,如果它们源于相同的形式,我们认为两个术语是变体,通过过度距离测试,并通过Morpho-Semantic测试。我们对确认候选对高质量的高质量的Pubtermvariant的子集进行手动评估。我们进一步提出了展示其对PubMed搜索有用性的实验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号