首页> 美国卫生研究院文献>Journal of Biomedical Semantics >Automated Patent Categorization and Guided Patent Search using IPC as Inspired by MeSH and PubMed
【2h】

Automated Patent Categorization and Guided Patent Search using IPC as Inspired by MeSH and PubMed

机译:受到MeSH和PubMed的启发使用IPC进行自动专利分类和指导专利搜索

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Document search on PubMed, the pre-eminent database for biomedical literature, relies on the annotation of its documents with relevant terms from the Medical Subject Headings ontology (MeSH) for improving recall through query expansion. Patent documents are another important information source, though they are considerably less accessible. One option to expand patent search beyond pure keywords is the inclusion of classification information: Since every patent is assigned at least one class code, it should be possible for these assignments to be automatically used in a similar way as the MeSH annotations in PubMed. In order to develop a system for this task, it is necessary to have a good understanding of the properties of both classification systems. This report describes our comparative analysis of MeSH and the main patent classification system, the International Patent Classification (IPC). We investigate the hierarchical structures as well as the properties of the terms/classes respectively, and we compare the assignment of IPC codes to patents with the annotation of PubMed documents with MeSH terms.Our analysis shows a strong structural similarity of the hierarchies, but significant differences of terms and annotations. The low number of IPC class assignments and the lack of occurrences of class labels in patent texts imply that current patent search is severely limited. To overcome these limits, we evaluate a method for the automated assignment of additional classes to patent documents, and we propose a system for guided patent search based on the use of class co-occurrence information and external resources.
机译:在生物医学文献的杰出数据库PubMed上的文档搜索,依靠医学主题词本体论(MeSH)中带有相关术语的文档注释来通过查询扩展来提高查全率。专利文献是另一个重要的信息来源,尽管很难获得。将专利搜索范围扩展到纯关键字之外的一种选择是包含分类信息:由于每个专利至少分配了一个类别代码,因此应该有可能像在PubMed中使用MeSH注释一样自动使用这些分配。为了开发用于此任务的系统,必须对两个分类系统的属性有充分的了解。本报告描述了我们对MeSH和主要专利分类系统国际专利分类(IPC)的比较分析。我们分别研究了术语/类别的层次结构和属性,并且比较了IPC代码与专利的标注以及带有MeSH术语的PubMed文档的注释。我们的分析表明层次结构具有很强的结构相似性,但意义重大术语和注释的差异。 IPC类别分配的数量少,专利文本中缺少类别标签,这意味着当前专利检索受到严重限制。为了克服这些限制,我们评估了一种为专利文件自动分配其他类别的方法,并且我们提出了一种基于类别共现信息和外部资源的指导专利检索系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号