首页> 外文会议>2011 IEEE International Conference on Granular Computing >Classification of PICO elements by text features systematically extracted from PubMed abstracts
【24h】

Classification of PICO elements by text features systematically extracted from PubMed abstracts

机译:通过从PubMed摘要中系统提取的文本特征对PICO元素进行分类

获取原文

摘要

We propose and evaluate a systematic approach to detect and classify Patient/Problem, Intervention, Comparison and Outcome (PICO) from the medical literature. The training and test corpora were generated systematically and automatically from structured PubMed abstracts. 23,472 sentences by exact pattern match of head words of P-I-O categories. Afterward, the terms with top frequencies were used as the features of Naïve Bayesian classifier. This approach achieves F-measure values of 0.91 for Patient/Problem, 0.75 for Intervention and 0.88 for Outcome, comparable to previous studied based on mixed textural, paragraphical, and semantic features. In conclusion, we show that by stricter pattern matching criteria of training set, detection and classification of PICO elements can be reproducible with minimal expert intervention. The results of this work are higher than previous studies.
机译:我们提出并评估了一种从医学文献中检测和分类患者/问题,干预,比较和结果(PICO)的系统方法。培训和测试语料库是从结构化的PubMed摘要中自动系统生成的。通过对P-I-O类别的主词进行精确模式匹配,可获得23,472个句子。之后,使用频率最高的术语作为朴素贝叶斯分类器的特征。与先前基于混合的纹理,段落和语义特征进行的研究相比,该方法的F-measure值对患者/问题而言为0.91,对于干预而言为0.75,对于结果为0.88。总而言之,我们表明,通过更严格的训练集模式匹配标准,可以在最少的专家干预的情况下重现PICO元素的检测和分类。这项工作的结果高于以前的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号