首页> 美国卫生研究院文献>AMIA Annual Symposium Proceedings >Recognizing Disjoint Clinical Concepts in Clinical Text Using Machine Learning-based Methods
【2h】

Recognizing Disjoint Clinical Concepts in Clinical Text Using Machine Learning-based Methods

机译:使用基于机器学习的方法识别临床文本中不相交的临床概念

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Clinical concept recognition (CCR) is a fundamental task in clinical natural language processing (NLP) field. Almost all current machine learning-based CCR systems can only recognize clinical concepts of consecutive words (called consecutive clinical concepts, CCCs), but can do nothing about clinical concepts of disjoint words (called disjoint clinical concepts, DCCs), which widely exist in clinical text. In this paper, we proposed two novel types of representations for disjoint clinical concepts, and applied two state-of-the-art machine learning methods to recognizing consecutive and disjoint concepts. Experiments conducted on the 2013 ShARe/CLEF challenge corpus showed that our best system achieved a “strict” F-measure of 0.803 for CCCs, a “strict” F-measure of 0.477 for DCCs, and a “strict” F-measure of 0.783 for all clinical concepts, significantly higher than the baseline systems by 4.2% and 4.1% respectively.
机译:临床概念识别(CCR)是临床自然语言处理(NLP)领域的一项基本任务。当前,几乎所有基于机器学习的CCR系统都只能识别连续词的临床概念(称为连续临床概念,CCC),而对不相交词的临床概念(称为不连续临床概念,DCC)无能为力。文本。在本文中,我们为不相交的临床概念提出了两种新颖的表示形式,并应用了两种最新的机器学习方法来识别连续和不相交的概念。在2013 ShARe / CLEF挑战语料库上进行的实验表明,我们最好的系统对CCC的“严格” F量度为0.803,对DCC的“严格” F量度为0.477,而“严格” F量度为0.783在所有临床概念中,它们分别比基线系统分别高出4.2%和4.1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号