首页> 外文期刊>Journal of Biomedical Semantics >Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain
【24h】

Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain

机译:在骨骼发育不良域中发现关联规则的语义趣味性度量

获取原文
           

摘要

Background Lately, ontologies have become a fundamental building block in the process of formalising and storing complex biomedical information. With the currently existing wealth of formalised knowledge, the ability to discover implicit relationships between different ontological concepts becomes particularly important. One of the most widely used methods to achieve this is association rule mining. However, while previous research exists on applying traditional association rule mining on ontologies, no approach has, to date, exploited the advantages brought by using the structure of these ontologies in computing rule interestingness measures. Results We introduce a method that combines concept similarity metrics, formulated using the intrinsic structure of a given ontology, with traditional interestingness measures to compute semantic interestingness measures in the process of association rule mining. We apply the method in our domain of interest – bone dysplasias – using the core ontologies characterising it and an annotated dataset of patient clinical summaries, with the goal of discovering implicit relationships between clinical features and disorders. Experimental results show that, using the above mentioned dataset and a voting strategy classification evaluation, the best scoring traditional interestingness measure achieves an accuracy of 57.33%, while the best scoring semantic interestingness measure achieves an accuracy of 64.38%, both at the recall cut-off point 5. Conclusions Semantic interestingness measures outperform the traditional ones, and hence show that they are able to exploit the semantic similarities inherently present between ontological concepts. Nevertheless, this is dependent on the domain, and implicitly, on the semantic similarity metric chosen to model it.
机译:背景技术近年来,本体已经成为形式化和存储复杂生物医学信息的基本基础。借助当前大量形式化知识,发现不同本体论概念之间隐式关系的能力变得尤为重要。实现此目的最广泛使用的方法之一是关联规则挖掘。然而,尽管已有关于将传统关联规则挖掘应用于本体的研究,但迄今为止,尚未有任何方法利用这些本体的结构在计算规则兴趣度中所带来的优势。结果我们引入了一种方法,该方法将使用给定本体的内在结构制定的概念相似性度量与传统的兴趣度度量相结合,以在关联规则挖掘过程中计算语义兴趣度度量。我们将这种方法应用到我们感兴趣的领域-骨发育异常-中,使用表征该方法的核心本体和带注释的患者临床摘要数据集,以发现临床特征与疾病之间的隐式关系。实验结果表明,使用上述数据集和投票策略分类评估,得分最高的传统兴趣度测度的准确性达到57.33%,而得分最高的语义兴趣度测度的准确性达到64.38%,两者均符合召回率。 off point5。结论语义趣味性度量优于传统度量,因此表明它们能够利用本体论概念之间固有的语义相似性。但是,这取决于域,并且隐含地取决于为模型建模的语义相似性度量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号