Clustering cliques for graph-based summarization of the biomedical research literature

Han Zhang; Marcelo Fiszman; Dongwook Shin; Bartlomiej Wilkowski; Thomas C Rindflesch

首页> 外文期刊>BMC Bioinformatics >Clustering cliques for graph-based summarization of the biomedical research literature

【24h】

Clustering cliques for graph-based summarization of the biomedical research literature

机译：用于基于图的生物医学研究文献综述的聚类集团

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Background Graph-based notions are increasingly used in biomedical data mining and knowledge discovery tasks. In this paper, we present a clique-clustering method to automatically summarize graphs of semantic predications produced from PubMed citations (titles and abstracts). Results SemRep is used to extract semantic predications from the citations returned by a PubMed search. Cliques were identified from frequently occurring predications with highly connected arguments filtered by degree centrality. Themes contained in the summary were identified with a hierarchical clustering algorithm based on common arguments shared among cliques. The validity of the clusters in the summaries produced was compared to the Silhouette-generated baseline for cohesion, separation and overall validity. The theme labels were also compared to a reference standard produced with major MeSH headings. Conclusions For 11 topics in the testing data set, the overall validity of clusters from the system summary was 10% better than the baseline (43% versus 33%). While compared to the reference standard from MeSH headings, the results for recall, precision and F-score were 0.64, 0.65, and 0.65 respectively.

机译：背景技术基于图的概念越来越多地用于生物医学数据挖掘和知识发现任务中。在本文中，我们提出了一种集团聚类方法来自动汇总从PubMed引文（标题和摘要）产生的语义谓词图。结果SemRep用于从PubMed搜索返回的引文中提取语义谓词。从频繁出现的谓词中识别出集团，这些谓词具有通过程度中心性过滤的高度关联的论点。摘要中包含的主题通过基于群体之间共享的通用论证的层次聚类算法进行识别。将生成的摘要中聚类的有效性与Silhouette生成的基线进行比较，以了解内聚性，分离性和总体有效性。还将主题标签与主要MeSH标题产生的参考标准进行了比较。结论对于测试数据集中的11个主题，系统摘要中群集的总体有效性比基线好10％（43％对33％）。与MeSH标题的参考标准相比，召回率，精确度和F分数分别为0.64、0.65和0.65。

著录项

来源
《BMC Bioinformatics》 |2013年第1期|共页
作者
Han Zhang; Marcelo Fiszman; Dongwook Shin; Bartlomiej Wilkowski; Thomas C Rindflesch;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类生物科学;
关键词

相似文献

外文文献
中文文献
专利

1. A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method [J] . Illhoi Yoo, Xiaohua Hu, Il-Yeol Song BMC Bioinformatics . 2007,第SUPPLEMENTa9期

机译：基于相干图的生物医学文献语义聚类和总结方法及新的评价方法
2. Intelligent multi-document summarization for biomedical literature by word embeddings and graph-based ranking [J] . Shen Chen, Lin Hongfei, Hao Huihui, Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2019,第4aPta1期

机译：Word Embeddings和基于Graph级别的生物医学文献智能多文件摘要
3. A Graph-Based Biomedical Literature Clustering Approach Utilizing Term抯 Global and Local Importance Information [J] . Zhang Xiaodan, Hu Xiaohua, Xia Jiali, International Journal of Data Warehousing and Mining . 2008,第4期

机译：基于术语的全局和局部重要性信息的基于图的生物医学文献聚类方法
4. Integrating biomedical literature clustering and summarization approaches using biomedical ontology [C] . Illhoi Yoo, Xiaohua Hu, Il-Yeol Song Proceedings of the 1st international workshop on Text mining in bioinformatics . 2006

机译：使用生物医学本体整合生物医学文献聚类和总结方法
5. Contextualized Semantic Maps for Retrieval and Summarization of Biomedical Literature [D] . Garcia-Gathright, Jean Imelda 2016

机译：语境化语义图的检索和生物医学文献综述
6. Clustering cliques for graph-based summarization of the biomedical research literature [O] . Han Zhang, Marcelo Fiszman, Dongwook Shin, 2013

机译：基于聚类的生物医学研究文献基于图的摘要
7. Clustering cliques for graph-based summarization of the biomedical research literature [O] . Han Zhang, Marcelo Fiszman, Dongwook Shin, 2013

机译：基于聚类的生物医学研究文献基于图的摘要

Clustering cliques for graph-based summarization of the biomedical research literature

摘要

著录项

相似文献

相关主题

期刊订阅