Graph Based Keyword Extraction for Similarity Identification among Born-Digital News Contents

Abstract

The increasing influence of the internet has led to a huge number of born-digital news articles being published online. It is also becoming increasingly difficult to search this vast store of documents to prevent duplication. Keywords are the most salient words in any textual document. We introduce a graph-based approach to keyword extraction that uses term co-occurrence in textual news articles and integrates weighted closeness centrality (CC) with the weighted clustering coefficient (WC). We also propose a metric, the co-occurrence index (CI), based on the extracted keywords, to measure the similarity between any two textual news articles. Our proposed method is independent of the ‘bag-of-words’ model and shows significant performance improvement over existing methods.
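The abstract names the ingredients of the method but not its formulas, so the following is only a minimal sketch of how such a pipeline could look, not the authors' implementation. Everything specific here is an assumption: the sliding-window size, the use of 1/weight as edge length for closeness, the Barrat-style form of the weighted clustering coefficient, the additive combination of CC and WC, and the function names. The `keyword_overlap` helper is a plain Jaccard overlap used as a stand-in, since the abstract does not define the co-occurrence index (CI).

```python
import heapq
from collections import defaultdict
from itertools import combinations


def cooccurrence_graph(tokens, window=2):
    """Undirected graph whose edge weight counts how often two distinct terms
    occur within `window` positions of each other (window size is assumed)."""
    graph = defaultdict(lambda: defaultdict(int))
    for i, a in enumerate(tokens):
        for b in tokens[i + 1 : i + 1 + window]:
            if a != b:
                graph[a][b] += 1
                graph[b][a] += 1
    return graph


def weighted_closeness(graph, source):
    """Closeness with edge length 1/weight (assumed): number of reachable
    nodes divided by the total shortest-path distance to them (Dijkstra)."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale heap entry
        for v, w in graph[u].items():
            nd = d + 1.0 / w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    total = sum(dist.values())
    return (len(dist) - 1) / total if total > 0 else 0.0


def weighted_clustering(graph, node):
    """Barrat-style weighted clustering coefficient (assumed variant)."""
    nbrs = graph[node]
    k = len(nbrs)
    if k < 2:
        return 0.0
    strength = sum(nbrs.values())
    # Sum the two incident edge weights for every connected neighbor pair.
    closed = sum(
        nbrs[j] + nbrs[h]
        for j, h in combinations(nbrs, 2)
        if h in graph[j]
    )
    return closed / (strength * (k - 1))


def extract_keywords(tokens, top_k=3, window=2):
    """Rank terms by CC + WC (the additive combination is an assumption)."""
    graph = cooccurrence_graph(tokens, window)
    scores = {
        n: weighted_closeness(graph, n) + weighted_clustering(graph, n)
        for n in list(graph)
    }
    # Sort by descending score, breaking ties alphabetically.
    return sorted(scores, key=lambda n: (-scores[n], n))[:top_k]


def keyword_overlap(keywords_a, keywords_b):
    """Jaccard overlap of two keyword sets; a stand-in, not the paper's CI."""
    a, b = set(keywords_a), set(keywords_b)
    return len(a & b) / len(a | b) if a | b else 0.0
```

In use, each article would be tokenized (and typically stripped of stop words) before calling `extract_keywords`, and the two resulting keyword lists would then be passed to `keyword_overlap` to obtain a similarity score between 0 and 1.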

Bibliographic record

Source
International Conference on Computing, Communication and Networking Technologies, 2020, pp. 1-7 (7 pages)
Authors
Susmita Das; Samya Muhuri; Susanta Chakraborty; Samit Biswas
Original format
PDF
Keywords
Graphs; Keyword Extraction; Centrality

Similar literature

1. News Keyword Extraction Algorithm Based on Semantic Clustering and Word Graph Model [J]. Ao Xiong, Derong Liu, Hongkang Tian. Journal of Tsinghua University (English Edition), 2021, No. 6.
2. Keywords extraction in Chinese–Vietnamese bilingual news based on hypergraph [J]. Jiaxin Zhai, Shengxiang Gao, Zhengtao Yu. International Journal of Distributed Sensor Networks, 2018, No. 11.
3. Automatic keyword extraction from documents based on multiple content-based measures [J]. Kun Yue, Wei-Yi Liu, Li-Ping Zhou. International Journal of Computer Systems Science & Engineering, 2011, No. 2.
4. Keywords Similarity Based Topic Identification for Indonesian News Documents [C]. Fuddoly Aini, Jaafar Jafreezal, Zamin Norshuhani. UKSim-AMSS 7th European Modelling Symposium, 2013.
5. Content-based image retrieval by similarity learning for digital mammography [D]. El Naqa, Issam M. 2002.
6. FNG-IE: an improved graph-based method for keyword extraction from scholarly big-data [O]. Noman Tahir, Muhammad Asif, Shahbaz Ahmad. 2021.
7. Keywords extraction in Chinese–Vietnamese bilingual news based on hypergraph [O]. Jiaxin Zhai, Shengxiang Gao, Zhengtao Yu. 2018.
