A Document Exploring System on LDA Topic Model for Wikipedia Articles

Zhou Tong; Haiyi Zhang

Abstract

A large amount of digital text is generated every day, and searching, managing, and exploring this text effectively has become a major task. In this paper, we first introduce text mining and the LDA topic model. We then explain in detail how to apply the LDA topic model to a text corpus through experiments on Simple Wikipedia documents. The experiments cover all the necessary steps: data retrieval, pre-processing, model fitting, and an application in the form of a document exploring system. The results show that the LDA topic model works effectively for clustering documents and for finding similar documents. Furthermore, the document exploring system could be a useful research tool for students and researchers.
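The steps named in the abstract (pre-processing, model fitting, finding similar documents) can be sketched on a toy corpus. This is only an illustration, not the authors' implementation: the abstract does not say which inference method, hyperparameters, or similarity measure the paper uses, so collapsed Gibbs sampling, the values of `alpha` and `beta`, the Hellinger distance, the stop-word list, and the four toy texts are all assumptions made here.

```python
import math
import random
import re

# Hypothetical stop-word list; a real pipeline would use a fuller one.
STOPWORDS = {"the", "a", "an", "of", "and", "is", "in", "to", "are", "as"}

def tokenize(text):
    """Pre-processing: lowercase, keep alphabetic tokens, drop stop words."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]

def fit_lda(docs, num_topics, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Fit LDA by collapsed Gibbs sampling; return per-document topic mixtures."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)
    doc_topic = [[0] * num_topics for _ in docs]                # topic counts per document
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # word counts per topic
    topic_total = [0] * num_topics                              # total words per topic
    assignments = []
    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            k = rng.randrange(num_topics)
            zs.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(zs)
    # Resample each token's topic conditioned on all other assignments.
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1
    # Smoothed estimate of each document's topic distribution.
    return [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in counts]
        for doc, counts in zip(docs, doc_topic)
    ]

def hellinger(p, q):
    """Hellinger distance between two discrete distributions (0 = identical)."""
    return math.sqrt(0.5 * sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(p, q)))

def most_similar(theta, query_index):
    """Index of the document whose topic mixture is closest to the query's."""
    others = [i for i in range(len(theta)) if i != query_index]
    return min(others, key=lambda i: hellinger(theta[query_index], theta[i]))

# Toy stand-ins for Simple Wikipedia articles (invented for this sketch).
corpus = [
    "The cat is a small animal. Cats and dogs are pets.",
    "A dog is an animal. Dogs and cats are kept in the home as pets.",
    "The sun is a star. Planets move around the sun in space.",
    "Mars is a planet in space. The planet moves around the sun.",
]
docs = [tokenize(text) for text in corpus]
theta = fit_lda(docs, num_topics=2)
nearest = most_similar(theta, 0)
print("topic mixture of document 0:", theta[0])
print("document most similar to document 0:", nearest)
```

Each row of `theta` is one document's distribution over topics, which is the representation a document exploring system can use for both clustering and similarity lookup. With a corpus this small the sampled topics depend on the seed, so no particular output is claimed here.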

Bibliographic details

Source: International Journal of Multimedia & Its Applications, 2016, No. 4
Authors: Zhou Tong; Haiyi Zhang
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
