Pairwise Topic Model via relation extraction

机译：通过关系提取的成对主题模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Topic modeling is a powerful tool to model documents to find their underlying topics. However, the unstructured nature of the raw text makes it hard to model the semantic relationship between the text units, which may be the words, phrases or sentences, and thus even harder to model their corresponding underlying topics. In our work, we try to examine the pairwise relationship of the underlying topics through relation extraction. We first extract the entity pairs within one relation tuple out of the raw text. Then, we model the relationship between the entity pairs by adding the dependencies between entities and their corresponding topics. We propose six different versions of Pairwise Topic Model (PTM) to simultaneously discover the latent topics and their pairwise relationship. The experiment on four data sets (AP news articles, DUC 2004 task2, Clinical Notes and Neuroscience Papers) shows the PTM models are better-structured language model than the traditional topic model Latent Dirichlet Allocation (LDA). Also, empirical results show that the proposed Pairwise Topic Models (PTMs) can explicitly explain how two topics are related.

机译：主题建模是一种功能强大的工具，可以对文档进行建模以查找其基础主题。但是，原始文本的非结构化性质使得很难对文本单元（可能是单词，短语或句子）之间的语义关系进行建模，因此甚至更难于对其相应的基础主题进行建模。在我们的工作中，我们尝试通过关系提取来检查基础主题的成对关系。我们首先从原始文本中提取一个关系元组中的实体对。然后，我们通过添加实体及其对应主题之间的依赖关系来对实体对之间的关系进行建模。我们提出了成对主题模型（PTM）的六个不同版本，以同时发现潜在主题及其成对关系。对四个数据集（AP新闻，DUC 2004 task2，Clinical Notes和Neuroscience Papers）进行的实验表明，PTM模型比传统主题模型Latent Dirichlet Allocation（LDA）具有更好的结构化语言模型。此外，经验结果表明，提出的成对主题模型（PTM）可以明确解释两个主题之间的关系。

著录项

来源
《IEEE International Congress on Big Data》|2014年|96-103|共8页
会议地点
作者
Xiaoli Song; Yue Shang; Yuan Ling; Mengwen Liu; Xiaohua Hu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
text analysis; LDA; PTM; documents modeling; entity pairs extraction; latent Dirichlet allocation; latent topics; pairwise relationship; pairwise topic model; phrases; raw text relation tuple; relation extraction; semantic relationship; sentences; structured language model; text units; words; Data mining; Data models; Data structures; Educational institutions; Hidden Markov models; Joints; Syntactics; Pairwise Topic Modeling; Relation Extraction; Structured Data;

机译：文本分析; LDA; PTM;文档建模;实体对提取;潜在Dirichlet分配;潜在主题;成对关系;成对主题模型;短语;原始文本关系元组;关系提取;语义关系;句子;结构化语言模型;文本单元;词;数据挖掘;数据模型;数据结构;教育机构;隐马尔可夫模型;关节;句法;成对主题建模;关系提取;结构化数据;

相似文献

外文文献
中文文献
专利

1. Learning Hidden Markov Models from Pairwise Co-occurrences with Application to Topic Modeling [J] . Kejun Huang, Xiao Fu, Nicholas Sidiropoulos JMLR: Workshop and Conference Proceedings . 2018,第2011期

机译：使用应用于主题建模的成对共同出现学习隐藏的马尔可夫模型
2. Sentence level topic models for associated topics extraction [J] . Jiang Haixin, Zhou Rui, Zhang Limeng, World Wide Web . 2019,第6期

机译：用于关联主题提取的句子级主题模型
3. Combining paper cooperative network and topic model for expert topic analysis and extraction [J] . Gao Shengxiang, Li Xian, Yu Zhengtao, Neurocomputing . 2017,第sepa27期

机译：结合纸张合作网络和主题模型进行专家主题分析和提取
4. Pairwise Topic Model via relation extraction [C] . Xiaoli Song, Yue Shang, Yuan Ling, IEEE International Congress on Big Data . 2014

机译：通过关系提取成对主题模型
5. Topics in galaxy formation: Pairwise velocities of dark matter halos and molecular hydrogen regulated star formation in cosmological simulations [D] . Thompson, Robert Jo. 2012

机译：星系形成的主题：宇宙学模拟中暗物质晕和分子氢调节恒星形成的成对速度
6. Lotka-Volterra pairwise modeling fails to capture diverse pairwise microbial interactions [O] . Babak Momeni, Li Xie, Wenying Shou 2017

机译：Lotka-Volterra成对建模无法捕获各种成对微生物相互作用
7. When topic models disagree: keyphrase extraction with mulitple topic models [O] . Sterckx Lucas, Demeester Thomas, Deleu Johannes, 2015

机译：主题模型不一致时：使用多个主题模型进行关键短语提取
8. Extraction et Utilisation des Relations Booleennes pour la Resolution des Programmes Lineaires en Variables 0-1, Volume 1 (Extraction and Evaluation of Boolean Relations for the Solution of Linear Programs in 0-1 Variables, Volume [R] . Jaumard, B. 1986

机译：提取与利用关系Booleennes pour la Resolution des programs Lineaires en Variablebles 0-1，Volume 1（0-1变量线性程序解决方案的布尔关系的提取和评估，体积

Pairwise Topic Model via relation extraction

摘要

著录项

相似文献

相关主题

期刊订阅