首页> 中文期刊> 《中国通信:英文版》 >Semantic Knowledge Acquisition from Blogs with Tag-Topic Model

Semantic Knowledge Acquisition from Blogs with Tag-Topic Model

         

摘要

This paper focuses on semantic knowledge acquisition from blogs with the proposed tag-topic model.The model extends the Latent Dirichlet Allocation(LDA) model by adding a tag layer between the document and the topic.Each document is represented by a mixture of tags;each tag is associated with a multinomial distribution over topics and each topic is associated with a multinomial distribution over words.After parameter estimation,the tags are used to describe the underlying topics.Thus the latent semantic knowledge within the topics could be represented explicitly.The tags are treated as concepts,and the top-N words from the top topics are selected as related words of the concepts.Then PMI-IR is employed to compute the relatedness between each tag-word pair and noisy words with low correlation removed to improve the quality of the semantic knowledge.Experiment results show that the proposed method can effectively capture semantic knowledge,especially the polyseme and synonym.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号