首页> 中文期刊> 《中国邮电高校学报:英文版》 >Mining microblog user interests based on TextRank with TF-IDF factor

Mining microblog user interests based on TextRank with TF-IDF factor

         

摘要

It is of great value and significance to model the interests of microblog user in terms of business and sociology.This paper presents a framework for mining and analyzing personal interests from microblog text with a new algorithm which integrates term frequency-inverse document frequency(TF-IDF) with TextRank.Firstly, we build a three-tier category system of user interest based on Wikipedia.In order to obtain the keywords of interest, we preprocess the posts, comments and reposts in different categories to select the keywords which appear both in the category system and microblogs.We then assign weight to each category and calculate the weight of keyword to get TF-IDF factors.Finally we score the ranking of each keyword by the TextRank algorithm with TF-IDF factors.Experiments on real Sina microblog data demonstrate that the precision of our approach significantly outperforms other existing methods.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号