首页> 外文会议>International Conference on Information Science and Control Engineering >Indonesians' Song Lyrics Topic Modelling Using Latent Dirichlet Allocation
【24h】

Indonesians' Song Lyrics Topic Modelling Using Latent Dirichlet Allocation

机译:印尼人使用潜在狄利克雷分配的歌曲歌词主题建模

获取原文

摘要

Lyrics in songs have important roles to give identity and storyline of songs. Lyrics are also considered as one of influence factor for popularity of the songs. However, it is difficult to manually assess the topic from numerous songs especially Indonesian's songs which have high poetic level in the lyrics. Knowing the intrinsic value of lyrics from numerous songs becomes a challenge, especially the lyrics written in complicated and complex language like Bahasa. This paper aims to know and interpret the topics from Indonesian's song lyrics. Indonesian's songs were obtained from daily TOP 200 Spotify in January 2017 - January 2018 with 193 different songs using Bahasa in the lyrics. Latent Dirichlet Allocation (LDA) for the topic modeling was used in this paper. Using 10 topics based on perplexity results, LDA has proper way to interpret the topics in numerous songs by giving information about top words in every topic and topic probabilities for each document or song.
机译:歌曲中的歌词在赋予歌曲身份和故事情节方面具有重要作用。歌词也被认为是歌曲流行的影响因素之一。但是,很难从众多歌曲中手动评估主题,尤其是歌词中诗意高的印尼歌曲。从众多歌曲中了解歌词的内在价值成为一个挑战,尤其是用复杂的语言(如Bahasa)编写的歌词。本文旨在了解和解释印尼歌曲歌词中的主题。印度尼西亚的歌曲是从2017年1月至2018年1月的每日TOP 200 Spotify中获得的,其中193首不同的歌曲使用Bahasa作为歌词。本文使用潜在狄利克雷分配(LDA)进行主题建模。 LDA通过使用基于困惑结果的10个主题,可以通过给出有关每个主题中的热门单词以及每个文档或歌曲的主题概率的信息,来解释众多歌曲中的主题的正确方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号