首页> 外文会议>International conference on advanced data mining and applications >Understanding Behavioral Differences Between Short and Long-Term Drinking Abstainers from Social Media
【24h】

Understanding Behavioral Differences Between Short and Long-Term Drinking Abstainers from Social Media

机译:从社交媒体了解短期和长期戒酒者之间的行为差​​异

获取原文

摘要

Drinking alcohol has high cost on society. The journey from being a regular drinker to a successful quitter may be a long and hard journey, fraught with the risk to relapse. Research has shown that certain behavioral changes can be effective towards staying abstained. Traditional way to conduct research on drinking abstainers uses questionnaire based approach to collect data from a curated group of people. However, it is an expensive approach in both cost and time and often results in small data with less diversity. Recently, social media has emerged as a rich data source. Reddit is one such social media platform that has a community ('subreddit') with an interest to quit drinking. The discussions among the group dates back to year 2011 and contain more than 40,000 posts. This large scale data is generated by users themselves and without being limited by any survey questionnaires. The most predictive factors from the features (unigrams, topics and LIWC) associated with short-term and long-term abstinence are identified using Lasso. It is seen that many common patterns manifest in unigrams, topics and LIWC. Whilst topics provided much richer associations between a group of words and the outcome, unigrams and LIWC are found to be good at finding highly predictive solo and psycho linguistically important words. Combining them we have found that many interesting patterns that are associated with the successful attempt made by the long-term abstainer, at the same time finding many of the common issues faced during the initial period of abstinence.
机译:饮酒对社会造成了高昂的代价。从经常饮酒到成功戒烟的过程可能是一段漫长而艰辛的旅程,充满了复发的风险。研究表明,某些行为改变可能对保持弃权有效。对戒酒者进行研究的传统方法是使用基于问卷的方法从精选人群中收集数据。但是,这在成本和时间上都是一种昂贵的方法,并且通常会导致数据量少,多样性少。最近,社交媒体已经成为一种丰富的数据源。 Reddit是一个这样的社交媒体平台,其社区(“ subreddit”)有意戒酒。该小组之间的讨论可以追溯到2011年,包含40,000多个帖子。这种大规模数据是由用户自己生成的,不受任何调查问卷的限制。使用Lasso可以识别与短期和长期戒断相关的特征(字母组合,主题和LIWC)中最具预测性的因素。可以看出,许多常见的模式都体现在unigram,主题和LIWC中。尽管主题在一组单词和结果之间提供了更丰富的关联,但是发现unigram和LIWC擅长查找具有高度预测性的独奏和心理上重要的单词。结合它们,我们发现许多有趣的模式与长期弃权者的成功尝试相关,同时发现了在禁欲初期面临的许多常见问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号