首页> 外文会议>Workshop on cognitive aspects of the lexicon;International conference on computational linguistics >Discovering Potential Terminological Relationships from Twitter's Timed Content
【24h】

Discovering Potential Terminological Relationships from Twitter's Timed Content

机译:从Twitter的定时内容中发现潜在的术语关系

获取原文

摘要

This paper presents a method to discover possible terminological relationships from tweets. We match the histories of terms (frequency patterns). Similar history indicates a possible relationship between terms. For example, if two terms (t1, t2) appeared frequently in Twitter at particular days, and there is a 'similarity' in the frequencies over a period of time, then tl and t2 can be related. Maintaining standard terminological repository with updated relationships can be difficu especially in a dynamic domain such as social media where thousands of new terms (neology) are coined every day. So we propose to construct a raw repository of lexical units with unconfirmed relationships. We have experimented our method on time-sensitive Arabic terms used by the online Arabic community of Twitter. We draw relationships between these terms by matching their similar frequency patterns (timelines). We use dynamic time warping as a similarity measure. For evaluation, we have selected 630 possible terms (we call them preterms) and we matched the similarity of these terms over a period of 30 days. Around 270 correct relationships were discovered with a precision of 0.61. These relationships were extracted without considering the textual context of the term.
机译:本文提出了一种从推文中发现可能的术语关系的方法。我们匹配术语(频率模式)的历史记录。相似的历史记录表明术语之间可能存在关系。例如,如果两个词(t1,t2)在特定日期频繁出现在Twitter中,并且在一段时间内频率上存在“相似性”,则t1和t2可以相关。维护具有更新关系的标准术语库可能很困难;尤其是在动态领域(例如社交媒体)中,每天都会创造成千上万个新术语(新词)。因此,我们建议构建具有未确认关系的词汇单元的原始存储库。我们已经对Twitter的在线阿拉伯语社区使用的对时间敏感的阿拉伯语术语进行了实验。我们通过匹配它们的相似频率模式(时间线)来绘制这些术语之间的关系。我们使用动态时间规整作为相似性度量。为了进行评估,我们选择了630个可能的术语(我们称它们为preterms),并在30天的时间内匹配了这些术语的相似性。发现了大约270个正确关系,精确度为0.61。这些关系是在不考虑该术语的文本上下文的情况下提取的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号