【24h】

Question identification on Turkish tweets

机译:土耳其推文中的问题识别

获取原文
获取原文并翻译 | 示例

摘要

Question identification is a field Natural Language Processing and also Information Extraction. The aim of work is detecting Turkish tweets which are including question expressions. The application contains three stages: applying some pre-processing steps to data set for cleaning unnecessary data like Retweet, determining candidate tweets via a rule-based method and extracting tweets which are really include questions using Conditional Random Fields. For this purpose one million tweets were collected and labeled. Tweets are ungrammatical data type. According to results; the model developed has been largely successful on tweets. Additionally, it is a first study about identifying questions on Turkish tweets.
机译:问题识别是自然语言处理领域,也是信息提取领域。工作的目的是检测包含问题表达的土耳其推文。该应用程序包含三个阶段:对数据集应用一些预处理步骤以清除不必要的数据(如Retweet),通过基于规则的方法确定候选推文以及使用条件随机字段提取确实包含问题的推文。为此,收集并标记了100万条推文。推文是非语法数据类型。根据结果​​;开发的模型已在推文上大获成功。此外,这是有关识别土耳其推文问题的第一项研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号