Feature-based Model for Extraction and Classification of High Quality Questions in Online Forum

Bolanle Ojokoh; Tobore Igbe; Ayobami Araoye


Abstract

Aims: To design and implement a classification-based model that uses specific features to identify and extract high-quality questions in a forum thread.

Study Design: The study is divided into three modules: preprocessing, configuration, and question classification.

Place and Duration of Study: Department of Computer Science, Federal University of Technology, Akure, between June 2016 and December 2016.

Methodology: This research proposes a way of identifying, extracting and classifying questions in order to promote high-quality answers in an online forum. A major limitation of question extraction and classification in forums is the restricted number of categories considered (such as Who, What, Where, Which, Why and How), which are not sufficient to capture all possible questions. In this work, a number of parameters were proposed and aggregated using fuzzy logic for context-based spam detection and removal, in order to improve question identification and classification. Part-of-speech (POS) tagging was applied to analyse the structure of each extracted sentence based on the presence and position of predefined question tags; this addresses issues such as case sensitivity, grammatical construction and synonyms. Questions are classified with Naïve Bayes, and semantic relationships between extracted questions are identified with a cosine similarity model. Experiments were performed on a dataset constructed from the ResearchGate website.

Results: Questions extracted from the ResearchGate website were presented to the system. The output consists of the corresponding POS tags and the category into which each question is classified. The number of questions extracted depends on the number of questions available in a forum. The system correctly extracted and classified 3015 questions at 80% POS tag occurrence.

Conclusion: The approach to question identification and classification was effective and covers more question categories. It can be applied to any question answering system.
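
The abstract names Naïve Bayes for question classification and cosine similarity for relating extracted questions, but gives no implementation details. The following is a minimal sketch of those two techniques only, not the authors' system: the tokenizer, the bag-of-words features, the add-one smoothing, the training sentences and the category labels ("reason", "procedure") are all assumptions made for illustration. POS tagging and the fuzzy-logic spam filter are not covered.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on non-letters (an assumed, simplistic tokenizer)."""
    return [t for t in re.split(r"[^a-z]+", text.lower()) if t]


def cosine_similarity(a, b):
    """Cosine of the angle between the term-count vectors of two texts."""
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[t] * vb[t] for t in va)
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for tok in tokenize(text):
                if tok in self.vocab:  # skip words never seen in training
                    score += math.log((counts[tok] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical training questions and category labels, for illustration only.
train = [
    ("why does my model overfit", "reason"),
    ("why is the loss not decreasing", "reason"),
    ("how do I install the package", "procedure"),
    ("how can I tune the learning rate", "procedure"),
]
clf = NaiveBayes().fit([q for q, _ in train], [c for _, c in train])
```

With this toy data, `clf.predict(...)` returns one of the two assumed labels, and `cosine_similarity(q1, q2)` returns a value between 0 (no shared terms) and 1 (identical term counts) that could be used to group related questions.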


Bibliographic details

Source: British Journal of Mathematics & Computer Science, 2017, Issue 1, 21 pages
Authors: Bolanle Ojokoh; Tobore Igbe; Ayobami Araoye
Original format: PDF
Chinese Library Classification: Mathematics
Keywords: Question; online forum; ResearchGate; Naïve Bayes; spam filtering


Similar literature


1. Disease-Treatment Relationship Extraction for Psoriasis from Online Healthcare Forums using NLP and Classification Techniques [J]. Mamatha Balipa, Balasubramani R. International Journal of Applied Engineering Research. 2018, Issue 6aPta3

2. An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums [J]. Jun Gao, Ninghao Liu, Mark Lawley. Journal of Healthcare Engineering. 2017, Issue 1

3. An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums [J]. Gao Jun, Liu Ninghao, Lawley Mark. Journal of Healthcare Engineering. 2017, Issue 1

4. Expertise Modeling and Recommendation in Online Question and Answer Forums [C]. International Conference on Computational Science and Engineering. 2009

5. In search of a mathematics discourse model: Constructing mathematics knowledge through online discussion forums [D]. Ortiz-Rodriguez, Madeline. 2008

6. An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums [O]. Jun Gao, Ninghao Liu, Mark Lawley. 2017

7. Asking the Crowd: Question Analysis, Evaluation and Generation for Open Discussion on Online Forums [O]. Zi Chai, Xinyu Xing, Xiaojun Wan. 2019
