Predicting User Competence from Text



Abstract

We explore the possibility of learning user competence from text by using natural language processing and machine learning (ML) methods. In our context, competence is defined as the ability to identify the wildlife appearing in images and classify it into species correctly. We evaluate and compare the performance (in terms of accuracy and F-measure) of three ML methods, Naive Bayes (NB), Decision Trees (DT), and K-nearest neighbors (KNN), applied to a text corpus obtained from Snapshot Serengeti discussion forum posts. The baseline results show that, regarding accuracy, DT outperforms NB and KNN by 16.00% and 15.00%, respectively. Regarding F-measure, KNN outperforms NB and DT by 12.08% and 1.17%, respectively. We also propose a hybrid model that combines the three models (DT, NB, and KNN). We improve on the baseline results with a calibration technique and additional features. Adding a bi-gram feature produced a dramatic increase in accuracy for the NB model (from 48.38% to 64.40%). We also pushed the accuracy limit of the baseline models from 93.39% to 94.09%.
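The abstract names bi-gram features, accuracy, F-measure, and a hybrid of three classifiers without giving implementation details. The following is a minimal standard-library sketch of those pieces, not the author's pipeline: the labels "competent" and "novice", the whitespace tokenizer, the per-model predictions, and the majority-vote combination rule are all assumptions made for illustration.

```python
from collections import Counter


def ngrams(text, n):
    """Return word n-grams of a lowercased, whitespace-tokenized text."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def features(text):
    """Count unigram and bi-gram features for one forum post."""
    return Counter(ngrams(text, 1) + ngrams(text, 2))


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f_measure(y_true, y_pred, positive):
    """F1 score (harmonic mean of precision and recall) for one class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def majority_vote(*prediction_lists):
    """Combine per-model predictions by simple majority.

    Assumed combination rule; the abstract does not say how the hybrid
    model merges DT, NB, and KNN. Ties (possible with more than two
    classes) go to the label seen first among the votes.
    """
    return [Counter(votes).most_common(1)[0][0]
            for votes in zip(*prediction_lists)]


# Made-up labels and predictions, for illustration only.
y_true = ["competent", "novice", "competent", "novice"]
nb_pred = ["competent", "competent", "competent", "novice"]
dt_pred = ["competent", "novice", "novice", "novice"]
knn_pred = ["novice", "novice", "competent", "novice"]

hybrid_pred = majority_vote(nb_pred, dt_pred, knn_pred)
print("NB accuracy:", accuracy(y_true, nb_pred))
print("NB F-measure:", f_measure(y_true, nb_pred, "competent"))
print("Hybrid accuracy:", accuracy(y_true, hybrid_pred))
```

In this toy data each model makes one mistake on a different post, so the vote corrects all three. Real gains depend on how correlated the models' errors are.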


Bibliographic details

Source
World multi-conference on systemics, cybernetics and informatics | 2017 | pp. 147-152 | 6 pages
Author
Yonas WOLDEMARIAM
Original format
PDF
Keywords
text analysis; NLP; machine learning; naive Bayes; decision trees; K-nearest neighbors


Similar literature


1. Predicting Personality Traits of Facebook Users Using Text Mining [J]. Reinert Yosua Rumagit, Abba Suganda Girsang. Journal of Theoretical and Applied Information Technology. 2018, No. 20
2. Semi-literate Texting (SLT): Survey based text message dataset from digitally semi-literate users in India [J]. Prawaal Sharma, Navneet Goyal, Vinay MR. Data in Brief. 2021, No. a
3. A cluster analysis of text message users based on their demand for text messaging: A behavioral economic approach [J]. Hayashi Yusuke, Friedel Jonathan E., Foreman Anne M. Journal of the Experimental Analysis of Behavior. 2019, No. 3
4. Predicting User Competence from Text [C]. Yonas WOLDEMARIAM. World multi-conference on systemics, cybernetics and informatics. 2017
5. Assessing users and uses of electronic text: In case of the Japanese Text Initiative, Japanese classics electronic text on the World Wide Web [D]. Noguchi, Sachie. 2001
6. Multi-dimensional classification of biomedical text: Toward automated practical provision of high-utility text to diverse users [O]. Hagit Shatkay, Fengxia Pan, Andrey Rzhetsky
7. When Bitcoin encounters information in an online forum: Using text mining to analyse user opinions and predict value fluctuation [O]. Young Bin Kim, Jurim Lee, Nuri Park. 2017
8. Text and Illustration Processing System (TIPS) User's Manual. Volume 1. Text Processing System [R]. Brown, C. J., Cox, R. 1981
