Building a Question-Answering Corpus Using Social Media and News Articles

Abstract

Is it possible to develop a reliable QA-CORPUS using social media data? What challenges arise when attempting such a task? In this paper, we discuss these questions and present our findings from developing a QA-CORPUS on the topic of Brazilian finance. To populate our corpus, we relied on opinions from experts on Brazilian finance who are active on Twitter. From these experts, we extracted information from news websites that is used as answers in the corpus. Moreover, to rank answers to questions effectively, we employ novel word-vector-based similarity measures between short sentences (a category that covers both questions and Tweets). We validated our methods on a recently released dataset of similarity between short Portuguese sentences. Finally, we discuss the effectiveness of our approach when used to rank answers to questions from real users.
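
The abstract does not spell out the similarity measures, so the following is only a minimal sketch of the general idea of ranking candidate answers against a question with word vectors. It assumes a common baseline (averaged word vectors compared by cosine similarity), not the authors' novel measures. The vectors in `TOY_VECTORS` and the function names are hypothetical and chosen purely for illustration.

```python
import math

# Toy 3-dimensional word vectors: hypothetical values, not trained embeddings.
TOY_VECTORS = {
    "inflation": [0.9, 0.1, 0.0],
    "rises": [0.7, 0.2, 0.1],
    "prices": [0.8, 0.2, 0.0],
    "increase": [0.7, 0.3, 0.1],
    "football": [0.0, 0.1, 0.9],
    "match": [0.1, 0.0, 0.8],
}


def sentence_vector(sentence, vectors):
    """Average the vectors of the known words; return None if none is known."""
    known = [vectors[w] for w in sentence.lower().split() if w in vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_answers(question, answers, vectors):
    """Return (score, answer) pairs sorted from most to least similar."""
    q_vec = sentence_vector(question, vectors)
    scored = []
    for answer in answers:
        a_vec = sentence_vector(answer, vectors)
        # Sentences with no known words get a neutral score of 0.0.
        if q_vec is None or a_vec is None:
            score = 0.0
        else:
            score = cosine(q_vec, a_vec)
        scored.append((score, answer))
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


ranked = rank_answers(
    "inflation rises",
    ["football match", "prices increase"],
    TOY_VECTORS,
)
for score, answer in ranked:
    print(f"{score:.3f}  {answer}")
```

With these toy vectors, the finance-related candidate ranks above the unrelated one. A real system would load embeddings trained on Portuguese text and would likely need tokenization suited to Tweets.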


Bibliographic details

Source
International Workshop on Computational Processing of the Portuguese Language | 2016 | 398p | 6 pages in total
Authors
Paulo Cavalin; Flavio Figueiredo; Maira de Bayser; Luis Moyano; Heloisa Candello; Ana Appel; Renan Souza
Original format
PDF
Chinese Library Classification
TP304.6-53
Keywords
Question and Answer; Social media; Finance

Similar literature

1. Building semantically annotated corpus for text classification of Indian defence news articles [J]. Saurabh A. Kanekar, Alind Sharma, Gaurang S. Patkar. International Journal of Information Technology, 2021, No. 4.
2. Does Too Much News on Social Media Discourage News Seeking? Mediating Role of News Efficacy Between Perceived News Overload and News Avoidance on Social Media [J]. Chang Sup Park. Social Media + Society, 2019, No. 3.
3. Analysing headlines as a way of downsizing news corpora: Evidence from an Arabic-English comparable corpus of newspaper articles [J]. Haider Ahmad S., Hussein Riyad F. Literary & linguistic computing, 2020, No. 4.
4. Building a Question-Answering Corpus Using Social Media and News Articles [C]. Paulo Cavalin, Flavio Figueiredo, Maira de Bayser. International conference on computational processing of portuguese, 2016.
5. The news media meets social media: A content analysis of news outlets on Facebook [D]. Murray, Danielle L. 2017.
6. Stance markers in English medical research articles and newspaper opinion columns: A comparative corpus-based study [O]. Qian Shen, Yating Tao. 2021.
7. Building English-Vietnamese Named Entity Corpus with Aligned Bilingual News Articles [O]. Quoc Hung Ngo, Ho Chi Minh, Dinh Dien. 2015.
