首页> 美国卫生研究院文献>other >Enriching consumer health vocabulary through mining a social QA site: a similarity-based approach
【2h】

Enriching consumer health vocabulary through mining a social QA site: a similarity-based approach

机译:通过挖掘社交问答网站来丰富消费者健康词汇:基于相似度的方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The widely known vocabulary gap between health consumers and healthcare professionals hinders information seeking and health dialogue of consumers on end-user health applications. The Open Access and Collaborative Consumer Health Vocabulary (OAC CHV), which contains health-related terms used by lay consumers, has been created to bridge such a gap. Specifically, the OAC CHV facilitates consumers’ health information retrieval by enabling consumer-facing health applications to translate between professional language and consumer friendly language. To keep up with the constantly evolving medical knowledge and language use, new terms need to be identified and added to the OAC CHV. User-generated content on social media, including social question and answer (social Q&A) sites, afford us an enormous opportunity in mining consumer health terms. Existing methods of identifying new consumer terms from text typically use ad-hoc lexical syntactic patterns and human review. Our study extends an existing method by extracting n-grams from a social Q&A textual corpus and representing them with a rich set of contextual and syntactic features. Using K-means clustering, our method, simiTerm, was able to identify terms that are both contextually and syntactically similar to the existing OAC CHV terms. We tested our method on social Q&A corpora on two disease domains: diabetes and cancer. Our method outperformed three baseline ranking methods. A post-hoc qualitative evaluation by human experts further validated that our method can effectively identify meaningful new consumer terms on social Q&A.
机译:健康消费者和医疗保健专业人员之间广为人知的词汇鸿沟阻碍了消费者在最终用户健康应用程序上的信息搜索和健康对话。创建了开放访问和协作消费者健康词汇表(OAC CHV),其中包含非专业消费者使用的与健康相关的术语,以弥合这种差距。特别是,OAC CHV通过使面向消费者的健康应用程序能够在专业语言和对消费者友好的语言之间进行转换,来促进消费者的健康信息检索。为了跟上不断发展的医学知识和语言使用的需要,需要确定新术语并将其添加到OAC CHV中。用户在社交媒体上生成的内容,包括社交问答(社交问答)网站,为我们提供了挖掘消费者健康术语的巨大机会。从文本中识别新的消费者术语的现有方法通常使用临时词法句法模式和人工检查。我们的研究通过从社交问答文本语料库中提取n-gram并用丰富的上下文和句法特征表示它们来扩展现有方法。使用K-means聚类,我们的方法simiTerm能够识别与现有OAC CHV术语在上下文和语法上均相似的术语。我们在两个疾病领域的社交问答集上测试了我们的方法:糖尿病和癌症。我们的方法优于三种基准排名方法。由人类专家进行的事后定性评估进一步证明,我们的方法可以有效地识别有关社会问答的有意义的新消费者术语。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号