Utility of General and Specific Word Embeddings for Classifying Translational Stages of Research.

机译：通用词词嵌入和特定词词嵌入对分类研究翻译阶段的效用。

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Conventional text classification models make a bag-of-words assumption reducing text into word occurrence counts per document. Recent algorithms such as word2vec are capable of learning semantic meaning and similarity between words in an entirely unsupervised manner using a contextual window and doing so much faster than previous methods. Each word is projected into vector space such that similar meaning words such as “strong” and “powerful” are projected into the same general Euclidean space. Open questions about these embeddings include their utility across classification tasks and the optimal properties and source of documents to construct broadly functional embeddings. In this work, we demonstrate the usefulness of pre-trained embeddings for classification in our task and demonstrate that custom word embeddings, built in the domain and for the tasks, can improve performance over word embeddings learnt on more general data including news articles or Wikipedia.

机译：常规的文本分类模型做出了一个词袋假设，将文本减少为每个文档的单词出现次数。诸如word2vec之类的最新算法能够使用上下文窗口以完全不受监督的方式学习单词之间的语义和相似性，并且比以前的方法快得多。每个单词都投影到向量空间中，以便将类似含义的单词（例如“强”和“有力”）投影到相同的一般欧几里得空间中。关于这些嵌入的未解决问题包括它们在分类任务中的效用以及构造广泛功能性嵌入的最佳属性和文档来源。在这项工作中，我们展示了预训练的嵌入对于任务分类的有用性，并展示了在领域和任务中内置的自定义单词嵌入相对于从更广泛的数据（包括新闻报道或Wikipedia）上学习的单词嵌入，可以提高性能。。

著录项

期刊名称 AMIA Annual Symposium Proceedings
作者
Vincent Major; Alisa Surkis; Yindalon Aphinyanaphongs;
展开▼
作者单位

展开▼
年(卷),期 2018(2018),-1
年度 2018
页码 1405–1414
总页数 10
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Projection Word Embedding Model With Hybrid Sampling Training for Classifying ICD-10-CM Codes: Longitudinal Observational Study [J] . Chin Lin, Yu-Sheng Lou, Dung-Jang Tsai, JMIR Medical Informatics . 2019,第3期

机译：ICD-10-CM代码分类的混合采样训练投影词嵌入模型：纵向观察研究
2. Deep Learning- and Word Embedding-Based Heterogeneous Classifier Ensembles for Text Classification [J] . Kilimci Zeynep H., Akyokus Selim Complexity . 2018,第1期

机译：基于深度学习和词嵌入的异构分类器集成
3. Deep Learning- and Word Embedding-Based Heterogeneous Classifier Ensembles for Text Classification [J] . Kilimci Zeynep H., Akyokus Seim Complexity . 2018,第2期

机译：基于深入的学习和Word嵌入的异构分类器组合文本分类
4. Enriching Word Sense Embeddings with Translational Context [C] . Mehdi Ghanimifard, Richard Johansson International conference on recent advances in natural language processing . 2015

机译：通过翻译上下文丰富词义嵌入
5. Specificity of the b Test, Dot Counting Test, Rey 15-Item Test Plus Recognition, and Rey Word Recognition Test in Monolingual Spanish Speakers Embedded Measure of Effort [D] . Robles, Luz Alehida 2013

机译：b语言测试，点计数测试，Rey 15项测试加识别和Rey单词识别测试在说西班牙语的嵌入式工作量中的特异性
6. A Word on Words in Words: How Do Embedded Words Affect Reading? [O] . Joshua Snell, Jonathan Grainger, Mathieu Declerck 2018

机译：单词中的单词：嵌入式单词如何影响阅读？
7. MetaMLP: A fast word embedding based classifier to profile target gene databases in metagenomic samples [O] . G. A. Arango-Argoty, L. S. Heath, A. Pruden, 2019

机译：METAMLP：将基于基于分类器的快速单词嵌入到METAGENOMIC样本中的目标基因数据库

Utility of General and Specific Word Embeddings for Classifying Translational Stages of Research.

摘要

著录项

相似文献

相关主题

期刊订阅