首页> 外国专利> THEMATIC MODELS WITH A PRIORI TONALITY PARAMETERS BASED ON DISTRIBUTED REPRESENTATIONS

THEMATIC MODELS WITH A PRIORI TONALITY PARAMETERS BASED ON DISTRIBUTED REPRESENTATIONS

机译:基于分布表示的优先级优先级参数的主题模型

摘要

FIELD: data processing.;SUBSTANCE: invention relates to means for thematic modelling with a priori tone parameters based on distributed representations. Text document is inserted into a thematic model and a presentation for each word in the text document is determined by the thematic model, wherein the representations are word vectors in the semantic space. Assessing presentations using a priori tone parameters to determine a theme corresponding to said text document, wherein the topic model comprises a priori tonality parameters, trained based on representations distributed using a regularizer, which sets the same tonality to words having similar word vectors, and wherein each a priori tonality parameter is the same for words having similar word vectors.;EFFECT: technical result consists in detecting a greater number of aspect-oriented tonal words and further improved classification.;8 cl, 5 dwg, 9 tbl
机译:技术领域本发明涉及用于基于分布式表示的具有先验音调参数的主题建模的装置。将文本文档插入到主题模型中,并由主题模型确定文本文档中每个单词的表示形式,其中表示形式是语义空间中的单词向量。使用先验音调参数来评估演示文稿以确定与所述文本文档相对应的主题,其中,主题模型包括先验音调参数,这些参数是基于使用正则化器分布的表示而训练的,该先验音调参数为具有相似词向量的词设置相同的音调,并且,其中对于具有相似单词向量的单词,每个先验音调参数都相同。效果:技术成果在于检测更多数量的面向方面的音调单词并进一步改善了分类。8 cl,5 dwg,9 tbl

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号