SEMANTIC REPRESENTATION MODEL-BASED TEXT CLASSIFICATION METHOD AND APPARATUS, AND COMPUTER DEVICE

Abstract

Disclosed are a semantic representation model-based text classification method and apparatus, a computer device, and a storage medium. The method comprises: acquiring input original text and preprocessing it to obtain a word sequence; computing a vector wi and generating a text embedding vector sequence {w1, w2, ..., wn}; inputting the word sequence into a preset knowledge embedding model to obtain an entity embedding vector sequence {e1, e2, ..., en}; inputting the text embedding vector sequence into an M-layer word-granularity encoder to obtain an intermediate text embedding vector sequence; inputting the intermediate text embedding vector sequence and the entity embedding vector sequence into an N-layer knowledge-granularity encoder to obtain a final text embedding vector sequence and a final entity embedding vector sequence; and inputting the final text embedding vector sequence and the final entity embedding vector sequence into a classification model to obtain a text classification result. The accuracy of text classification is thereby improved.
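The data flow in the abstract can be illustrated with a small, dependency-free Python sketch. It is only a toy under stated assumptions, not the patented implementation: the hash-seeded `embed` function stands in for learned embeddings, `KNOWN_ENTITIES` is a hypothetical knowledge base, the attention layers carry no trained weights, and the `tanh(w + e)` fusion step is an assumed way of mixing text and entity vectors, since the abstract does not specify one.

```python
import math
import random

DIM = 8  # toy embedding size (assumption)

def preprocess(text):
    """Lowercase and split on whitespace to get a word sequence."""
    return text.lower().split()

def embed(word, salt):
    """Deterministic stand-in for a learned embedding (hypothetical)."""
    rng = random.Random(f"{salt}:{word}")
    return [rng.uniform(-1.0, 1.0) for _ in range(DIM)]

# Hypothetical knowledge base: only these words have entity embeddings.
KNOWN_ENTITIES = {"beijing", "china"}

def entity_embed(word):
    """Entity vector e_i; a zero vector when the word is not a known entity."""
    return embed(word, "entity") if word in KNOWN_ENTITIES else [0.0] * DIM

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [x / total for x in exps]

def self_attention(seq):
    """Weight-free scaled dot-product self-attention plus a residual connection."""
    out = []
    for q in seq:
        scores = softmax([sum(a * b for a, b in zip(q, k)) / math.sqrt(DIM) for k in seq])
        ctx = [sum(s * v[d] for s, v in zip(scores, seq)) for d in range(DIM)]
        out.append([x + c for x, c in zip(q, ctx)])
    return out

def word_encoder(ws, m_layers):
    """M-layer word-granularity encoder: text vectors only."""
    for _ in range(m_layers):
        ws = self_attention(ws)
    return ws

def knowledge_encoder(ws, es, n_layers):
    """N-layer knowledge-granularity encoder: mixes text and entity vectors."""
    for _ in range(n_layers):
        ws = self_attention(ws)
        # Assumed fusion: combine each token with its aligned entity vector.
        fused = [[math.tanh(w + e) for w, e in zip(wv, ev)] for wv, ev in zip(ws, es)]
        ws = fused
        # Only positions that hold an entity receive an updated entity vector.
        es = [f if any(ev) else ev for f, ev in zip(fused, es)]
    return ws, es

def classify(ws, es, labels):
    """Mean-pool both final sequences, then apply a toy linear layer and softmax."""
    pooled = [sum(c) / len(ws) for c in zip(*ws)] + [sum(c) / len(es) for c in zip(*es)]
    logits = []
    for label in labels:
        weights = embed(label, "clf") + embed(label, "clf-entity")  # untrained weights
        logits.append(sum(p * w for p, w in zip(pooled, weights)))
    return dict(zip(labels, softmax(logits)))

def classify_text(text, labels, m_layers=2, n_layers=2):
    words = preprocess(text)                       # word sequence
    ws = [embed(w, "text") for w in words]         # {w1, ..., wn}
    es = [entity_embed(w) for w in words]          # {e1, ..., en}
    ws = word_encoder(ws, m_layers)                # intermediate text embeddings
    ws, es = knowledge_encoder(ws, es, n_layers)   # final text and entity embeddings
    return classify(ws, es, labels)

if __name__ == "__main__":
    print(classify_text("Beijing is the capital of China", ["geography", "sports"]))
```

Because nothing here is trained, the printed probabilities carry no meaning; the sketch only shows how the word sequence, the two embedding sequences, the two encoder stacks, and the classifier connect.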

Bibliographic data

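  • Publication number: WO2021051503A1
  • Patent type:
  • Publication date: 2021-03-25
  • Original format: PDF
  • Applicant/Patentee: PING AN TECHNOLOGY (SHENZHEN) CO. LTD.
  • Application number: WO2019CN116339
  • Inventors: DENG YUE; JIN GE; XU LIANG
  • Filing date: 2019-11-07
  • Classification: G06F16/36
  • Country: CN
  • Date added to database: 2022-08-24 17:57:13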
