Research on calculation method of text similarity based on smooth inverse frequency

Yuan Ye; Yu Minmin; Liu Jiming

Abstract

To improve the accuracy of text similarity calculation, this paper presents a sentence-vector-based text similarity function, part of speech and word order-smooth inverse frequency (PO-SIF), which optimizes the classical SIF calculation method in two respects: part of speech and word order. The classical SIF algorithm calculates sentence similarity by obtaining a sentence vector through weighting and noise reduction. However, different weighting or noise-reduction methods affect both the efficiency and the accuracy of the similarity calculation. In the proposed PO-SIF, the weight parameters of the SIF sentence vector are first updated by a part-of-speech subtraction factor in order to determine the most crucial words. Furthermore, PO-SIF takes word order into account when calculating sentence vector similarity, which overcomes the drawback of similarity analysis that relies mostly on word frequency. The experimental results validate that the proposed PO-SIF improves the accuracy of text similarity calculation.
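The weighting and noise-reduction steps of classical SIF mentioned in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the commonly used SIF weight a / (a + p(w)) and removal of the first principal component, and the helper names (`sif_weight`, `weighted_sentence_vectors`, `remove_first_component`, `cosine`), the toy word vectors, and the toy word probabilities are all hypothetical. The part-of-speech factor and the word-order term of PO-SIF are not specified in the abstract, so they are deliberately left out.

```python
import numpy as np


def sif_weight(p, a=1e-3):
    """SIF weight a / (a + p): frequent words (large p) get small weights."""
    return a / (a + p)


def weighted_sentence_vectors(sentences, word_vecs, word_prob, a=1e-3):
    """Weighted average of word vectors, one row per tokenized sentence.

    Assumptions of this sketch: out-of-vocabulary tokens are skipped, and a
    token missing from word_prob is treated as having probability 0.
    """
    dim = len(next(iter(word_vecs.values())))
    rows = []
    for tokens in sentences:
        known = [t for t in tokens if t in word_vecs]
        if not known:
            rows.append(np.zeros(dim))
            continue
        weights = np.array([sif_weight(word_prob.get(t, 0.0), a) for t in known])
        vecs = np.array([word_vecs[t] for t in known], dtype=float)
        rows.append(weights @ vecs / len(known))
    return np.vstack(rows)


def remove_first_component(X):
    """Noise reduction: subtract each row's projection onto the first
    right singular vector of the sentence matrix X."""
    _, _, vt = np.linalg.svd(X, full_matrices=False)
    u = vt[0]
    return X - np.outer(X @ u, u)


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    return float(u @ v / denom) if denom > 0 else 0.0


# Toy data (hypothetical): 3-dimensional "word vectors" and unigram probabilities.
word_vecs = {
    "the": [0.3, 0.3, 0.3],
    "cat": [1.0, 0.0, 0.0],
    "dog": [0.9, 0.1, 0.0],
    "sat": [0.0, 1.0, 0.0],
    "ran": [0.0, 0.8, 0.2],
}
word_prob = {"the": 0.1, "cat": 0.01, "dog": 0.01, "sat": 0.005, "ran": 0.005}
sentences = [["the", "cat", "sat"], ["the", "dog", "ran"], ["the", "cat", "ran"]]

emb = remove_first_component(
    weighted_sentence_vectors(sentences, word_vecs, word_prob)
)
sim = cosine(emb[0], emb[1])  # SIF similarity of the first two sentences
```

In practice the word vectors would come from a trained model such as word2vec (listed in the keywords) and the probabilities from corpus counts. The component removal only makes sense over a reasonably large batch of sentences, since with a single sentence it would zero out the vector.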

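Bibliographic record

Source
《中国邮电高校学报：英文版》 (The Journal of China Universities of Posts and Telecommunications, English Edition) | 2020, Issue 2 | pp. 56-64 | 9 pages
Authors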
Yuan Ye; Yu Minmin; Liu Jiming;
Author affiliation

Key Laboratory of E-commerce and Modern Logistics, Chongqing University of Posts and Telecommunications, Chongqing 400065, China

Original format: PDF
Text language: chi
Chinese Library Classification: Text information processing;
Keywords
word2vec; SIF; part-of-speech; word order similarity;
