String Similarity Computing Based on Position And Cosine

Abstract

An e-business platform needs product-selection functionality based on product features and cost performance, and data must also be cleaned during production and sales, so computing the similarity between products is important. This paper proposes a new method for computing string similarity: each string is segmented into words, the corresponding word positions are numbered, and the string is converted into a vector. The similarity between two strings is then computed as the cosine of the angle between their two vectors. Experiments show that the method avoids the maximum or minimum values of LCS and GST. In addition, the proposed method improves the accuracy of the similarity calculation.
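The abstract outlines a pipeline of word segmentation, position numbering, vectorization, and cosine computation, but it does not give the exact encoding. The following is a minimal sketch under stated assumptions, not the authors' implementation: strings are split on whitespace, each vector has one component per word in the combined vocabulary, and a component holds the 1-based position of the word's first occurrence (0 if the word is absent). The function names are hypothetical.

```python
import math


def position_vector(words, vocabulary):
    """Encode a word list as a vector over a shared vocabulary.

    Assumed encoding (not taken from the paper): each component is the
    1-based position of the word's first occurrence, or 0 if absent.
    """
    first_pos = {}
    for index, word in enumerate(words, start=1):
        first_pos.setdefault(word, index)
    return [first_pos.get(word, 0) for word in vocabulary]


def cosine(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # an empty string has no direction; treat as dissimilar
    return dot / (norm_u * norm_v)


def position_cosine_similarity(s1, s2):
    """Similarity of two strings from position-encoded word vectors."""
    words1 = s1.lower().split()  # assumed segmentation: whitespace split
    words2 = s2.lower().split()
    vocabulary = sorted(set(words1) | set(words2))
    return cosine(position_vector(words1, vocabulary),
                  position_vector(words2, vocabulary))
```

Under this assumed encoding, two strings with the same words in a different order score below 1, which is the kind of order sensitivity that numbering positions is presumably meant to provide. Strings with no words in common score 0.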

Bibliographic details

Source
IEEE International Conference on Electronics Information and Emergency Communication | 2017 | 602p | 6 pages
Authors
Na Cheng; Zhongqing Yu; Kaixi Wang
Original format: PDF
Chinese Library Classification: TN91-53
Keywords
Angle cosine; Position encoding; Approximately duplicate records; Data cleaning; Products select
