IEEE International Conference on Semantic Computing

On-Device Sentence Similarity for SMS Dataset


Abstract

Determining the similarity between Short Message Service (SMS) texts plays a significant role in the mobile device industry. Gauging the similarity between SMS messages is necessary for applications such as enhanced search and navigation, and for grouping messages of a similar type, irrespective of sender, under a custom label or tag provided by the user. The difficulty with SMS data is its incomplete structure and grammatical inconsistency. In this paper, we propose a unique pipeline for evaluating the text similarity between SMS texts. We use a Part-of-Speech (POS) model for keyword extraction, taking advantage of the partial structure embedded in SMS texts, and carry out similarity comparisons using statistical methods. The proposed pipeline handles major semantic variations across SMS data and is effective for on-device (mobile phone) use. To showcase its capabilities, we designed the pipeline with an inclination towards one of the possible applications of SMS text similarity, discussed in a later section, while keeping it scalable to other applications.
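
The abstract names two stages, POS-based keyword extraction followed by a statistical similarity comparison, but does not specify the tagger or the statistic. The sketch below is only an illustration of that two-stage shape under stated assumptions: a tiny hand-written function-word lexicon (`FUNCTION_WORDS`, hypothetical) stands in for a real POS model, and Jaccard overlap stands in for the unspecified statistical method. None of the names come from the paper.

```python
import re

# ASSUMPTION: toy closed-class lexicon standing in for a real POS model.
# The paper's actual tagger and tag set are not given in the abstract.
FUNCTION_WORDS = {
    "a": "DET", "an": "DET", "the": "DET",
    "is": "AUX", "are": "AUX", "was": "AUX", "be": "AUX",
    "has": "AUX", "been": "AUX",
    "to": "PART", "of": "ADP", "in": "ADP", "on": "ADP", "at": "ADP",
    "for": "ADP", "by": "ADP", "with": "ADP", "from": "ADP",
    "your": "PRON", "you": "PRON", "we": "PRON", "it": "PRON",
    "and": "CCONJ", "or": "CCONJ",
}


def tag(tokens):
    """Assign a coarse tag to each token; unknown words count as content."""
    tagged = []
    for tok in tokens:
        if tok in FUNCTION_WORDS:
            tagged.append((tok, FUNCTION_WORDS[tok]))
        elif tok.isdigit():
            tagged.append((tok, "NUM"))
        else:
            tagged.append((tok, "CONTENT"))
    return tagged


def extract_keywords(sms):
    """Lowercase, tokenize, and keep only tokens tagged as content words."""
    tokens = re.findall(r"[a-z0-9]+", sms.lower())
    return {tok for tok, pos in tag(tokens) if pos == "CONTENT"}


def jaccard(a, b):
    """Jaccard overlap of two keyword sets (ASSUMPTION: the chosen statistic)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def sms_similarity(s1, s2):
    """Similarity of two SMS texts via keyword-set overlap."""
    return jaccard(extract_keywords(s1), extract_keywords(s2))


if __name__ == "__main__":
    shipped_a = "Your order 4521 has been shipped"
    shipped_b = "Order 9983 is shipped from the warehouse"
    meeting = "Meeting at 5 pm today"
    print(sms_similarity(shipped_a, shipped_b))
    print(sms_similarity(shipped_a, meeting))
```

Dropping numbers and function words means two shipping notifications with different order IDs still share their keywords, while an unrelated message shares none. Set overlap needs no model weights at comparison time, which is one plausible reason a statistical measure suits on-device use; a real pipeline would swap in a trained tagger and whichever statistic the paper actually uses.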
