首页> 外文会议>IEEE International Conference on Semantic Computing >On-Device Tag Generation for Unstructured Text
【24h】

On-Device Tag Generation for Unstructured Text

机译:非结构化文本的设备标签生成

获取原文

摘要

With the overwhelming transition to smart phones, storing important information in the form of unstructured text has become habitual to users of mobile devices. From grocery lists to drafts of emails and important speeches, users store a lot of data in the form of unstructured text (for eg: in the Notes application) on their devices, leading to cluttering of data. This not only prevents users from efficient navigation in the applications but also precludes them from perceiving the relations that could be present across data in those applications. This paper proposes a novel pipeline to generate a set of tags using world knowledge based on the keywords and concepts present in unstructured textual data. These tags can then be used to summarize, categorize or search for the desired information thus enhancing user experience by allowing them to have a holistic outlook of the kind of information stored in the form of unstructured text. In the proposed system, we use an on-device (mobile phone) efficient CNN model with pruned ConceptNet resource to achieve our goal. The architecture also presents a novel ranking algorithm to extract the top n tags from any given text.
机译:随着向智能手机的压倒性过渡,以非结构化文本的形式存储重要信息已经成为移动设备用户的习惯性。从杂货列表到电子邮件的草稿和重要演讲,用户在其设备上以非结构化文本的形式存储大量数据(例如:在Notes应用程序中),导致数据的杂乱。这不仅可以防止用户在应用程序中有效导航,而且禁止它们感知这些应用程序中可能存在的关系。本文提出了一种新的管道,用于基于非结构化文本数据中存在的关键字和概念使用世界知识生成一组标签。然后,这些标签可以用于总结,分类或搜索所需的信息,从而通过允许它们具有以非结构化文本的形式具有存储的信息的整体前景来增强用户体验。在所提出的系统中,我们使用一个设备(移动电话)高效的CNN模型与修剪修剪的概念资源来实现我们的目标。该架构还提出了一种新颖的排名算法,用于从任何给定文本中提取顶部n标签。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号