首页> 外国专利> Suffix tree similarity measure for document clustering

Suffix tree similarity measure for document clustering

机译:用于文档聚类的后缀树相似性度量

摘要

The subject innovation provides for systems and methods to facilitate weighted suffix tree clustering. Conventional suffix tree cluster models can be augmented by incorporating quality measures to facilitate improved performance. Further the quality measure can be employed in determining cluster labels that show improvements in accuracy over conventional means. Additionally “stopnodes” can be defined to facilitate traversing suffix tree models efficiently. Quality measurements can be determined based in part on weighting factors applied to terms in a vector model, said terms being mapped from a suffix tree model.
机译:本发明提供了有助于加权后缀树聚类的系统和方法。可以通过合并质量度量来增强常规后缀树群集模型,以促进改进的性能。此外,可以使用质量度量来确定聚类标签,该聚类标签显示出优于常规手段的准确性。另外,可以定义“停止节点”以方便有效地遍历后缀树模型。可以部分地基于应用于矢量模型中的项的加权因子来确定质量测量,所述项是从后缀树模型映射的。

著录项

  • 公开/公告号US8676815B2

    专利类型

  • 公开/公告日2014-03-18

    原文格式PDF

  • 申请/专利权人 XIAOTIE DENG;HUNG CHIM;

    申请/专利号US20090436722

  • 发明设计人 XIAOTIE DENG;HUNG CHIM;

    申请日2009-05-06

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 16:02:07

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号