首页> 外国专利> Method and System for Seed Based Clustering of Categorical Data

Method and System for Seed Based Clustering of Categorical Data

机译:基于种子的分类数据聚类方法和系统

摘要

A computerized method of representing a dataset with a taxonomy includes augmenting a dataset containing a plurality of records with a plurality of predetermined exemplars; representing the plurality of records and predetermined exemplars within the augmented dataset as a plurality of clusters in an initial taxonomy layer; generating a truncated hierarchy of cluster sets based on clusters within the initial taxonomy layer, wherein clusters within the truncated hierarchy contain no more than a predetermined number of exemplars; and labeling clusters within the truncated hierarchy.
机译:一种用分类法表示数据集的计算机化方法,包括:用多个预定示例来扩充包含多个记录的数据集;将扩展数据集中的多个记录和预定示例表示为初始分类层中的多个聚类;基于初始分类学层内的聚类生成聚类集的删减层次结构,其中,该删减的层次结构内的聚类包含不超过预定数量的样本;并在截断的层次结构中标记群集。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号