首页> 外国专利> CLUSTERING HIGH DIMENSIONAL DATA USING GAUSSIAN MIXTURE COPULA MODEL WITH LASSO BASED REGULARIZATION

CLUSTERING HIGH DIMENSIONAL DATA USING GAUSSIAN MIXTURE COPULA MODEL WITH LASSO BASED REGULARIZATION

机译:使用高斯混合Copula模型和基于lass的调节来聚类高维数据

摘要

LASSO constraints can lead to a Gaussian mixture copula model that is more robust, better conditioned, and more reflective of the actual clusters in the training data. These qualities of the GMCM have been shown with data obtained from: digital images of fine needle aspirates of breast tissue for detecting cancer; email for detecting spam; two dimensional terrain data for detecting hills and valleys; and video sequences of hand movements to detect gestures. Using training data, a GMCM estimate can be produced and iteratively refined to maximize a penalized log likelihood estimate until sequential iterations are within a threshold value of one another. The GMCM estimate can then be used to classify further samples. The LASSO constraints help keep the analysis tractibe such that useful results can be found and used while the result is still useful.
机译:LASSO约束可以导致高斯混合copula模型,该模型更健壮,条件更好,并且更能反映训练数据中的实际群集。 GMCM的这些质量已经从以下数据中获得了证明:乳腺组织细针抽吸物的数字图像,用于检测癌症;用于检测垃圾邮件的电子邮件;用于检测丘陵和山谷的二维地形数据;和视频序列的手部动作以检测手势。使用训练数据,可以生成GMCM估计值并进行迭代优化,以使惩罚对数似然估计值最大化,直到顺序迭代彼此之间都在阈值内。 GMCM估计值然后可以用于对其他样本进行分类。 LASSO约束有助于保持分析的准确性,以便在结果仍然有用的情况下可以找到并使用有用的结果。

著录项

  • 公开/公告号US2017293856A1

    专利类型

  • 公开/公告日2017-10-12

    原文格式PDF

  • 申请/专利权人 XEROX CORPORATION;

    申请/专利号US201615093302

  • 申请日2016-04-07

  • 分类号G06N99/00;G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 13:52:46

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号