首页> 外国专利> GROUND TRUTH QUALITY FOR MACHINE LEARNING MODELS

GROUND TRUTH QUALITY FOR MACHINE LEARNING MODELS

机译:机器学习模式的地面真理质量

摘要

Methods, systems and computer program products for improving ground truth quality for modeling are provided. Aspects include receiving a plurality of data inputs, wherein each of the plurality of data inputs has an associated label. Aspects also include training a model based on the plurality of data inputs. Aspects also include generating a plurality of vector representations corresponding to the plurality of data inputs based on the model. Aspects also include clustering the plurality of vector representations into one or more clusters. Aspects also include identifying at least one anomalous data input based on the one or more clusters. The at least one anomalous data input can be a data input of the plurality of data inputs that is mislabeled, contributes to an ambiguous class structure or is an outlier. Aspects also include outputting a notification that provides an indication of the at least one anomalous data input.
机译:提供了用于改善建模的地面真理质量的方法,系统和计算机程序产品。方面包括接收多个数据输入,其中多个数据输入中的每一个具有相关联的标签。方面还包括基于多个数据输入训练模型。方面还包括基于模型生成与多个数据输入对应的多个矢量表示。方面还包括将多个载体表示聚类为一个或多个簇。方面还包括基于一个或多个簇识别至少一个异常数据输入。至少一个异常数据输入可以是误标标记的多个数据输入的数据输入,有助于模糊的类结构或者是异常值。方面还包括输出提供至少一个异常数据输入的指示的通知。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号