首页>
外国专利>
DETERMINING CONFIDENT DATA SAMPLES FOR MACHINE LEARNING MODELS ON UNSEEN DATA
DETERMINING CONFIDENT DATA SAMPLES FOR MACHINE LEARNING MODELS ON UNSEEN DATA
展开▼
机译:确定未知数据上机器学习模型的机密数据样本
展开▼
页面导航
摘要
著录项
相似文献
摘要
Techniques are provided for determining confident data samples for machine learning (ML) models on unseen data. In one embodiment, a method is provided that comprises extracting, by a system comprising a processor, a feature vector for a data sample based on projection of the data sample onto a standard feature space. The method further comprises processing, by the system, the feature vector using an outlier detection model to determine whether the data sample is within a scope of a training dataset used to train a machine learning model, wherein the outlier detection model was trained using features extracted from the training dataset based on projection of data samples included in the training dataset onto the standard feature space.
展开▼