首页>
外国专利>
AUTOMATED NONPARAMETRIC CONTENT ANALYSIS FOR INFORMATION MANAGEMENT AND RETRIEVAL
AUTOMATED NONPARAMETRIC CONTENT ANALYSIS FOR INFORMATION MANAGEMENT AND RETRIEVAL
展开▼
机译:信息管理和检索的自动非参数内容分析
展开▼
页面导航
摘要
著录项
相似文献
摘要
Embodiments of the invention utilize a feature-extraction approach and/or a matching approach in combination with a nonparametric approach to estimate the proportion of documents in each of multiple labeled categories with high accuracy. The feature-extraction approach automatically generates continuously valued text features optimized for estimating the category proportions, and the matching approach constructs a matched set that closely resembles a data set that is unobserved based on an observed set, thereby improving the degree to which the distributions of the observed and unobserved sets resemble each other.
展开▼