SYSTEM AND METHOD FOR AUTOMATIC DATA ENRICHMENT FROM MULTIPLE PUBLIC DATASETS IN DATA INTEGRATION TOOLS
Abstract
A source dataset is enriched by standardization of address data, date and time analysis, and demographic analysis. The enriched source dataset is used to form one or more distinct clusters that are unique combinations of values for one or more attributes of the enriched source dataset. One or more related datasets are found for each of the clusters, and the related datasets are merged into the enriched source dataset using a distributed join operation, wherein the distributed join allows each row of the source dataset to be joined with a different one of the related datasets, where the different one of the related datasets is closest to the cluster to which the row belongs.
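The flow described in the abstract (form clusters, find a related dataset per cluster, join each row against the dataset chosen for its cluster) can be sketched roughly as follows. This is a minimal single-process illustration, not the patented implementation: the function names, the catalog layout, the `covers` field, the `_source` field, and the closeness measure (a count of matching attribute values) are all assumptions made for the example, and the sample values are invented. A real system would run the join in a distributed engine.

```python
from collections import defaultdict


def form_clusters(rows, attrs):
    """Group row indices by their unique combination of values for attrs."""
    clusters = defaultdict(list)
    for i, row in enumerate(rows):
        clusters[tuple(row[a] for a in attrs)].append(i)
    return dict(clusters)


def closest_dataset(cluster_key, attrs, catalog):
    """Pick the catalog entry that matches the most cluster values.

    Assumed closeness measure: number of attributes whose value in the
    entry's hypothetical 'covers' metadata equals the cluster's value.
    """
    def score(entry):
        return sum(entry["covers"].get(a) == v
                   for a, v in zip(attrs, cluster_key))
    return max(catalog, key=score)


def enrich(rows, attrs, catalog, join_key):
    """Join every row with the related dataset chosen for its cluster."""
    out = [dict(r) for r in rows]  # copy so the source rows stay untouched
    for key, indices in form_clusters(rows, attrs).items():
        related = closest_dataset(key, attrs, catalog)
        lookup = {rec[join_key]: rec for rec in related["records"]}
        for i in indices:
            match = lookup.get(out[i][join_key])
            if match:
                for col, val in match.items():
                    out[i].setdefault(col, val)  # never overwrite source data
            out[i]["_source"] = related["name"]
    return out


# Invented sample data, for illustration only.
rows = [
    {"zip": "94105", "state": "CA", "sales": 10},
    {"zip": "10001", "state": "NY", "sales": 7},
    {"zip": "94110", "state": "CA", "sales": 3},
]
catalog = [
    {"name": "ca_demographics", "covers": {"state": "CA"},
     "records": [{"zip": "94105", "median_income": 100},
                 {"zip": "94110", "median_income": 80}]},
    {"name": "ny_demographics", "covers": {"state": "NY"},
     "records": [{"zip": "10001", "median_income": 90}]},
]

enriched = enrich(rows, ["state"], catalog, "zip")
for row in enriched:
    print(row)
```

Because the related dataset is selected per cluster rather than per table, the two CA rows and the NY row end up joined against different datasets within the same pass, which is the property the abstract attributes to the distributed join.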