首页>
外国专利>
Data mining platform for knowledge discovery from heterogeneous data types and/or heterogeneous data sources
Data mining platform for knowledge discovery from heterogeneous data types and/or heterogeneous data sources
展开▼
机译:数据挖掘平台,用于从异构数据类型和/或异构数据源中发现知识
展开▼
页面导航
摘要
著录项
相似文献
摘要
The data mining platform comprises a plurality of system modules, each formed from a plurality of components. Each module has an input data component, a data analysis engine for processing the input data, an output data component for outputting the results of the data analysis, and a web server to access and monitor the other modules within the unit and to provide communication to other units. Each module processes a different type of data, for example, a first module processes microarray (gene expression) data while a second module processes biomedical literature on the Internet for information supporting relationships between genes and diseases and gene functionality. In the preferred embodiment, the data analysis engine is a kernel-based learning machine, and in particular, one or more support vector machines (SVMs). The data analysis engine includes a pre-processing function for feature selection, for reducing the amount of data to be processed by selecting the optimum number of attributes, or “features”, relevant to the information to be discovered.
展开▼