...
首页> 外文期刊>International Journal of Applied Engineering Research >Similarity Search using Cluster based Ensemble Classification
【24h】

Similarity Search using Cluster based Ensemble Classification

机译:使用基于群集的合成分类搜索相似性搜索

获取原文
获取原文并翻译 | 示例
           

摘要

An automatic retrieval system for similarity search using cluster based ensemble classifier is proposed in this work to help the doctors. The proposed system has designed with the modules (i) metric database construction (ii) cluster based ensemble classifier and (iii) similar image/test report retrieval. Metric database is constructed by first and second order features of lung tomography images available in the internet, heart, liver and diabetes datasets collected from UCI machine learning repository. Dataset is further reduced by association rule mining as a feature vector. The cluster based ensemble classifier technique includes two phases namely training and testing phase. The phase clusters the reduced feature vector into groups using clustering algorithms (k-means and fuzzy c-means). The defined cluster id is passed along with the metric data as an attribute value. These cluster vectors are further classified using ensemble classifier with the decision tree, SVM and bayesian classifiers. Accuracy obtained by cluster based ensemble classifier is 95.33% for lung images, 83.14% for diabetes and 86.97% for heart dataset and 93.32% for liver dataset which is higher than the accuracy obtained by clustering or by ensemble classifier (mixture of decision tree, SVM and Bayesian classifier) or by individual classifiers namely decision tree classifier, SVM or Bayesian classifier. Hence cluster based ensemble classifier is better for similarity search than other methods.
机译:在这项工作中提出了一种使用基于集群的集群分类器的相似性搜索的自动检索系统来帮助医生。所提出的系统设计了模块(i)度量数据库构建(ii)基于集群的集群分类器和(iii)类似的图像/测试报告检索。从UCI机器学习存储库中收集的互联网,心脏,肝脏和糖尿病数据集中可用的肺断层扫描图像的第一和二阶特征是由第一和二阶特征构成的。通过关联规则挖掘作为特征向量,数据集进一步减少。基于群集的集合分类器技术包括两个阶段即训练和测试阶段。将阶段簇将缩小的特征向量簇用聚类算法(K-means和模糊C-il算法)将缩小的特征向量簇聚集成组。定义的群集ID与度量数据一起传递为属性值。使用具有决策树,SVM和Bayesian分类器的集合分类器进一步分类这些群集向量。基于集群的集群分类获得的精度为肺图像的95.33%,糖尿病的83.14%,心脏数据集86.97%,肝脏数据集的93.32%高于通过聚类或通过集成分类器获得的精度(决策树,SVM的混合)和贝叶斯分类器)或单个分类器即决策树分类器,SVM或贝叶斯分类器。因此,基于群集的集合分类比其他方法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号