首页> 外文会议>International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems >Clustering-Based Data Reduction Approach to Speed up SVM in Classification and Regression Tasks
【24h】

Clustering-Based Data Reduction Approach to Speed up SVM in Classification and Regression Tasks

机译:基于聚类的数据约简方法可加快分类和回归任务中的支持向量机

获取原文

摘要

Support Vector Machine (SVM) is a popular machine learning algorithm, being able to tackle non-linear problem by use of appropriate kernels. However, the use of SVM can become unfeasible for many applications where relatively large datasets are used. Its application becomes particularly more prohibitive when considering environments where the hardware has strict memory and processing limitations. By appropriately reducing the size of the training set we can in succession speed up the learning and diagnosis of SVM. In this work we implement a data reduction approach using clustering to build a smaller and representative set. The approach is extended for both classification and regression problems. Results evaluated on both normal PC and low resource edge device showed a better performance with only a small loss in diagnosis accuracy for most cases. Still, on cases where a high loss was observed, the reduction approach allowed to regain the accuracy with a faster hyper-parameter optimization.
机译:支持向量机(SVM)是一种流行的机器学习算法,能够通过使用适当的内核来解决非线性问题。但是,对于许多使用相对较大的数据集的应用程序,使用SVM变得不可行。考虑到硬件具有严格的内存和处理限制的环境时,其应用变得特别禁止。通过适当减少训练集的大小,我们可以连续加快SVM的学习和诊断速度。在这项工作中,我们实现了一种使用聚类的数据缩减方法,以构建一个较小的代表性集合。该方法适用于分类和回归问题。在大多数情况下,在普通PC和低资源边缘设备上评估的结果均显示出更好的性能,而诊断准确性仅有很小的损失。尽管如此,在观察到高损失的情况下,减少方法允许通过更快的超参数优化来重新获得准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号