Nested Mini-Batch K-Means

机译：嵌套小批量K均值

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A new algorithm is proposed which accelerates the mini-batch k-means algorithm of Sculley (2010) by using the distance bounding approach of Elkan (2003). We argue that, when incorporating distance bounds into a mini-batch algorithm, already used data should preferentially be reused. To this end we propose using nested mini-batches, whereby data in a mini-batch at iteration t is automatically reused at iteration t + 1. Using nested mini-batches presents two difficulties. The first is that unbalanced use of data can bias estimates, which we resolve by ensuring that each data sample contributes exactly once to centroids. The second is in choosing mini-batch sizes, which we address by balancing premature fine-tuning of centroids with redundancy induced slow-down. Experiments show that the resulting nmbatch algorithm is very effective, often arriving within 1% of the empirical minimum 100 × earlier than the standard mini-batch algorithm.

机译：提出了一种新的算法，该算法通过使用Elkan（2003）的距离限制方法来加速Sculley（2010）的小批量k均值算法。我们认为，将距离限制合并到小批量算法中时，应该优先重用已使用的数据。为此，我们建议使用嵌套的迷你批处理，由此在迭代t处的微型批处理中的数据将在迭代t + 1处自动重用。使用嵌套的微型批处理存在两个困难。首先是数据的不均衡使用可能会使估计值产生偏差，我们通过确保每个数据样本对质心的贡献恰好一次来解决这一问题。第二个是选择小批量大小，我们通过平衡质心的过早微调与冗余导致的速度下降来解决。实验表明，所得的nmbatch算法非常有效，通常比标准的mini-batch算法提前100％达到经验最小值的1％之内。

著录项

来源
《Annual conference on Neural Information Processing Systems》|2016年|1360-1368|共9页
会议地点
作者
James Newling; Francois Fleuret;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. mbkmeans: Fast clustering for single cell data using mini-batch k-means [J] . Stephanie C. Hicks, Ruoxi Liu, Yuwei Ni¤, PLoS Computational Biology . 2021,第1期

机译：MBKMeans：使用Mini-Batch K-Means的单个小区数据快速聚类
2. The nested k-means method: A new approach for detecting lost persons in aerial images acquired by unmanned aerial vehicles [J] . Niedzielski Tomasz, Jurecka Mirosława, Stec Magdalena, Journal of Field Robotics . 2017,第8期

机译：嵌套k均值方法：一种检测无人驾驶飞机获取的空中图像中迷路人员的新方法
3. Mini-batch sample selection strategies for deep learning based speech recognition [J] . Dokuz Yesim, Tufekci Zekeriya Applied Acoustics . 2021,第Jana期

机译：基于深度学习的语音识别的迷你批量样本策略
4. Nested Mini-Batch K-Means [C] . James Newling, Francois Fleuret Annual conference on Neural Information Processing Systems . 2016

机译：嵌套迷你批量k-meanse
5. The Effect of the Mini-Batch Size on Deep Neural Networks Training. [D] . Soto, Philippe. 2017

机译：最小批量大小对深度神经网络训练的影响。
6. An approach on the implementation of full batch, online and mini-batch learning on a Mamdani based neuro-fuzzy system with center-of-sets defuzzification: Analysis and evaluation about its functionality, performance, and behavior [O] . Sukey Nakasima-López, Juan R. Castro, Mauricio A. Sanchez, 2012

机译：在基于Mamdani的神经模糊系统上进行全批处理，在线和小批量学习的方法，该系统具有集中心去模糊化：有关其功能，性能和行为的分析和评估
7. mbkmeans: Fast clustering for single cell data using mini-batch k-means [O] . Stephanie C. Hicks, Ruoxi Liu, Yuwei Ni, 2021

机译：MBKMeans：使用Mini-Batch K-Means的单个小区数据快速聚类

Nested Mini-Batch K-Means

摘要

著录项

相似文献

相关主题

期刊订阅