首页> 外国专利> Joint Architecture And Hyper-Parameter Search For Machine Learning Models

Joint Architecture And Hyper-Parameter Search For Machine Learning Models

机译:联合架构和超参数搜索机器学习模型

摘要

The present disclosure provides a differentiable joint hyper-parameter and architecture search approach, with some implementations including the idea of discretizing the continuous space into a linear combination of multiple categorical basis. One example element of the proposed approach is the use of weight sharing across all architecture- and hyper-parameters which enables it to search efficiently over the large joint search space. Experimental results on MobileNet/ResNet/EfficientNet/BERT show that the proposed systems significantly improve the accuracy by up to 2% on ImageNet and the F1 by up to 0.4 on SQuAD, with search cost comparable to training a single model. Compared to other AutoML methods, such as random search or Bayesian method, the proposed techniques can achieve better accuracy with 10× less compute cost.
机译:本公开提供了一种可分解的关节超参数和架构搜索方法,其中一些实现包括将连续空间离散地分成多个分类基础的线性组合的想法。 所提出的方法的一个示例元素是在所有架构和超参数上使用权重共享,这使得它能够在大型关节搜索空间上有效地搜索。 MobileNet / Reset / CeffectsNet / BERT上的实验结果表明,该系统在ImageNet上显着提高了最高2%的精度,并且在小队上高达0.4,搜索成本与训练单一模型相当。 与其他自动化方法相比,如随机搜索或贝叶斯方法,所提出的技术可以实现更好的准确性,以10倍减少计算成本。

著录项

  • 公开/公告号US2021383223A1

    专利类型

  • 公开/公告日2021-12-09

    原文格式PDF

  • 申请/专利权人 GOOGLE LLC;

    申请/专利号US202117337834

  • 申请日2021-06-03

  • 分类号G06N3/08;G06N3/04;

  • 国家 US

  • 入库时间 2022-08-24 22:42:46

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号