Online 3D Packing Problem Based on Bi-Value Guidance

Mingkai Qi; Liye Zhang

首页> 中文期刊> 《电脑和通信（英文）》 >Online 3D Packing Problem Based on Bi-Value Guidance

Online 3D Packing Problem Based on Bi-Value Guidance

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The online 3D packing problem has received increasing attention in recent years due to its practical value. However, the problem itself possesses some peculiar properties, such as sequential decision-making and the large size of the state space, which have made the use of reinforcement learning with Markov decision processes a popular approach for solving this problem. In this paper, we focus on the problem of high variance in value estimation caused by reward uncertainty in the presence of highly uncertain dynamics. To address this, proposed a solution based on auxiliary tasks and intrinsic rewards for the online 3D bin packing problem, guided by a binary-valued network, to assist the agent in learning the policy within the framework of actor-critic deep reinforcement learning. Specifically, the maintenance of two-valued networks and the utilization of multi-valued network estimates are employed to replace the original value estimates, aiming to provide better guidance for the learning of policy networks. Experimentally, it has been demonstrated that our model can achieve more robust learning and outperform previous works in terms of performance.

著录项

来源
《电脑和通信（英文）》 |2023年第7期|156-173|共18页
作者
Mingkai Qi; Liye Zhang;
展开▼
作者单位

School of Computer Science and Technology;

Shandong University of Technology;

Zibo;

China;

展开▼
原文格式 PDF
正文语种 chi
中图分类数学分析;
关键词
Deep Learning; Reinforcement Learning; Bin Packing; Value Estimation;

相似文献

中文文献
外文文献
专利

1. In-situ 3D contour measurement for laser powder bed fusion based on phase guidance [J] . Yuze Zhang ,Pan Zhang ,Xin Jiang . 力学快报:英文版 . 2023,第2期
2. Real-time continuous image guidance for endoscopic retrograde cholangiopancreatography based on 3D/2D registration and respiratory compensation [J] . Da-Ya Zhang ,Shuo Yang ,Hai-Xiao Geng . 世界胃肠病学杂志:英文版 . 2023,第期
3. Correlates of the School-Based Guidance Program to Freshman Students in Guangdong Province,China:Basis for Enhanced Guidance Program [J] . Cijiang Xiong . 当代教育研究(百图) . 2022,第9期
4. 求解online packing problem的F-B绝对近似算法 [J] . 黄海 ,李松斌 . 计算机工程与应用 . 2017,第11期
5. Online Public Opinion Guidance Strategy for College Students in the Era of We Media [J] . Yan Wu ,Yujiao Song ,Fang Wang . 电脑和通信（英文） . 2019,第12期
6. Research on Learning Guidance Based on I-E Mode in Higher Vocational Education [C] . Lin XIAO ,Jingxia CHEN ,Wenliang NIU . 2011年全国高等职业教育电子信息类专业学术暨教学研讨会 . 2011

Online 3D Packing Problem Based on Bi-Value Guidance

摘要

著录项

相似文献

相关主题

期刊订阅