Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016;
Shenyang Ligong University, Shenyang 110168;
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016;
Reinforcement learning; composite rules; mean tardiness; dynamic job-shop scheduling;