How to maximize reward rate on two variable-interval paradigms

机译：如何在两个可变间隔范式上最大化报酬率

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Without assuming any constraints on behavior, we derive the policy that maximizes overall reward rate on two variable-interval paradigms. The first paradigm is concurrent variable time-variable time with changeover delay. It is shown that for nearly all parameter values, a switch to the schedule with the longer interval should be followed immediately by a switch back to the schedule with the shorter interval. The matching law does not hold at the optimum and does not uniquely specify the obtained reward rate. The second paradigm is discrete trial concurrent variable interval-variable interval. For given schedule parameters, the optimal policy involves a cycle of a fixed number of choices of the schedule with the shorter interval followed by one choice of the schedule with the longer interval. Molecular maximization sometimes results in optimal behavior.

机译：在不假设任何行为约束的情况下，我们得出了在两个可变时间间隔范式上最大化总回报率的策略。第一个范例是具有转换延迟的并发可变时变时间。结果表明，对于几乎所有参数值，应立即切换到具有较长间隔的时间表，然后立即切换回具有较短间隔的时间表。匹配法则不能保持最佳状态，并且不能唯一地指定获得的奖励率。第二种范例是离散试验并发变量区间-可变区间。对于给定的时间表参数，最佳策略涉及一个周期，该周期具有固定间隔的较短时间的调度选择，然后是具有较长时间间隔的一个调度选择。分子最大化有时会导致最佳行为。

著录项

期刊名称 Journal of the Experimental Analysis of Behavior
作者
Alasdair I. Houston; John McNamara;
展开▼
作者单位

展开▼
年(卷),期 1981(35),3
年度 1981
页码 367–396
总页数 30
原文格式 PDF
正文语种
中图分类医学行为学;
关键词

相似文献

外文文献
中文文献
专利

1. Reward Rate Maximization and Optimal Transmission Policy of EH Device With Temporal Death in EH-WSNs [J] . Shengda Tang, Liansheng Tan IEEE transactions on wireless communications . 2017,第2期

机译：EH-WSNs中具有暂时性死亡的EH设备的奖励率最大化和最优传输策略
2. Tuning the speed-accuracy trade-off to maximize reward rate in multisensory decision-making [J] . Jan Drugowitsch, Gregory C DeAngelis, Dora E Angelaki, eLife journal . 2015,第september期

机译：调整速度精度的权衡以最大化多感官决策中的回报率
3. Reward-rate maximization in sequential identification under a stochastic deadline [J] . Dayanik S., Yu A.J. SIAM Journal on Control and Optimization . 2013,第4期

机译：随机期限内顺序识别中的奖励率最大化
4. Cross-layer channel selection and reward-based power allocation for maximizing system capacity and reward in 4G MIMO wireless communications [C] . Chang Ben-Jye, Liang Ying-Hsin, Jhuang Kai-Peng, International Conference on Information Science, Electronics and Electrical Engineering . 2014

机译：跨层信道选择和基于奖励的功率分配，可在4G MIMO无线通信中最大化系统容量和奖励
5. Factors that contribute to individual differences in responsiveness to cocaine and natural rewards in a reward comparison paradigm [D] . Schroy, Pearl Lee 2006

机译：在奖励比较范式中导致个体对可卡因和自然奖励的反应能力差异的因素
6. Maximizing present value: A model to explain why moderate response rates obtain on variable-interval schedules [O] . Alan Silberberg, Frederick R. Warren-Boulton, Toshio Asano 1988

机译：最大化现值：解释为什么在可变间隔时间表中获得中等响应率的模型
7. How to maximize reward rate on two variable-interval paradigms [O] . Houston, Alasdair I., McNamara, John 1981

机译：如何在两个可变间隔范式上最大化报酬率

How to maximize reward rate on two variable-interval paradigms

摘要

著录项

相似文献

相关主题

期刊订阅