Adaptive online optimization of Markov reward processes with application to pricing of multiclass loss network services.

Abstract

This work studies the problem of adaptive online optimization of Markov reward processes. The problem is the following: given a Markov chain whose transition probability matrix and expected cost per stage are functions of (1) a set of tunable parameters and (2) a set of unknown but fixed parameters, find the tunable parameters that maximize the observed average reward per stage. This work introduces techniques that improve the performance of existing simulation-based methods and that are robust to uncertainty in the system parameters. We show the almost sure convergence of the algorithms to locally optimal values, including in the adaptive case, and illustrate the tracking ability of the adaptive algorithm numerically.

The methodological work on online methods is applied to a significant optimization problem: setting prices for services in multiclass loss networks. Such networks consist of a set of resources shared by multiple classes of users characterized by their usage patterns. The network sets a price per call for each class, and users are assumed to be sensitive to prices, in the sense that prices affect the arrival process. The algorithms developed here are applied to this problem. Their tracking ability is illustrated by scenarios in which the service time parameters change smoothly, or infrequently, over time.
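To make the pricing objective concrete, the following is a minimal sketch for the simplest possible case: one class of calls sharing one link. It is not the algorithm of the dissertation, which is simulation-based, online, and designed for unknown parameters. Here every parameter is assumed known, the average revenue is computed in closed form with the Erlang B formula, and the price is tuned by plain finite-difference gradient ascent. The demand curve `arrival_rate`, all numeric defaults, and all function names are hypothetical choices made only for illustration.

```python
import math


def erlang_b(offered_load: float, capacity: int) -> float:
    """Blocking probability of an M/M/C/C loss system (Erlang B recursion)."""
    blocking = 1.0  # with zero circuits every call is blocked
    for k in range(1, capacity + 1):
        blocking = offered_load * blocking / (k + offered_load * blocking)
    return blocking


def arrival_rate(price: float, base_rate: float = 10.0,
                 sensitivity: float = 0.5) -> float:
    """Hypothetical price-sensitive demand: arrivals decay exponentially in price."""
    return base_rate * math.exp(-sensitivity * price)


def average_revenue(price: float, capacity: int = 5,
                    service_rate: float = 1.0) -> float:
    """Long-run revenue per unit time: price times the rate of accepted calls."""
    lam = arrival_rate(price)
    blocking = erlang_b(lam / service_rate, capacity)
    return price * lam * (1.0 - blocking)


def optimize_price(price: float = 1.0, step: float = 0.05,
                   iters: int = 2000, h: float = 1e-4) -> float:
    """Gradient ascent on average_revenue using a central finite difference."""
    for _ in range(iters):
        grad = (average_revenue(price + h) - average_revenue(price - h)) / (2 * h)
        price += step * grad
    return price


if __name__ == "__main__":
    best = optimize_price()
    print(f"price = {best:.3f}, revenue rate = {average_revenue(best):.3f}")
```

In the setting described in the abstract, the closed-form `average_revenue` would not be available, because some system parameters are unknown; the gradient would instead have to be estimated from observed trajectories of the process.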

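Bibliographic record

Author: Campos-Nanez, Enrique
Author affiliation: University of Virginia
Degree-granting institution: University of Virginia
Subjects: Engineering System Science; Operations Research; Computer Science
Degree: Ph.D.
Year: 2003
Pages: 148
Original format: PDF
Language: English
Chinese Library Classification: Systems science; operations research; automation and computer technology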

Similar literature

1. Online adaptive semi-supervised learning algorithm based on random-weight neural networks and its application to product quality evaluation in industrial processes [J]. 代伟, 胡金成, 程玉虎. Journal of Central South University (English Edition). 2019, no. 12
2. Decentralized Algorithms for Adaptive Pricing in Multiclass Loss Networks [J]. Campos-Nanez E. IEEE/ACM Transactions on Networking. 2010, no. 3
3. Online Network Optimization Using Product-Form Markov Processes [J]. Jaron Sanders, Sem C. Borst, Johan S. H. van Leeuwaarden. IEEE Transactions on Automatic Control. 2016, no. 7
4. State Classification and Multiclass Optimization of Continuous-Time and Continuous-State Markov Processes [J]. Xi-Ren Cao. IEEE Transactions on Automatic Control. 2019, no. 9
5. Adaptive Optimization of Markov Reward Processes [C]. Enrique Campos-Nanez, Stephen D. Patek, The Institute of Electrical and Electronics Engineers Inc. IEEE Conference on Decision and Control. 2005
6. Applications of spanning trees to continuous-time Markov processes, with emphasis on loss systems [D]. McNamara, Richard C. 2004
7. Factorized time-dependent distributions for certain multiclass queueing networks and an application to enzymatic processing networks [O]. W.H. Mather, J. Hasty, L.S. Tsimring
8. An algorithm for multiobjective Markov decision processes: Discounted reward case (Optimization Theory and its Applications in Mathematical Systems) [O]. 涌田和芳. 1995
9. Transient Analysis and Applications of Markov Reward Processes [R]. Sipe, J. A. 2003
