
Policy Iteration for Learning an Exercise Policy for American Options



Abstract

Options are important financial instruments, whose prices are usually determined by computational methods. Computational finance is a compelling application area for reinforcement learning research, where hard sequential decision making problems abound and have great practical significance. In this paper, we investigate reinforcement learning methods, in particular, least squares policy iteration (LSPI), for the problem of learning an exercise policy for American options. We also investigate a method by Tsitsiklis and Van Roy, referred to as FQI. We compare LSPI and FQI with LSM, the standard least squares Monte Carlo method from the finance community. We evaluate their performance on both real and synthetic data. The results show that the exercise policies discovered by LSPI and FQI gain larger payoffs than those discovered by LSM, on both real and synthetic data. Our work shows that solution methods developed in reinforcement learning can advance the state of the art in an important and challenging application area, and demonstrates furthermore that computational finance remains an under-explored area for deployment of reinforcement learning methods.
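For orientation, the LSM baseline named in the abstract is the Longstaff-Schwartz least squares Monte Carlo method: simulate paths of the underlying, then step backward in time, regressing the continuation value on basis functions of the spot price and exercising wherever the immediate payoff exceeds it. The sketch below is a minimal illustration of that idea for an American put, not the paper's implementation; the geometric Brownian motion dynamics, the polynomial basis (1, S, S²), and all parameter values are illustrative assumptions.

```python
import numpy as np

def lsm_american_put(S0=36.0, K=40.0, r=0.06, sigma=0.2, T=1.0,
                     n_steps=50, n_paths=100_000, seed=0):
    """Price an American put with least squares Monte Carlo (LSM).

    All defaults are illustrative, not values from the paper.
    """
    rng = np.random.default_rng(seed)
    dt = T / n_steps
    disc = np.exp(-r * dt)

    # Simulate geometric Brownian motion paths of the underlying,
    # one column per exercise date from t = dt to t = T.
    z = rng.standard_normal((n_paths, n_steps))
    log_paths = np.cumsum((r - 0.5 * sigma**2) * dt
                          + sigma * np.sqrt(dt) * z, axis=1)
    S = S0 * np.exp(log_paths)                       # (n_paths, n_steps)

    # Cash flow if never exercised early: payoff at maturity.
    cash = np.maximum(K - S[:, -1], 0.0)

    # Backward induction: regress the continuation value on basis
    # functions of the spot price over in-the-money paths, and
    # exercise where the immediate payoff beats the regression fit.
    for t in range(n_steps - 2, -1, -1):
        cash *= disc                                 # discount one step
        itm = K - S[:, t] > 0.0
        if not itm.any():
            continue
        x = S[itm, t]
        basis = np.column_stack([np.ones_like(x), x, x**2])   # 1, S, S^2
        coef, *_ = np.linalg.lstsq(basis, cash[itm], rcond=None)
        continuation = basis @ coef
        exercise = np.maximum(K - x, 0.0)
        ex_now = exercise > continuation
        idx = np.where(itm)[0][ex_now]
        cash[idx] = exercise[ex_now]                 # lock in exercise value

    # Discount from the first exercise date back to time zero.
    return disc * cash.mean()

print(f"LSM American put estimate: {lsm_american_put():.4f}")
```

LSPI and FQI, the reinforcement learning methods compared in the paper, address the same exercise problem but learn a value function via policy iteration over simulated experience rather than the single backward regression pass shown here.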
