An Option-Dependent Analysis of Regret Minimization Algorithms in Finite-Horizon Semi-Markov Decision Processes


May 10, 2023


Gianluca Drappo, Alberto Maria Metelli, Marcello Restelli

Many real-world Reinforcement Learning (RL) tasks have a complex and heterogeneous structure that makes end-to-end (or flat) approaches hard to apply or even infeasible. Hierarchical Reinforcement Learning (HRL) addresses these problems through a multi-level decomposition of the task, which makes them easier to solve. Although HRL is often used in practice, few works provide theoretical guarantees that justify this benefit. It is therefore not yet clear when such approaches should be preferred to standard flat ones. In this work, we provide an option-dependent upper bound on the regret suffered by regret minimization algorithms in finite-horizon problems. We show that the performance improvement derives from the reduction of the planning horizon induced by the temporal abstraction that the hierarchical structure enforces. Then, focusing on a specific HRL setting, the options framework, we highlight how the average duration of the available options affects the planning horizon and, consequently, the regret itself. Finally, we relax the assumption of pre-trained options and show that, in particular situations, learning hierarchically from scratch could be preferable to a standard approach.
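The link between option duration and planning horizon can be illustrated with a small counting sketch. This is not the paper's bound or algorithm: it only counts how many option selections are needed to cover a horizon of H primitive steps when options have fixed, known durations. The function name `decision_points` and the cyclic choice of options are assumptions made purely for illustration.

```python
def decision_points(horizon: int, durations: list[int]) -> int:
    """Count the option selections needed to cover `horizon` primitive steps.

    Hypothetical helper for illustration only: options are picked
    cyclically from `durations`, and the last option is cut off when
    the horizon ends.
    """
    if horizon < 0:
        raise ValueError("horizon must be non-negative")
    if not durations or any(d <= 0 for d in durations):
        raise ValueError("durations must be a non-empty list of positive ints")

    elapsed = 0    # primitive time steps consumed so far
    decisions = 0  # number of option selections made so far
    while elapsed < horizon:
        elapsed += durations[decisions % len(durations)]
        decisions += 1
    return decisions


if __name__ == "__main__":
    H = 100
    # Duration 1 corresponds to a flat agent choosing a primitive action each step.
    for durs in ([1], [5], [4, 6], [20]):
        mean = sum(durs) / len(durs)
        print(f"durations={durs}, mean={mean:.1f}, decisions={decision_points(H, durs)}")
```

In this toy setting the number of decisions is roughly H divided by the average option duration, which is the sense in which longer options shorten the horizon the high-level learner has to plan over.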

