Q-based Equilibria - 专知论文

会员服务 ·

0

有偏 · Continuity · 估计/估计量 · 样例 · 人工智能 ·

2023 年 4 月 25 日

Q-based Equilibria

翻译：暂无翻译

from arxiv, 32 pages, 19 figures, 14 tables

In dynamic environments, Q-learning is an adaptative rule that provides an estimate (a Q-value) of the continuation value associated with each alternative. A naive policy consists in always choosing the alternative with highest Q-value. We consider a family of Q-based policy rules that may systematically favor some alternatives over others, for example rules that incorporate a leniency bias that favors cooperation. In the spirit of Compte and Postlewaite [2018], we look for equilibrium biases (or Qb-equilibria) within this family of Q-based rules. We examine classic games under various monitoring technologies.

翻译：暂无翻译

0

相关内容

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

基于刀具变形和工件亚表层质量预测模型的自由曲面超精密铣削加工运动规划

国家自然科学基金

0+阅读 · 2014年12月31日

NiTi-TiB2复合材料的原位合成及其结构功能特性研究

国家自然科学基金

0+阅读 · 2013年12月31日

晶间强化Ti3SiC2基复合材料摩擦学性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型氟化石墨烯基润滑材料的研制及其摩擦学行为

国家自然科学基金

0+阅读 · 2012年12月31日

立方（cubic)-TiB的合成、晶体结构与物理性能

国家自然科学基金

0+阅读 · 2011年12月31日

Computing Algorithm for an Equilibrium of the Generalized Stackelberg Game

Arxiv

0+阅读 · 2023年6月9日

A shape derivative approach to domain simplification

Arxiv

0+阅读 · 2023年6月8日

Steering No-Regret Learners to Optimal Equilibria

Arxiv

0+阅读 · 2023年6月8日

Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games

Arxiv

0+阅读 · 2023年6月8日

Blessings and Curses of Covariate Shifts: Adversarial Learning Dynamics, Directional Convergence, and Equilibria

Arxiv

0+阅读 · 2023年6月7日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型中的事件抽取：方法、模态与未来展望的全面综述

美海军作战管理系统：变革战场空间的二十年

【MIT博士论文】以语言为中心的医学影像理解

俄罗斯“沙希德”/“天竺葵”攻击无人机

相关资讯

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

相关论文

Computing Algorithm for an Equilibrium of the Generalized Stackelberg Game

Arxiv

0+阅读 · 2023年6月9日

A shape derivative approach to domain simplification

Arxiv

0+阅读 · 2023年6月8日

Steering No-Regret Learners to Optimal Equilibria

Arxiv

0+阅读 · 2023年6月8日

Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games

Arxiv

0+阅读 · 2023年6月8日

Blessings and Curses of Covariate Shifts: Adversarial Learning Dynamics, Directional Convergence, and Equilibria

Arxiv

0+阅读 · 2023年6月7日

相关基金

基于刀具变形和工件亚表层质量预测模型的自由曲面超精密铣削加工运动规划

国家自然科学基金

0+阅读 · 2014年12月31日

NiTi-TiB2复合材料的原位合成及其结构功能特性研究

国家自然科学基金

0+阅读 · 2013年12月31日

晶间强化Ti3SiC2基复合材料摩擦学性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型氟化石墨烯基润滑材料的研制及其摩擦学行为

国家自然科学基金

0+阅读 · 2012年12月31日

立方（cubic)-TiB的合成、晶体结构与物理性能

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员