February 11, 2022

A PDE-Based Analysis of the Symmetric Two-Armed Bernoulli Bandit

Vladimir A. Kobzar, Robert V. Kohn

This work addresses a version of the two-armed Bernoulli bandit problem in which the means of the two arms sum to one (the symmetric two-armed Bernoulli bandit). In a regime where the gap between these means goes to zero and the number of prediction periods approaches infinity, we obtain the leading-order terms of the expected regret and pseudoregret for this problem by associating each of them with a solution of a linear parabolic partial differential equation. Our results improve upon previously known results; specifically, we explicitly compute the leading-order term of the optimal regret and pseudoregret in three different scaling regimes for the gap. Additionally, we obtain new non-asymptotic bounds for any given time horizon.
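To make the setting concrete, the following is a minimal simulation sketch of a symmetric two-armed Bernoulli bandit, in which the arm means are (1 + gap) / 2 and (1 - gap) / 2 and so sum to one. The myopic "follow the evidence" rule, the function names, and the parameters are assumptions chosen for illustration; they are not the policy or the PDE-based analysis of the paper, and the Monte Carlo average only estimates the expected pseudoregret of this one rule.

```python
import random


def simulate_pseudoregret(gap, horizon, rng):
    """One run of a symmetric two-armed Bernoulli bandit (hypothetical helper).

    Arm 0 has mean (1 + gap) / 2 and arm 1 has mean (1 - gap) / 2,
    so the means sum to one and differ by `gap` (assumed 0 <= gap <= 1).
    """
    means = ((1 + gap) / 2, (1 - gap) / 2)
    evidence = 0  # net evidence that arm 0 is the better arm
    pulls_of_worse_arm = 0
    for _ in range(horizon):
        # Illustrative myopic rule (an assumption, not the paper's policy):
        # play the arm favoured by the evidence, breaking ties at random.
        if evidence > 0:
            arm = 0
        elif evidence < 0:
            arm = 1
        else:
            arm = 0 if rng.random() < 0.5 else 1
        reward = 1 if rng.random() < means[arm] else 0
        # By symmetry, a success on one arm is as informative
        # as a failure on the other arm.
        step = 1 if reward == 1 else -1
        evidence += step if arm == 0 else -step
        if arm == 1:
            pulls_of_worse_arm += 1
    # Each pull of the worse arm costs `gap` in expected reward.
    return gap * pulls_of_worse_arm


def mean_pseudoregret(gap, horizon, runs=2000, seed=0):
    """Monte Carlo estimate of the expected pseudoregret of the rule above."""
    rng = random.Random(seed)
    total = sum(simulate_pseudoregret(gap, horizon, rng) for _ in range(runs))
    return total / runs
```

Calling `mean_pseudoregret(gap, horizon)` for a shrinking `gap` and a growing `horizon` gives a rough empirical view of the small-gap, long-horizon regime that the abstract describes; how the gap should scale with the horizon is left to the caller here.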

