Program Synthesis for Robot Learning from Demonstrations - 专知论文

会员服务 ·

0

Learning · Performer · 正则表达式 · Alphabet · 机器人 ·

2023 年 5 月 4 日

Program Synthesis for Robot Learning from Demonstrations

翻译：暂无翻译

Noah Patton,Kia Rahmani,Meghana Missula,Joydeep Biswas,Işil Dillig

from arxiv, 31 Pages, Submitted for Review

This paper presents a new synthesis-based approach for solving the Learning from Demonstration (LfD) problem in robotics. Given a set of user demonstrations, the goal of programmatic LfD is to learn a policy in a programming language that can be used to control a robot's behavior. We address this problem through a novel program synthesis algorithm that leverages two key ideas: First, to perform fast and effective generalization from user demonstrations, our synthesis algorithm views these demonstrations as strings over a finite alphabet and abstracts programs in our DSL as regular expressions over the same alphabet. This regex abstraction facilitates synthesis by helping infer useful program sketches and pruning infeasible parts of the search space. Second, to deal with the large number of object types in the environment, our method leverages a Large Language Model (LLM) to guide search. We have implemented our approach in a tool called Prolex and present the results of a comprehensive experimental evaluation on 120 benchmarks involving 40 unique tasks in three different environments. We show that, given a 120 second time limit, Prolex can find a program consistent with the demonstrations in 80% of the cases. Furthermore, for 81% of the tasks for which a solution is returned, Prolex is able to find the ground truth program with just one demonstration. To put these results in perspective, we conduct a comparison against two baselines and show that both perform much worse.

翻译：暂无翻译

0

相关内容

Learning

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

铁基双金属/石墨烯的制备及其吸附与可见光Fenton降解染料的性能和机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

极硬纳米孪晶氮化硼的高压合成及其性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

有机物/磷酸盐多组分电解质水溶液与铁系氧化物的表面络合反应机制和模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

湿式烟气循环氧/燃料燃烧方式下超细颗粒物和典型重金属的排放机理

国家自然科学基金

0+阅读 · 2012年12月31日

2012年全国复分析会议

国家自然科学基金

0+阅读 · 2012年6月18日

Probabilistic matching of real and generated data statistics in generative adversarial networks

Arxiv

0+阅读 · 2023年6月19日

Language to Rewards for Robotic Skill Synthesis

Arxiv

0+阅读 · 2023年6月16日

The Principles of Deep Learning Theory

Arxiv

66+阅读 · 2021年6月18日

Image-to-Image Retrieval by Learning Similarity between Scene Graphs

Arxiv

21+阅读 · 2020年12月29日

A Wholistic View of Continual Learning with Deep Neural Networks: Forgotten Lessons and the Bridge to Active and Open World Learning

Arxiv

36+阅读 · 2020年9月3日

VIP会员

文章信息

相关主题

正则表达式

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Probabilistic matching of real and generated data statistics in generative adversarial networks

Arxiv

0+阅读 · 2023年6月19日

Language to Rewards for Robotic Skill Synthesis

Arxiv

0+阅读 · 2023年6月16日

The Principles of Deep Learning Theory

Arxiv

66+阅读 · 2021年6月18日

Image-to-Image Retrieval by Learning Similarity between Scene Graphs

Arxiv

21+阅读 · 2020年12月29日

A Wholistic View of Continual Learning with Deep Neural Networks: Forgotten Lessons and the Bridge to Active and Open World Learning

Arxiv

36+阅读 · 2020年9月3日

相关基金

铁基双金属/石墨烯的制备及其吸附与可见光Fenton降解染料的性能和机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

极硬纳米孪晶氮化硼的高压合成及其性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

有机物/磷酸盐多组分电解质水溶液与铁系氧化物的表面络合反应机制和模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

湿式烟气循环氧/燃料燃烧方式下超细颗粒物和典型重金属的排放机理

国家自然科学基金

0+阅读 · 2012年12月31日

2012年全国复分析会议

国家自然科学基金

0+阅读 · 2012年6月18日

微信扫码咨询专知VIP会员