EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI

Generative modeling has recently shown remarkable promise for visuomotor policy learning, enabling flexible and expressive control across diverse embodied AI tasks. However, existing generative policies often struggle with data inefficiency, requiring large-scale demonstrations, and sampling inefficiency, incurring slow action generation during inference. We introduce EfficientFlow, a unified framework for efficient embodied AI with flow-based policy learning. To enhance data efficiency, we bring equivariance into flow matching. We theoretically prove that when using an isotropic Gaussian prior and an equivariant velocity prediction network, the resulting action distribution remains equivariant, leading to improved generalization and substantially reduced data demands. To accelerate sampling, we propose a novel acceleration regularization strategy. As direct computation of acceleration is intractable for marginal flow trajectories, we derive a novel surrogate loss that enables stable and scalable training using only conditional trajectories. Across a wide range of robotic manipulation benchmarks, the proposed algorithm achieves competitive or superior performance under limited data while offering dramatically faster inference. These results highlight EfficientFlow as a powerful and efficient paradigm for high-performance embodied AI.
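The equivariance claim can be checked numerically on a toy problem. The sketch below is only an illustration under stated assumptions, not the authors' network or training code: the 2D action space, the rotation group SO(2), the hand-written `velocity` field, and the Euler sampler `sample_action` are all hypothetical. It shows that when the velocity field satisfies v(Rx, t, Ro) = R v(x, t, o), integrating from a rotated noise sample under a rotated observation yields the rotated action. Because an isotropic Gaussian prior is unchanged by rotation, this per-sample property is what carries over to the action distribution.

```python
import numpy as np

def rotation(theta):
    """2D rotation matrix for angle theta (radians)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def velocity(x, t, obs):
    """Hypothetical toy velocity field, equivariant under SO(2).

    It combines the vectors x and obs using only rotation-invariant
    scalars (inner product, norm, time), so rotating both inputs
    rotates the output by the same matrix.
    """
    gate = 1.0 + np.tanh(x @ obs)          # invariant scalar
    return gate * (obs - x) + t * np.linalg.norm(x) * obs

def sample_action(x0, obs, steps=10):
    """Integrate the flow from noise x0 at t=0 to t=1 with Euler steps."""
    x = x0.copy()
    dt = 1.0 / steps
    for k in range(steps):
        x = x + dt * velocity(x, k * dt, obs)
    return x

rng = np.random.default_rng(0)
R = rotation(0.7)
obs = np.array([1.0, 0.5])                 # assumed 2D observation
x0 = rng.standard_normal(2)                # draw from isotropic Gaussian prior

action = sample_action(x0, obs)
action_rot = sample_action(R @ x0, R @ obs)

# Distance between "rotate then sample" and "sample then rotate".
equivariance_gap = float(np.linalg.norm(action_rot - R @ action))
print(equivariance_gap)
```

The printed gap should be at the level of floating-point round-off. The acceleration regularization and its surrogate loss are not sketched here, since the abstract does not give their form.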

