Openpi Comet：2025年BEHAVIOR挑战赛竞赛解决方案 (Openpi Comet: Competition Solution For 2025 BEHAVIOR Challenge)

Junjie Bai,Yu-Wei Chao,Qizhi Chen,Jinwei Gu,Moo Jin Kim,Zhaoshuo Li,Xuan Li,Tsung-Yi Lin,Ming-Yu Liu,Nic Ma,Kaichun Mo,Delin Qu,Shangkun Sun,Hongchi Xia,Fangyin Wei,Xiaohui Zeng

from arxiv, preprint

The 2025 BEHAVIOR Challenge is designed to rigorously track progress toward solving long-horizon tasks by physical agents in simulated environments. BEHAVIOR-1K focuses on everyday household tasks that people most want robots to assist with and these tasks introduce long-horizon mobile manipulation challenges in realistic settings, bridging the gap between current research and real-world, human-centric applications. This report presents our solution to the 2025 BEHAVIOR Challenge in a very close 2nd place and substantially outperforms the rest of the submissions. Building on $π_{0.5}$, we focus on systematically building our solution by studying the effects of training techniques and data. Through careful ablations, we show the scaling power in pre-training and post-training phases for competitive performance. We summarize our practical lessons and design recommendations that we hope will provide actionable insights for the broader embodied AI community when adapting powerful foundation models to complex embodied scenarios.

翻译：2025年BEHAVIOR挑战赛旨在严格追踪物理智能体在仿真环境中解决长时程任务方面的进展。BEHAVIOR-1K专注于人们最期望机器人协助的日常家庭任务，这些任务在真实场景中引入了长时程移动操作挑战，从而弥合了当前研究与现实世界、以人为中心的应用之间的差距。本报告介绍了我们在2025年BEHAVIOR挑战赛中荣获非常接近的第二名的解决方案，其性能显著优于其他提交方案。基于$π_{0.5}$，我们通过系统研究训练技术和数据的影响来构建解决方案。通过细致的消融实验，我们展示了在预训练和后训练阶段实现竞争性性能的扩展能力。我们总结了实践经验和设计建议，希望为更广泛的具身智能社区在将强大的基础模型适配到复杂具身场景时提供可操作的见解。