翻译标题：高效的蒙特卡罗树搜索物体操作规划翻译摘要：本文提出了一种高效的物体操作规划方法，使用蒙特卡罗树搜索（MCTS）来找到接触序列以及一种高效的基于ADMM的轨迹优化算法来评估候选接触序列的动态可行性。为了加速MCTS，我们提出了一种学习目标条件策略-价值网络以指导搜索方向有前途的节点的方法。此外，操作特定的启发式策略还能够大大减少搜索空间。在物理模拟器和实际硬件上进行了系统的物体操作实验，证明了我们方法的效率。特别是，通过学习的策略-价值网络，我们的方法在长序列的操作规划中表现出优越的扩展性，显著提高了规划成功率。 (Efficient Object Manipulation Planning with Monte Carlo Tree Search)

翻译：翻译标题：高效的蒙特卡罗树搜索物体操作规划翻译摘要：本文提出了一种高效的物体操作规划方法，使用蒙特卡罗树搜索（MCTS）来找到接触序列以及一种高效的基于ADMM的轨迹优化算法来评估候选接触序列的动态可行性。为了加速MCTS，我们提出了一种学习目标条件策略-价值网络以指导搜索方向有前途的节点的方法。此外，操作特定的启发式策略还能够大大减少搜索空间。在物理模拟器和实际硬件上进行了系统的物体操作实验，证明了我们方法的效率。特别是，通过学习的策略-价值网络，我们的方法在长序列的操作规划中表现出优越的扩展性，显著提高了规划成功率。

Huaijiang Zhu,Avadesh Meduri,Ludovic Righetti

This paper presents an efficient approach to object manipulation planning using Monte Carlo Tree Search (MCTS) to find contact sequences and an efficient ADMM-based trajectory optimization algorithm to evaluate the dynamic feasibility of candidate contact sequences. To accelerate MCTS, we propose a methodology to learn a goal-conditioned policy-value network to direct the search towards promising nodes. Further, manipulation-specific heuristics enable to drastically reduce the search space. Systematic object manipulation experiments in a physics simulator and on real hardware demonstrate the efficiency of our approach. In particular, our approach scales favorably for long manipulation sequences thanks to the learned policy-value network, significantly improving planning success rate.

翻译：