UDE: 人类运动一代的统一驾驶引擎 (UDE: A Unified Driving Engine for Human Motion Generation)

Generating controllable and editable human motion sequences is a key challenge in 3D Avatar generation. It has been labor-intensive to generate and animate human motion for a long time until learning-based approaches have been developed and applied recently. However, these approaches are still task-specific or modality-specific\cite {ahuja2019language2pose}\cite{ghosh2021synthesis}\cite{ferreira2021learning}\cite{li2021ai}. In this paper, we propose ``UDE", the first unified driving engine that enables generating human motion sequences from natural language or audio sequences (see Fig.~\ref{fig:teaser}). Specifically, UDE consists of the following key components: 1) a motion quantization module based on VQVAE that represents continuous motion sequence as discrete latent code\cite{van2017neural}, 2) a modality-agnostic transformer encoder\cite{vaswani2017attention} that learns to map modality-aware driving signals to a joint space, and 3) a unified token transformer (GPT-like\cite{radford2019language}) network to predict the quantized latent code index in an auto-regressive manner. 4) a diffusion motion decoder that takes as input the motion tokens and decodes them into motion sequences with high diversity. We evaluate our method on HumanML3D\cite{Guo_2022_CVPR} and AIST++\cite{li2021learn} benchmarks, and the experiment results demonstrate our method achieves state-of-the-art performance. Project website: \url{https://github.com/zixiangzhou916/UDE/

翻译：在 3D Astatar 的 3D 產生 3D 產生控制可控和可编辑的人类動脈序列中, 3D 產生 3D 產生 3Affatar 產生 3D 產生和動動動的人類動脈序列中, 一直需要勞力密集的產生和動動動動動動動動動動動動動動作, 直到最近才被開發行。然而, 這些方法仍具特定工作或模式特定功能的 {ahuja2019 語系語系, 根據 VQVADE 代表连续的動動動動動動動動動動動動動動動序, 根據 20ODFL 20 數據解動變變變變變變數, 根據化為數據機動動動動動動動動動動動作、數據數據化機機路路路路路路、數化數化數數數數數數數、數化數數數數數化機化數據機路路路路路路路、數、數、數、數效變化機化機化數、數學路路路路運化路路路路路路路運化路路路、數路、數、數化數化數路路路運化路路路、數路路路路路路路路路路路路路路路路路路、數、數路路路路路路路路路路路路路路路路路路路路路、數、數、數、數路路路路路路路路路路路路路路路路路路運運化路路路路路路運運運運運運運運路路路路路、數路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路、數

相关内容

Engineering

关注 6

《工程》是中国工程院（CAE）于2015年推出的国际开放存取期刊。其目的是提供一个高水平的平台，传播和分享工程研发的前沿进展、当前主要研究成果和关键成果；报告工程科学的进展，讨论工程发展的热点、兴趣领域、挑战和前景，在工程中考虑人与环境的福祉和伦理道德，鼓励具有深远经济和社会意义的工程突破和创新，使之达到国际先进水平，成为新的生产力，从而改变世界，造福人类，创造新的未来。期刊链接：https://www.sciencedirect.com/journal/engineering

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日