改进的均值流：论快速前向生成模型面临的挑战 (Improved Mean Flows: On the Challenges of Fastforward Generative Models)

MeanFlow (MF) has recently been established as a framework for one-step generative modeling. However, its ``fastforward'' nature introduces key challenges in both the training objective and the guidance mechanism. First, the original MF's training target depends not only on the underlying ground-truth fields but also on the network itself. To address this issue, we recast the objective as a loss on the instantaneous velocity $v$, re-parameterized by a network that predicts the average velocity $u$. Our reformulation yields a more standard regression problem and improves the training stability. Second, the original MF fixes the classifier-free guidance scale during training, which sacrifices flexibility. We tackle this issue by formulating guidance as explicit conditioning variables, thereby retaining flexibility at test time. The diverse conditions are processed through in-context conditioning, which reduces model size and benefits performance. Overall, our $\textbf{improved MeanFlow}$ ($\textbf{iMF}$) method, trained entirely from scratch, achieves $\textbf{1.72}$ FID with a single function evaluation (1-NFE) on ImageNet 256$\times$256. iMF substantially outperforms prior methods of this kind and closes the gap with multi-step methods while using no distillation. We hope our work will further advance fastforward generative modeling as a stand-alone paradigm.

翻译：均值流（MF）最近已被确立为一步生成建模的框架。然而，其“快速前向”特性在训练目标和引导机制方面引入了关键挑战。首先，原始MF的训练目标不仅依赖于底层真实场，还依赖于网络本身。为解决此问题，我们将目标重新表述为关于瞬时速度$v$的损失，并通过预测平均速度$u$的网络进行重新参数化。我们的重构产生了一个更标准的回归问题，并提升了训练稳定性。其次，原始MF在训练期间固定了无分类器引导的尺度，这牺牲了灵活性。我们通过将引导公式化为显式的条件变量来解决此问题，从而在测试时保持灵活性。多样化的条件通过上下文条件处理，这减少了模型规模并有利于性能。总体而言，我们的$\\textbf{改进均值流}$（$\\textbf{iMF}$）方法完全从零开始训练，在ImageNet 256$\\times$256数据集上以单次函数评估（1-NFE）实现了$\\textbf{1.72}$的FID分数。iMF显著优于此类先前方法，并在不使用蒸馏技术的情况下缩小了与多步方法的差距。我们希望我们的工作能进一步推动快速前向生成建模作为一个独立范式的发展。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日