Convergence Rate Papers - 专知

Convergence Rate

Stabilizing Policy Gradient Methods via Reward Profiling

Arxiv

0+ reads · Nov 20

DE-Sinc approximation for unilateral rapidly decreasing functions and its computational error bound

Arxiv

0+ reads · Nov 10

A New Initial Approximation Bound in the Durand Kerner Algorithm for Finding Polynomial Zeros

Arxiv

0+ reads · Nov 11

Why Does Stochastic Gradient Descent Slow Down in Low-Precision Training?

Arxiv

0+ reads · Nov 22

Achieving Pareto Optimality in Games via Single-bit Feedback

Arxiv

0+ reads · Nov 19

Muon is Provably Faster with Momentum Variance Reduction

Arxiv

0+ reads · Dec 18

U-REPA: Aligning Diffusion U-Nets to ViTs

Arxiv

0+ reads · Nov 24

FedQS: Optimizing Gradient and Model Aggregation for Semi-Asynchronous Federated Learning

Arxiv

0+ reads · Nov 25

Discontinuous Galerkin Methods with Generalized Numerical Fluxes for the Vlasov-Viscous Burgers' System

Arxiv

0+ reads · May 2, 2023

Analysis of the discrepancy principle for Tikhonov regularization under low order source conditions

Arxiv

0+ reads · May 1, 2023

End to End Lane detection with One-to-Several Transformer

Arxiv

0+ reads · May 1, 2023

Event Tables for Efficient Experience Replay

Arxiv

0+ reads · Apr 21, 2023

On the Effects of Data Heterogeneity on the Convergence Rates of Distributed Linear System Solvers

Arxiv

0+ reads · Apr 20, 2023

Preconditioned Gradient Descent for Overparameterized Nonconvex Burer–Monteiro Factorization with Global Optimality Certification

Arxiv

0+ reads · Apr 20, 2023

Approximate Shielding of Atari Agents for Safe Exploration

Arxiv

0+ reads · Apr 21, 2023
