Batch Size论文 - 专知

会员服务 ·

Batch Size

Hybrid Dual-Batch and Cyclic Progressive Learning for Efficient Distributed Training

Arxiv

0+阅读 · 10月31日

Faster and Memory-Efficient Training of Sequential Recommendation Models for Large Catalogs

Arxiv

0+阅读 · 10月24日

Seesaw: Accelerating Training by Balancing Learning Rate and Batch Size Scheduling

Arxiv

0+阅读 · 10月16日

Faster Convergence of Riemannian Stochastic Gradient Descent with Increasing Batch Size

Arxiv

0+阅读 · 10月13日

Adaptive Execution Scheduler for DataDios SmartDiff

Arxiv

0+阅读 · 10月9日

GATO: GPU-Accelerated and Batched Trajectory Optimization for Scalable Edge Model Predictive Control

Arxiv

0+阅读 · 10月8日

DYNAMIX: RL-based Adaptive Batch Size Optimization in Distributed Machine Learning Systems

DYNAMIX: RL-based Adaptive Batch Size Optimization in Distributed Machine Learning Systems

Arxiv

0+阅读 · 10月9日

Efficient Distributed Training via Dual Batch Sizes and Cyclic Progressive Learning

Arxiv

0+阅读 · 9月30日

Faster Convergence of Riemannian Stochastic Gradient Descent with Increasing Batch Size

Arxiv

0+阅读 · 9月27日

SparseServe: Unlocking Parallelism for Dynamic Sparse Attention in Long-Context LLM Serving

Arxiv

0+阅读 · 9月29日

Fisher-Orthogonal Projection Methods for Natural Gradient Descent with Large Batches

Arxiv

0+阅读 · 8月19日

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding

Arxiv

0+阅读 · 4月2日

Batch List-Decodable Linear Regression via Higher Moments

Arxiv

0+阅读 · 3月12日

BatchGEMBA: Token-Efficient Machine Translation Evaluation with Batched Prompting and Prompt Compression

Arxiv

0+阅读 · 3月4日

Updating Graph-based Index with Fine-grained Blocks for Large-scale Streaming High-dimensional Vectors

Arxiv

0+阅读 · 3月1日

参考链接

微信扫码咨询专知VIP会员