Runtime error Featured 2.95k The Smol Training Playbook ๐ 2.95k The secrets to building world-class LLMs
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 โข 92
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper โข 2508.09834 โข Published Aug 13, 2025 โข 53
Running 3.67k The Ultra-Scale Playbook ๐ 3.67k The ultimate guide to training LLM on large GPU Clusters