Running on CPU Upgrade Featured 2.79k The Smol Training Playbook 📚 2.79k The secrets to building world-class LLMs
Running 3.62k The Ultra-Scale Playbook 🌌 3.62k The ultimate guide to training LLM on large GPU Clusters
Open-Reasoner-Zero/Open-Reasoner-Zero-7B Reinforcement Learning • 8B • Updated Apr 7, 2025 • 111 • 33
Open-Reasoner-Zero/Open-Reasoner-Zero-32B Reinforcement Learning • 33B • Updated Apr 7, 2025 • 22 • 33
unsloth/DeepSeek-R1-Distill-Qwen-32B-bnb-4bit Text Generation • 18B • Updated Feb 14, 2025 • 4.39k • 29
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4 Reinforcement Learning • 8B • Updated Mar 26, 2025 • 1.43k • 227
ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 Reinforcement Learning • 15B • Updated Feb 13, 2025 • 2.32k • 816