deepseek-ai/DeepSeek-R1-0528 • Text Generation • 685B • Updated May 29, 2025 • 340k • 2.39k
The Ultra-Scale Playbook • Running • 3.62k • The ultimate guide to training LLMs on large GPU clusters
ISTA-DASLab/Meta-Llama-3.1-70B-Instruct-AQLM-PV-2Bit-1x16 • Text Generation • 11B • Updated Sep 17, 2024 • 329 • 46