togethercomputer/Aurora-Spec-Qwen3-Coder-Next-FP8 Text Generation • 0.5B • Updated 1 day ago • 140 • 4
view article Article Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models Nov 19, 2025 • 34
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models Paper • 2504.10449 • Published Apr 14, 2025 • 15
M1 Collection M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449 • 9 items • Updated Jul 3, 2025
M1 Collection M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449 • 9 items • Updated Jul 3, 2025