Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights Paper • 2512.01816 • Published Dec 1, 2025 • 88
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 279
The Smol Training Playbook 📚 Space • The secrets to building world-class LLMs • 2.76k
deepseek-ai/DeepSeek-V3.2-Exp Text Generation • 685B • Updated Nov 18, 2025 • 74.6k • 930
Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution Paper • 2509.21072 • Published Sep 25, 2025 • 15
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published Sep 9, 2025 • 101
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B Text Generation • 2B • Updated Nov 21, 2025 • 2k • 234
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 259
GPT-OSS-120B on AMD MI300X 💻 Space • gpt-oss-120b on AMD MI300X GPUs • 329