Co-GRPO: Co-Optimized Group Relative Policy Optimization for Masked Diffusion Model Paper • 2512.22288 • Published 10 days ago • 1
MemoryVLA Collection Checkpoints, data and logs of MemoryVLA & MemoryVLA+. https://github.com/shihao1895/MemoryVLA • 20 items • Updated 6 days ago • 7
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance Paper • 2509.26231 • Published Sep 30, 2025 • 17
LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS Paper • 2507.07136 • Published Jul 9, 2025 • 39
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2, 2025 • 187
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published Apr 18, 2025 • 139
CODA: Repurposing Continuous VAEs for Discrete Tokenization Paper • 2503.17760 • Published Mar 22, 2025 • 4