Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published 4 days ago • 184
Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation Paper • 2601.20614 • Published 17 days ago • 118
Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models Paper • 2601.20354 • Published 17 days ago • 110
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 30 days ago • 155
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published Jan 8 • 166
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation Paper • 2512.24271 • Published Dec 30, 2025 • 63
Region-Constraint In-Context Generation for Instructional Video Editing Paper • 2512.17650 • Published Dec 19, 2025 • 51
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation Paper • 2508.07981 • Published Aug 11, 2025 • 63
MotionPro: A Precise Motion Controller for Image-to-Video Generation Paper • 2505.20287 • Published May 26, 2025 • 20