SpotEdit: Selective Region Editing in Diffusion Transformers Paper • 2512.22323 • Published 6 days ago • 32
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published 3 days ago • 83
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published 6 days ago • 18
ProEdit: Inversion-based Editing From Prompts Done Right Paper • 2512.22118 • Published 6 days ago • 15
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper • 2512.17504 • Published 13 days ago • 92
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models Paper • 2512.19526 • Published 10 days ago • 10
Scaling Laws for Code: Every Programming Language Matters Paper • 2512.13472 • Published 17 days ago • 9
FaithLens: Detecting and Explaining Faithfulness Hallucination Paper • 2512.20182 • Published 9 days ago • 8
WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion Paper • 2512.19678 • Published 10 days ago • 29
Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience Paper • 2512.17260 • Published 13 days ago • 48
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published 15 days ago • 56
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 17 days ago • 72
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 17 days ago • 103
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos Paper • 2512.10881 • Published 21 days ago • 29