TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published 16 days ago • 90
Post 6131 Introducing Anim Lab AI. My submission for the MCP 1st Birthday Hackathon. Turn any math concept or logic into a clear video explanation instantly using AI. Try it now: MCP-1st-Birthday/anim-lab-ai. Demo outputs are attached.
Article LeRobot v0.4.0: Comprehensively Improving the Learning Capabilities of Open-Source Robots +7 Oct 24, 2025 • 12
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Paper • 2506.03143 • Published Jun 3, 2025 • 53
GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding Paper • 2511.00810 • Published Nov 2, 2025 • 3
Running • 215 • FineVision: Open Data is All You Need • A new open-source dataset for training VLMs
UI Agent Collection a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics • 438 items • Updated 18 days ago • 66
Article A failed experiment: Infini-Attention, and why we should keep trying? +1 Aug 14, 2024 • 73
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming Paper • 2408.16725 • Published Aug 29, 2024 • 53