DAMO Academy

company

https://huggingface.co/alibaba-damo-academy

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

yifanpu001 updated a model 5 days ago

Alibaba-DAMO-Academy/T2I-Distill

yifanpu001 published a model 6 days ago

Alibaba-DAMO-Academy/T2I-Distill

SteveZeyuZhang authored a paper 6 days ago

DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

View all activity

Papers

Few-Step Distillation for Text-to-Image Generation: A Practical Guide

BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation

View all Papers

Articles

RynnEC: Bringing MLLMs into Embodied World

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

yifanpu001

updated a model 5 days ago

Alibaba-DAMO-Academy/T2I-Distill

Updated 5 days ago

yifanpu001

published a model 6 days ago

Alibaba-DAMO-Academy/T2I-Distill

Updated 5 days ago

SteveZeyuZhang

authored a paper 6 days ago

DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

Paper • 2510.15264 • Published Oct 17, 2025 • 2

yifanpu001

authored a paper 18 days ago

Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Paper • 2512.13006 • Published 21 days ago • 7

Jiasheng1110

submitted a paper to Daily Papers 19 days ago

Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Paper • 2512.13006 • Published 21 days ago • 7

jcenaa

submitted a paper to Daily Papers 20 days ago

VLSA: Vision-Language-Action Models with Plug-and-Play Safety Constraint Layer

Paper • 2512.11891 • Published 26 days ago • 8

SteveZeyuZhang

submitted a paper to Daily Papers 23 days ago

DragMesh: Interactive 3D Generation Made Easy

Paper • 2512.06424 • Published 29 days ago

JacobYuan

authored a paper 23 days ago

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Paper • 2512.09924 • Published 25 days ago • 3

JacobYuan

submitted a paper to Daily Papers 23 days ago

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Paper • 2512.09924 • Published 25 days ago • 3

huangsiteng

authored 11 papers 24 days ago

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Paper • 2503.22655 • Published Mar 28, 2025 • 39

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation

Paper • 2505.03912 • Published May 6, 2025 • 9

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Paper • 2505.12448 • Published May 18, 2025 • 10

VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

Paper • 2505.15791 • Published May 21, 2025 • 6

WorldVLA: Towards Autoregressive Action World Model

Paper • 2506.21539 • Published Jun 26, 2025 • 40

Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Paper • 2508.08896 • Published Aug 12, 2025 • 10

QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning

Paper • 2412.15576 • Published Dec 20, 2024

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11, 2025 • 243

Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation

Paper • 2508.19958 • Published Aug 27, 2025

High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting

Paper • 2510.10637 • Published Oct 12, 2025 • 12

HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models

Paper • 2512.09928 • Published 25 days ago • 11