Li Dong

unilm

AI & ML interests

Language Model Pre-Training

Recent Activity

liked a model 7 days ago

microsoft/VibeVoice-AcousticTokenizer

liked a model 10 days ago

kugelaudio/kugelaudio-0-open

updated a Space 18 days ago

microsoft/VibeVoice-ASR


upvoted a paper 19 days ago

VIBEVOICE-ASR Technical Report

Paper • 2601.18184 • Published 20 days ago • 20

upvoted a paper 23 days ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published 23 days ago • 84

upvoted an article 26 days ago

Differential Transformer V2

Article • 26 days ago

upvoted 2 papers 26 days ago

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Paper • 2601.08808 • Published Jan 13 • 39

Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts

Paper • 2510.23027 • Published Oct 27, 2025 • 1

upvoted a collection 2 months ago

VibeVoice Models

Collection • 3 items • Updated Dec 6, 2025 • 6

upvoted a collection 3 months ago

GAD-Models

Collection • Model checkpoints of Black-Box On-Policy Distillation of Large Language Models • 5 items • Updated Nov 17, 2025 • 6

upvoted a paper 3 months ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 52

upvoted 12 papers 4 months ago

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30, 2025 • 29

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24, 2025 • 62

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 115
