AEPO - a dongguanting Collection

dongguanting 's Collections

AEPO

ARPO

AEPO

updated 15 days ago

The official datasets and model checkpoints of AEPO

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 104
dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated 15 days ago • 23 • 2
dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated 15 days ago • 13 • 1
dongguanting/Qwen3-14B-AEPO-DeepSearch

Robotics • 15B • Updated Oct 21, 2025 • 10 • 1
dongguanting/Qwen2.5-7B-AEPO

Text Generation • 8B • Updated Oct 27, 2025 • 16 • 1