Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dongguanting
's Collections
AEPO
ARPO
Tool-Star
RAG-Critic
AEPO
updated
15 days ago
The official datasets and model checkpoints of AEPO
Upvote
4
Agentic Entropy-Balanced Policy Optimization
Paper
•
2510.14545
•
Published
Oct 16, 2025
•
104
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation
•
8B
•
Updated
15 days ago
•
23
•
2
dongguanting/QwQ-32B-AEPO-DeepSearch
Text Generation
•
33B
•
Updated
15 days ago
•
13
•
1
dongguanting/Qwen3-14B-AEPO-DeepSearch
Robotics
•
15B
•
Updated
Oct 21, 2025
•
10
•
1
dongguanting/Qwen2.5-7B-AEPO
Text Generation
•
8B
•
Updated
Oct 27, 2025
•
16
•
1
Upvote
4
Share collection
View history
Collection guide
Browse collections