Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Project of MoE reward model

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

zhuokai  authored a paper 1 day ago
Preference Optimization with Multi-Sample Comparisons
zhuokai  authored a paper 1 day ago
Token-Level LLM Collaboration via FusionRoute
zyhang1998  authored a paper 4 days ago
Token-Level LLM Collaboration via FusionRoute
View all activity

Zhuokai Zhao's profile picture Shengyi Qian's profile picture Yuhang Zhou's profile picture Xiaoyu Liu's profile picture Jing Zhu's profile picture wave's profile picture

MoeReward 's models 6

MoeReward/rl_checkpoints

Updated Jun 27, 2025

MoeReward/lora_checkpoint

Updated Mar 30, 2025

MoeReward/reward_lora_qwen_1_5_base

Updated Mar 21, 2025 • 1

MoeReward/reward_qwen_1_5

14B • Updated Mar 17, 2025 • 3

MoeReward/reward_lora_qwen_1_5

Updated Mar 17, 2025 • 2

MoeReward/sft_full_param_qwen_1_5

14B • Updated Mar 16, 2025 • 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs