tmpmodelsave/qw_self_corr_dpo_correctness_iter5_new • Text Generation • 8B • Updated • 4
tmpmodelsave/qw_self_corr_dpo_correctness_iter4_new • Text Generation • 8B • Updated • 6
tmpmodelsave/qw_self_corr_dpo_correctness_iter3_new • Text Generation • 8B • Updated • 8
tmpmodelsave/qw_self_corr_dpo_correctness_iter2_new • Text Generation • 8B • Updated • 6
tmpmodelsave/qw_self_corr_dpo_correctness_iter1 • Text Generation • 8B • Updated • 6
tmpmodelsave/qw_self_corr_dpo_correctness_iter2 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo180 • Text Generation • 8B • Updated • 7
tmpmodelsave/qwen_qwq_warmup_ppo170 • Text Generation • 8B • Updated • 4
tmpmodelsave/qwen_qwq_warmup_ppo160 • Text Generation • 8B • Updated • 8
tmpmodelsave/qwen_qwq_warmup_ppo150 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo140 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo130 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo120 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo110 • Text Generation • 8B • Updated • 6
tmpmodelsave/qwen_qwq_warmup_ppo100 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo90 • Text Generation • 8B • Updated • 6
tmpmodelsave/qwen_qwq_warmup_ppo80 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo70 • Text Generation • 8B • Updated • 4
tmpmodelsave/qwen_qwq_warmup_ppo60 • Text Generation • 8B • Updated • 6
tmpmodelsave/qwen_qwq_warmup_ppo50 • Text Generation • 8B • Updated • 6
tmpmodelsave/qwen_qwq_warmup_ppo40 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo30 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo20 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_ppo10 • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_qwq_warmup_dpo_iter5_trainonhard • Text Generation • 8B • Updated • 5
tmpmodelsave/qwen_self_corr_warmup_ep3 • Text Generation • 8B • Updated • 3
tmpmodelsave/qwen_self_corr_warmup_ep2 • Text Generation • 8B • Updated • 4
tmpmodelsave/qwen_self_corr_warmup_ep1 • Text Generation • 8B • Updated • 5
tmpmodelsave/raft_iter3_new_script_deletebx3_and_python_mix_iter1 • Text Generation • 8B • Updated • 5
tmpmodelsave/raft_iter1_qwq_warmup_1e5 • Text Generation • 8B • Updated • 5