AI & ML interests
None defined yet.
tmpmodelsave/qwen7bmath_base_ppo_45tmp0
Viewer
•
Updated
•
5k
•
3
tmpmodelsave/qwen7bmath_base_ppo_40tmp0
Viewer
•
Updated
•
5k
•
3
tmpmodelsave/qwen7bmath_base_ppo_35tmp0
Viewer
•
Updated
•
5k
•
4
tmpmodelsave/qwen7bmath_base_ppo_30tmp0
Viewer
•
Updated
•
5k
•
4
tmpmodelsave/qwen7bmath_base_ppo_25tmp0
Viewer
•
Updated
•
5k
•
3
tmpmodelsave/qwen7bmath_base_ppo_20tmp0
Viewer
•
Updated
•
5k
•
6
tmpmodelsave/qwen7bmath_base_ppo_15tmp0
Viewer
•
Updated
•
5k
•
4
tmpmodelsave/qwen7bmath_base_ppo_10tmp0
Viewer
•
Updated
•
5k
•
5
tmpmodelsave/qwen7bmath_base_ppo_5tmp0
Viewer
•
Updated
•
5k
•
4
tmpmodelsave/qw_external_orm_tmp07
Viewer
•
Updated
•
20k
•
4
tmpmodelsave/qw_external_orm_tmp10
Viewer
•
Updated
•
20k
•
4
tmpmodelsave/dpo_math_gen1
Viewer
•
Updated
•
7.5k
•
3
tmpmodelsave/dpo_augmath_gen1
Viewer
•
Updated
•
10k
•
4
tmpmodelsave/qwen25_math_7b_base_math_test
Viewer
•
Updated
•
5k
•
3
tmpmodelsave/qwen2_7b_it_math_turn2
Viewer
•
Updated
•
5k
•
9
tmpmodelsave/qwen2_7b_it_math_turn1
Viewer
•
Updated
•
5k
•
3
tmpmodelsave/distill_exp_0kc2ctmp07
Viewer
•
Updated
•
5k
•
3
tmpmodelsave/distill_exp_0kc2ctmp10
Viewer
•
Updated
•
15k
•
3
tmpmodelsave/distill_exptmp07
Viewer
•
Updated
•
5k
•
7
tmpmodelsave/distill_exptmp10
Viewer
•
Updated
•
15k
•
2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_100tmp07_vllmexp3
Viewer
•
Updated
•
15k
•
6
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_550tmp10_vllmexp3
Viewer
•
Updated
•
15k
•
4
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_500tmp10_vllmexp3
Viewer
•
Updated
•
15k
•
3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_450tmp10_vllmexp3
Viewer
•
Updated
•
15k
•
3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_400tmp10_vllmexp3
Viewer
•
Updated
•
15k
•
2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_350tmp10_vllmexp3
Viewer
•
Updated
•
15k
•
3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_300tmp10_vllmexp3
Viewer
•
Updated
•
15k
•
3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_250tmp10_vllmexp3
Viewer
•
Updated
•
15k
•
4
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_200tmp10_vllmexp3
Viewer
•
Updated
•
15k
•
3
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_100tmp10_vllmexp3
Viewer
•
Updated
•
15k
•
3