AI & ML interests
None defined yet.
Recent Activity
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-1.0-step-500
Text Generation • 7B • Updated • 138
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-1.0-step-675
Text Generation • 7B • Updated
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.25-step-520
Text Generation • 7B • Updated
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.25-step-675
Text Generation • 7B • Updated
models
14
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-675
Text Generation • 7B • Updated
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-300
Text Generation • 7B • Updated
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.25-step-675
Text Generation • 7B • Updated
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.25-step-520
Text Generation • 7B • Updated
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-1.0-step-675
Text Generation • 7B • Updated
RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-1.0-step-500
Text Generation • 7B • Updated • 138
RLLab/gemma-3-4b-it-dolci-sft
Text Generation • 4B • Updated • 9
RLLab/gemma-3-4b-it
Text Generation • 4B • Updated • 39
RLLab/gemma-3-4b-pt
Text Generation • 4B • Updated • 305
RLLab/gemma-3-1b-it
Text Generation • 1.0B • Updated • 20
datasets
9
RLLab/Dolci-Instruct-DPO
Viewer • Updated • 203k • 60
RLLab/Dolci-Instruct-DPO-Generations
Viewer • Updated • 349k • 33
RLLab/Dolci-DPO-Generations
Viewer • Updated • 1.09M • 97
RLLab/Dolci-Instruct-DPO-Delta-Generations
Viewer • Updated • 3.18M • 114
RLLab/Dolci-Instruct-SFT-NoFuncCalls
Viewer • Updated • 1.92M • 72
RLLab/cve-dpo-4b
Viewer • Updated • 32k • 12
RLLab/cve-all
Viewer • Updated • 16.7k • 6
RLLab/math-rl
Viewer • Updated • 57.5k • 73
RLLab/eval-set
Viewer • Updated • 12.4k • 78