RLLab

https://github.com

Recent Activity

JixuanLeng updated a model about 7 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-675

JixuanLeng published a model about 7 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-675

JixuanLeng updated a model about 7 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-300

Collections 4

Models 14

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-675

Text Generation • 7B • Updated about 7 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.0-step-300

Text Generation • 7B • Updated about 7 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.25-step-675

Text Generation • 7B • Updated about 16 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-0.25-step-520

Text Generation • 7B • Updated about 16 hours ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-1.0-step-675

Text Generation • 7B • Updated 1 day ago

RLLab/olmo-3-7b-it-sft-base-DPO-beta-5.0-nll-1.0-step-500

Text Generation • 7B • Updated 1 day ago • 138

RLLab/gemma-3-4b-it-dolci-sft

Text Generation • 4B • Updated 11 days ago • 9

RLLab/gemma-3-4b-it

Text Generation • 4B • Updated 12 days ago • 39

RLLab/gemma-3-4b-pt

Text Generation • 4B • Updated 13 days ago • 305

RLLab/gemma-3-1b-it

Text Generation • 1.0B • Updated 18 days ago • 20

Datasets 9

RLLab/Dolci-Instruct-DPO

Viewer • Updated 10 days ago • 203k • 60

RLLab/Dolci-Instruct-DPO-Generations

Viewer • Updated 10 days ago • 349k • 33

RLLab/Dolci-DPO-Generations

Viewer • Updated 15 days ago • 1.09M • 97

RLLab/Dolci-Instruct-DPO-Delta-Generations

Viewer • Updated 18 days ago • 3.18M • 114

RLLab/Dolci-Instruct-SFT-NoFuncCalls

Viewer • Updated 26 days ago • 1.92M • 72

RLLab/cve-dpo-4b

Viewer • Updated Dec 10, 2025 • 32k • 12

RLLab/cve-all

Viewer • Updated Dec 10, 2025 • 16.7k • 6

RLLab/math-rl

Viewer • Updated Nov 25, 2025 • 57.5k • 73

RLLab/eval-set

Viewer • Updated Oct 27, 2025 • 12.4k • 78