ky666 (k) – Likes

liked a Space 2 months ago

The Smol Training Playbook

📚

2.81k

The secrets to building world-class LLMs

liked a model 3 months ago

microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9, 2025 • 615 • 360

liked 3 models 6 months ago

liked a model 9 months ago

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18, 2025 • 483k • 467

liked 2 models 10 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 250k • • 3.08k

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 124k • • 2.88k

liked a dataset 11 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

Viewer • Updated Feb 19, 2025 • 110k • 553 • 215

liked 2 models 11 months ago

microsoft/OmniParser-v2.0

Updated Mar 28, 2025 • 681 • 1.31k

unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Text Generation • 8B • Updated May 10, 2025 • 22.5k • 293

liked a dataset 11 months ago

Conard/fortune-telling

Viewer • Updated Feb 17, 2025 • 207 • 396 • 166

liked a Space 11 months ago

The Ultra-Scale Playbook

🌌

3.63k

The ultimate guide to training LLM on large GPU Clusters

liked 6 models 11 months ago

Open-Reasoner-Zero/Open-Reasoner-Zero-7B

Reinforcement Learning • 8B • Updated Apr 7, 2025 • 135 • 33

Open-Reasoner-Zero/Open-Reasoner-Zero-32B

Reinforcement Learning • 33B • Updated Apr 7, 2025 • 25 • 33

unsloth/DeepSeek-R1-Distill-Qwen-32B-bnb-4bit

Text Generation • 18B • Updated Feb 14, 2025 • 4.48k • 29

unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF

33B • Updated Jan 25, 2025 • 11.3k • 142

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4

Reinforcement Learning • 8B • Updated Mar 26, 2025 • 1.37k • 227

ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4

Reinforcement Learning • 15B • Updated Feb 13, 2025 • 2.28k • 816

liked a dataset 11 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 12.4k • 691

k

AI & ML interests

Organizations

The Smol Training Playbook

microsoft/UserLM-8b

unsloth/Qwen3-32B-unsloth-bnb-4bit

unsloth/Qwen3-14B-unsloth-bnb-4bit

unsloth/GLM-Z1-32B-0414

ByteDance-Seed/UI-TARS-1.5-7B

deepseek-ai/DeepSeek-V3-0324

Qwen/QwQ-32B

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

microsoft/OmniParser-v2.0

unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Conard/fortune-telling

The Ultra-Scale Playbook

Open-Reasoner-Zero/Open-Reasoner-Zero-7B

Open-Reasoner-Zero/Open-Reasoner-Zero-32B

unsloth/DeepSeek-R1-Distill-Qwen-32B-bnb-4bit

unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4

ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4

open-r1/OpenR1-Math-220k

k

AI & ML interests

Organizations

ky666's activity

The Smol Training Playbook

The Ultra-Scale Playbook