小明

xiaoming

xiaominghero

AI & ML interests

nlp

Organizations

None yet

Recent Activity

upvoted a paper 5 days ago

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published 6 days ago • 76

upvoted a paper 12 days ago

Step-GUI Technical Report

Paper • 2512.15431 • Published 12 days ago • 123

upvoted an article 18 days ago

We Got Claude to Fine-Tune an Open Source LLM

Article • 26 days ago • 545

liked a Space about 2 months ago

📚 The Smol Training Playbook

The secrets to building world-class LLMs • 2.73k

liked a dataset 3 months ago

allenai/CoSyn-400K

Viewer • Updated Feb 28 • 408k • 2.11k • 44

upvoted a collection 4 months ago

MobileLLM-R1

Collection

MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21 • 27

liked 3 datasets 4 months ago

liked 2 models 4 months ago

stepfun-ai/Step-Audio-2-mini

Any-to-Any • 8B • Updated Sep 5 • 1.28k • 241

ByteDance-Seed/Seed-OSS-36B-Base

Text Generation • 36B • Updated Aug 26 • 4.05k • 57

upvoted 2 papers 4 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 211

DINOv3

Paper • 2508.10104 • Published Aug 13 • 291

liked a dataset 4 months ago

nvidia/Nemotron-Pretraining-Dataset-sample

Viewer • Updated 7 days ago • 27.7k • 1.07k • 33

upvoted a collection 4 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 6 days ago • 83

liked a model 4 months ago

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • 685B • Updated Aug 26 • 6.13k • 1k

liked a dataset 4 months ago

stemdataset/STEM

Viewer • Updated Apr 30, 2024 • 1.07M • 1.13k • 5

upvoted a paper 5 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14 • 145

upvoted an article 5 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8 • 740

liked a dataset 5 months ago

nvidia/Llama-Nemotron-VLM-Dataset-v1

Viewer • Updated Oct 22 • 2.86M • 2.54k • 154
