Jeff Gao

jeff-gao

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

upvoted a paper 3 days ago

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

upvoted a paper 17 days ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Organizations

None yet

upvoted 2 papers 3 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 7 days ago • 80

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published 8 days ago • 56

upvoted a paper 17 days ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published 23 days ago • 126

upvoted 2 papers 22 days ago

User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

Paper • 2601.08225 • Published 24 days ago • 51

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Paper • 2601.03986 • Published 30 days ago • 34

upvoted a paper 3 months ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published Nov 9, 2025 • 52

liked a model 4 months ago

ASLP-lab/Easy-Turn

Updated Oct 11, 2025 • 92 • 14

liked a model 5 months ago

inclusionAI/Rubicon-Preview

Text Generation • 31B • Updated Aug 19, 2025 • 164 • 24

upvoted a paper 6 months ago

Evaluating, Synthesizing, and Enhancing for Customer Support Conversation

Paper • 2508.04423 • Published Aug 6, 2025 • 9

upvoted a paper 8 months ago

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Paper • 2506.09827 • Published Jun 11, 2025 • 20

liked a model 11 months ago

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 36.1k • 1.61k

published a model 12 months ago

jeff-gao/Qwen2.5-1.5B-Open-R1-GRPO

Updated Feb 25, 2025

updated a model 12 months ago

jeff-gao/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Feb 24, 2025 • 5

published a model 12 months ago

jeff-gao/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Feb 24, 2025 • 5

liked a model over 1 year ago

jinaai/reader-lm-1.5b

Text Generation • 2B • Updated Jan 17, 2025 • 391 • 609

upvoted a paper over 1 year ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 83

liked a dataset over 1 year ago

facebook/covost2

Updated Jan 18, 2024 • 738 • 44

upvoted 2 papers over 1 year ago

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

Paper • 2407.08348 • Published Jul 11, 2024 • 52

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 91

upvoted a paper almost 2 years ago

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 91
