seojinlee

sjlee311

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

upvoted a paper about 1 month ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper about 1 month ago

Revisiting Generalization Across Difficulty Levels: It's Not So Easy

Organizations

None yet

upvoted a paper 17 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published 21 days ago • 103

upvoted 3 papers about 1 month ago

upvoted 3 papers 2 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10, 2025 • 83

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published Oct 17, 2025 • 147

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21, 2025 • 86

upvoted 4 papers 3 months ago

Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published Oct 6, 2025 • 22

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 501

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24, 2025 • 42

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22, 2025 • 143

upvoted 7 papers 4 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 150

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1, 2025 • 57

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 228

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

Paper • 2508.20931 • Published Aug 28, 2025 • 15

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 160

upvoted 2 papers 5 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21, 2025 • 90

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129
