In a Training Loop 🔄

4 41 68

Karsten Kuhnke PRO

mindchain

https://www.linkedin.com/in/jankarstenkuhnke/

AI & ML interests

Mechanistic Interpretability, Sparse Autoencoders, JumpReLU, Reward Modeling, RLHF, AI Alignment, Function Calling, Gemma, Nemotron

Recent Activity

updated a collection about 1 hour ago

OCR

liked a model about 1 hour ago

baidu/ERNIE-4.5-21B-A3B-Paddle

updated a collection about 1 hour ago

OCR

View all activity

Organizations

upvoted a paper about 1 hour ago

PaddleOCR 3.0 Technical Report

Paper • 2507.05595 • Published Jul 8, 2025 • 19

upvoted 2 collections about 1 hour ago

OCR

Collection

3 items • Updated about 1 hour ago • 1

PP-StructureV3

Collection

PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and do cument images to Markdown and JSON. • 17 items • Updated Sep 15, 2025 • 12

upvoted 6 collections 4 days ago

upvoted a paper 4 days ago

Bolmo: Byteifying the Next Generation of Language Models

Paper • 2512.15586 • Published 17 days ago • 13

upvoted 4 papers 5 days ago

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Paper • 2504.19413 • Published Apr 28, 2025 • 36

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 125

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 125

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published 16 days ago • 90

upvoted an article 5 days ago

Article

Diffusers welcomes FLUX-2

Nov 25, 2025

•

167

upvoted a paper 5 days ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published 11 days ago • 59

upvoted 3 collections 5 days ago

— Awesome RL datasets 📈 —

Collection

3 items • Updated Sep 23, 2025 • 1

— Long-context post-training 🧶 —

Collection

Resources for post-training LLMs with long-context samples • 5 items • Updated Sep 14, 2025 • 6

smol2operator Release

Collection

4 items • Updated Sep 23, 2025 • 24

upvoted a paper 5 days ago

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Paper • 2512.10942 • Published 23 days ago • 22

Karsten Kuhnke PRO

AI & ML interests

Recent Activity

Organizations

mindchain's activity

Diffusers welcomes FLUX-2