3 17 68

Lei Mingcong

SP4595

https://sp4595.github.io/

SP4595

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Next-Embedding Prediction Makes Strong Vision Learners

upvoted a paper 25 days ago

GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies

upvoted a paper about 1 month ago

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

View all activity

Organizations

upvoted a paper 9 days ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published 9 days ago • 79

upvoted a paper 25 days ago

GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies

Paper • 2512.02581 • Published 26 days ago • 14

upvoted a paper about 1 month ago

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Paper • 2511.20561 • Published Nov 25 • 31

liked a model 4 months ago

OpenGVLab/InternVL3_5-241B-A28B-Instruct

Image-Text-to-Text • 241B • Updated Aug 29 • 757 • 15

liked a dataset 4 months ago

ByteDance-Seed/M3-Bench

Viewer • Updated Aug 14 • 100 • 2.78k • 9

liked a model 5 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18 • 250k • • 2.31k

upvoted a paper 5 months ago

RoboMemory: A Brain-inspired Multi-memory Agentic Framework for Lifelong Learning in Physical Embodied Systems

Paper • 2508.01415 • Published Aug 2 • 7

commented a paper 5 months ago

RoboMemory: A Brain-inspired Multi-memory Agentic Framework for Lifelong Learning in Physical Embodied Systems

Paper • 2508.01415 • Published Aug 2 • 7 •

upvoted a paper 5 months ago

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Paper • 2507.16815 • Published Jul 22 • 40

liked a model 6 months ago

baidu/ERNIE-4.5-VL-424B-A47B-Paddle

Image-Text-to-Text • 424B • Updated Aug 20 • 45 • 21

liked a model 7 months ago

lerobot/smolvla_base

Robotics • Updated Oct 10 • 17.9k • 315

upvoted an article 7 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Jun 3

•

299

liked a dataset 7 months ago

BAAI/ShareRobot

Preview • Updated Aug 24 • 7.05k • 24

upvoted a paper 7 months ago

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete

Paper • 2502.21257 • Published Feb 28 • 2

liked a model 7 months ago

physical-intelligence/fast

Robotics • Updated Jan 16 • 161

liked a dataset 7 months ago

anthonyav/so100-lego-v3

Viewer • Updated May 3 • 82 • 231 • 2

liked a model 7 months ago

moojink/openvla-7b-oft-finetuned-libero-spatial

Robotics • 8B • Updated Jun 17 • 2.71k • 9

liked a model 8 months ago

declare-lab/nora

Robotics • 4B • Updated Aug 27 • 777 • 12

upvoted a paper 8 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 306

liked a model 8 months ago

OnomaAIResearch/Illustrious-XL-v2.0

Updated Apr 19 • 140

Lei Mingcong

AI & ML interests

Recent Activity

Organizations

SP4595's activity

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data