DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models Paper • 2512.15713 • Published 23 days ago • 16
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published 25 days ago • 100
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models Paper • 2512.08829 • Published about 1 month ago • 18
vibhorag101/phr-mental-therapy-dataset-conversational-format-1024-tokens Viewer • Updated Mar 13, 2024 • 31.8k • 42 • 5
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published Feb 18, 2025 • 38