SpecBundle Collection A collection of production-grade draft models for speculative decoding • 14 items • Updated 13 days ago • 13
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 132
ColModernVBERT Collection Resources for ColModernVBERT – the document retrieval–optimized variant of ModernVBERT • 5 items • Updated Oct 3, 2025 • 7
BERT Hash Nano Models Collection Set of BERT models with a modified embeddings layer • 4 items • Updated 14 days ago • 9
CoDA Collection CoDA is Salesforce AI Research's open, lightweight and diffusion-based language model. • 2 items • Updated Oct 31, 2025 • 5
AcceLLM: Accelerating LLM Inference using Redundancy for Load Balancing and Data Locality Paper • 2411.05555 • Published Nov 8, 2024 • 6
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights Paper • 2509.22944 • Published Sep 26, 2025 • 79
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29, 2025 • 206
Holo1.5 Collection Holo1.5 - Open Foundation Models for Computer Use Agents • 5 items • Updated Sep 15, 2025 • 34
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15, 2025 • 50
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Paper • 2509.02544 • Published Sep 2, 2025 • 124
TimesFM Release Collection TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 6 items • Updated Oct 4, 2025 • 29
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Paper • 2508.14041 • Published Aug 19, 2025 • 59
OpenReasoning-Nemotron Collection Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 13 days ago • 46
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 29 items • Updated Sep 8, 2025 • 82