microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 344k • 1.57k
Running on Zero MCP Featured 1.98k Qwen Image Edit Camera Control 🎬 1.98k Fast 4 step inference with Qwen Image Edit 2509
Running on Zero Featured 113 VLM Object Understanding 🦀 113 Explore object detection, visual grounding, keypoint Detecti