Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

video-understanding

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

42

Full-text search

Active filters: video-understanding

TencentARC/TimeLens-7B

Video-Text-to-Text • 8B • Updated 1 day ago • 26 • 4

TencentARC/TimeLens-8B

Video-Text-to-Text • 9B • Updated 1 day ago • 19 • 3

TencentARC/ARC-Hunyuan-Video-7B

Video-Text-to-Text • 9B • Updated Sep 19 • 7.94k • 31

TencentARC/ARC-Qwen-Video-7B-Narrator

Video-Text-to-Text • 9B • Updated Sep 21 • 62 • 9

dpaul06/VideoLights

Video-Text-to-Text • Updated 1 day ago

GoodiesHere/Apollo-LMMs-Apollo-1_5B-t32

Video-Text-to-Text • Updated Dec 18, 2024 • 26 • 10

GoodiesHere/Apollo-LMMs-Apollo-3B-t32

Text Generation • Updated Dec 18, 2024 • 34 • 21

GoodiesHere/Apollo-LMMs-Apollo-7B-t32

Video-Text-to-Text • Updated Dec 18, 2024 • 38 • 57

Sri-Vigneshwar-DJ/Apollo-LMMs-Apollo-1.5B-t32

Video-Text-to-Text • Updated Jan 1 • 48 • 1

Sri-Vigneshwar-DJ/Apollo-LMMs-Apollo-3B-t32

Video-Text-to-Text • Updated Jan 1 • 12

Sri-Vigneshwar-DJ/Apollo-LMMs-Apollo-7B-t32

Video-Text-to-Text • Updated Jan 1 • 13 • 1

BBBBCHAN/LLaVA-Scissor-baseline-7B

Video-Text-to-Text • 8B • Updated Jul 1 • 398 • 3

BBBBCHAN/LLaVA-Scissor-baseline-0.5B

Video-Text-to-Text • 0.9B • Updated Jul 1 • 21 • 4

Falconss1/TW-GRPO

Video-Text-to-Text • 8B • Updated Jun 15 • 26

UserJoseph/DisTime-1B

Video-Text-to-Text • 0.9B • Updated Sep 17 • 22

eagle0504/vjepa2-vitl-fpc16-256-ssv2-ucf101

Video Classification • 0.4B • Updated Jul 5 • 18

FOUND-AI/found_protocol

Text Generation • Updated Jul 29 • 2

haichaozhang/VQ-Token-llava-ov-0.5b

Video-Text-to-Text • 1B • Updated Sep 21 • 1 • 1

GrassData/cliptagger-12b

Image-Text-to-Text • Updated Aug 14 • 8

inference-net/ClipTagger-12b

Image-Text-to-Text • 12B • Updated Aug 14 • 274 • 53

abocide/matchcommentary

Text Generation • Updated Aug 30 • 12 • 3

TencentARC/ARC-Qwen-Video-7B

Video-Text-to-Text • 9B • Updated Sep 21 • 304 • 5

Enxin/VideoNSA

Video-Text-to-Text • 9B • Updated Oct 8 • 75 • 2

ZJU-AI4H/Hulu-Med-7B

Image-Text-to-Text • 8B • Updated 22 days ago • 8.81k • 46

ZJU-AI4H/Hulu-Med-14B

Image-Text-to-Text • 15B • Updated 22 days ago • 12.3k • 43

ZJU-AI4H/Hulu-Med-32B

Image-Text-to-Text • 33B • Updated 22 days ago • 1.52k • 46

wangkanai/qwen2.5-vl-32b-instruct

Image-Text-to-Text • 33B • Updated Nov 1 • 153 • 2

wangkanai/qwen3-vl-32b-instruct

Image-Text-to-Text • Updated Oct 28 • 1

wangkanai/qwen3-vl-4b-thinking

Image-Text-to-Text • 4B • Updated Nov 5 • 39 • 1

nyu-visionx/Cambrian-S-7B

Image-to-Text • 8B • Updated Nov 7 • 3.2k • 5