Official models and datasets for paper(https://arxiv.org/abs/2511.11910)
AI & ML interests
None defined yet.
Recent Activity
Papers
View all Papers
Organization Card
Hey there! Welcome to our team's corner at HuggingFace! We're a bunch of enthusiastic folks who are totally into the exciting world of Multimodal Large Language Models.
Our research explores innovative ways to enhance interactions between language and Image/Vidio/Audio, aiming to advance the capabilities of AI in understanding and generating multimodal content.
We're a curious bunch, always on the lookout for cool ways to make AI systems understand and generate human-like language.
models
6
AlpachinoNLP/LongCLIP-ViT-B-32
Zero-Shot Image Classification
•
0.2B
•
Updated
•
10
AlpachinoNLP/QTSplus-3B
Image-Text-to-Text
•
Updated
•
30
•
1
AlpachinoNLP/QTSplus-7B
Image-Text-to-Text
•
Updated
•
24
•
1
AlpachinoNLP/QTSplus-3B-FT
Image-Text-to-Text
•
Updated
•
10
•
1
AlpachinoNLP/Baichuan-7B-Instruction
Text Generation
•
7B
•
Updated
•
17
•
2
AlpachinoNLP/Baichuan-13B-Instruction
Text Generation
•
Updated
•
16
•
6