zeroMN/SHMT
Tags: Audio-Text-to-Text, Transformers, 292 datasets, English, Chinese, transformer, multimodal, vqa, text, audio, Eval Results
License: apache-2.0
Repository size: 2.5 GB
Contributors: 1
History: 27 commits
Latest commit: 0f364a7, "Create test_date", by zeroMN (verified), 12 months ago
Name                  Size           Last commit message                  Updated
code_generator        -              Upload folder using huggingface_hub  12 months ago
nlp_encoder           -              Upload folder using huggingface_hub  12 months ago
speech_encoder        -              Upload folder using huggingface_hub  12 months ago
text_generator        -              Upload folder using huggingface_hub  12 months ago
vision_encoder        -              Upload folder using huggingface_hub  12 months ago
.gitattributes        1.57 kB        Upload folder using huggingface_hub  12 months ago
.yml                  334 Bytes      Upload folder using huggingface_hub  12 months ago
README.md             6.26 kB        Update README.md                     12 months ago
SJMT_model.pth        2.48 GB (xet)  Upload 24 files                      12 months ago
app.py                7.72 kB        Upload 24 files                      12 months ago
config.json           1.06 kB        Upload folder using huggingface_hub  12 months ago
config.yml            2.07 kB        Upload folder using huggingface_hub  12 months ago
main.py               2.71 kB        Upload 9 files                       12 months ago
multi_modal_model.py  7.4 kB         Upload 9 files                       12 months ago
requirements.txt      113 Bytes      Upload 9 files                       12 months ago
sample-15s.wav        3.38 MB (xet)  Upload folder using huggingface_hub  12 months ago
test_date             0 Bytes        Create test_date                     12 months ago
tuili.py              1.25 kB        Rename app.py to tuili.py            12 months ago