Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
LeroyDyer
/
SpydazWeb_Speech_Vision_EncoderDecoder_Multimodal_5b_Project
like
1
Transformers
Safetensors
English
vision
speech
image-text-text
audio-text-text
Multi-Modal
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
LeroyDyer
commited on
Apr 17, 2024
Commit
eef556f
·
verified
·
1 Parent(s):
d83aecc
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+0
-1
README.md
CHANGED
Viewed
@@ -9,7 +9,6 @@ tags:
9
- image-text-text
10
- audio-text-text
11
- Multi-Modal
12
-
pipeline_tag: automatic-speech-recognition
13
---
14
15
9
- image-text-text
10
- audio-text-text
11
- Multi-Modal
12
---
13
14