arxiv:2411.13209
Sushant Gautam
SushantGautam
AI & ML interests
multimodal, deep learning
Organizations
models
46
SushantGautam/whisper-small-no
Automatic Speech Recognition
•
Updated
•
42
SushantGautam/testkvasirscript
Updated
SushantGautam/MEDVQACpaligemma
Updated
SushantGautam/MEDVQACpaligemma-v0
Updated
•
7
SushantGautam/MEDVQACdiffusion3-adapter-v0
Updated
SushantGautam/MEDVQACpaligemma-adapter
Updated
•
4
SushantGautam/MEDVQACdiffusion-v0
Text-to-Image
•
Updated
•
1
SushantGautam/KG-LLM-roberta-base-single_stage
Text Classification
•
Updated
•
5
SushantGautam/KG-LLM-roberta-base-claim_only
Text Classification
•
Updated
•
5
SushantGautam/KG-LLM-bert-base
Text Classification
•
Updated
•
4
datasets
5
SushantGautam/kvasir-vqa
Viewer
•
Updated
•
6.5k
•
191
SushantGautam/ImageCLEFmed-MEDVQA-GI-2024-Dev_mod
Viewer
•
Updated
•
20.2k
•
19
SushantGautam/ImageCLEFmed-MEDVQA-GI-2024-Dev
Viewer
•
Updated
•
20.2k
•
8
SushantGautam/SoccerNet-Echoes
Updated
•
117
SushantGautam/SoccerNet-10s-5Class
Updated
•
3
•
1