Duplicate from nielsr/comparing-captioning-models
git+https://github.com/huggingface/transformers.git@main
torch
open_clip_torch
accelerate
bitsandbytes