Multilingual-VQA / sections /conclusion_future_work /conclusion.md

gchhablani

Update conclusion

a03955b over 3 years ago

In this project, we presented a proof of concept: a CLIP Vision + BERT baseline model that leverages a multilingual checkpoint with pre-trained image encoders and covers four languages: English, French, German, and Spanish. Given the limited training time available to us, the model performs well, achieving 0.49 eval accuracy on our multilingual VQAv2 dataset.