TODO:
- Fix sample images
- Allow other image types
- Allow the model to iteratively sample text
- Add nucleus size and other advanced options


<p>
Please note that this model was explicitly not trained on images of people, and as a result is not designed to caption images with humans.
This demo accompanies our paper RedCaps
Created by [Karan Desai](mailto:kdexd@umich.edu), [Gaurav Kaul](mailto:kaulg@umich.edu), [Zubin Aysola](mailto:aysola@umich.edu), [Justin Johnson](justincj@umich.edu)
</p>