Running on GPU via HF transformers

#1
by sudhir2016 - opened

Runs out of memory on free tier Google Colab.

As suggested by Eric Alcaide I tried quantization with Hugging Face Quanto. It works fine now. Thanks to @dacorvo for the excellent blog post on Quanto.

sudhir2016 changed discussion status to closed
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment