Are there instructions on how to quantize the model to achieve inference times under 2 s? Reduced precision is acceptable.