Update Transformers.js config to use fp16 kv cache for q4f16 model

#3
opened by Xenova (HF staff)
No description provided.
Xenova changed pull request title from Update config.json to Update Transformers.js config to use fp16 kv cache for q4f16 model
Ready to merge
This branch is ready to get merged automatically.