Update Transformers.js config to use fp16 kv cache for q4f16 model
#3
by
Xenova HF Staff - opened
No description provided.
Xenova changed pull request title from
Update config.json
to Update Transformers.js config to use fp16 kv cache for q4f16 model