Quantized AI Models - a Vedika-AI Collection

Vedika-AI 's Collections

updated 3 days ago

These models are specifically quantized for CPU optimization you can use this on a docker space up to the speed of 9 token per second