perf: switch to 1.5B Q2_K quantization for lowest possible latency on CPU

#3
by scriptsledge - opened
No description provided.
scriptsledge changed pull request status to closed
