Optimize for 16GB CPU: Enable 4-bit quantization and low memory loading f183b54 Ghaithhmz commited on Feb 12