Update README.md
README.md
CHANGED
@@ -100,7 +100,7 @@ Even though the 7B INT8 model itself takes approximately **8–9 GB**, we recomm
 For pure CPU inference (no GPU), we recommend at least **64GB system RAM** (Ryzen 9 7950X/9950X).
 
 ---
-I
+I added the default model in full FP32 precision, which is approximately 62 GB in size. This serves as the base for quantization, allowing us to find the optimal balance between model size and performance.
 
 
 ## 📧 Contact & Licensing
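
The added README line describes the FP32 checkpoint as the base for quantization. As a rough illustration of the size side of that trade-off, the sketch below scales the quoted 62 GB FP32 figure to lower precisions. It assumes size scales linearly with bits per weight, which is only an approximation. The function name `estimate_size_gb` and the precision table are hypothetical and not part of the repository.

```python
# Rough size estimate for a checkpoint at different weight precisions.
# Assumption: size scales linearly with bits per weight. Real quantized
# files also store scales/zero points and may keep some tensors in higher
# precision, so actual sizes will differ from these numbers.

BITS_PER_WEIGHT = {"fp32": 32, "fp16": 16, "int8": 8, "int4": 4}


def estimate_size_gb(fp32_size_gb: float, target: str) -> float:
    """Scale an FP32 checkpoint size (in GB) to the target precision."""
    return fp32_size_gb * BITS_PER_WEIGHT[target] / BITS_PER_WEIGHT["fp32"]


if __name__ == "__main__":
    fp32_size_gb = 62.0  # figure quoted in the README change above
    for name in BITS_PER_WEIGHT:
        print(f"{name}: ~{estimate_size_gb(fp32_size_gb, name):.1f} GB")
```

These estimates cover size only. The accuracy cost of each precision has to be measured on the model itself.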