rockylynnstein commited on
Commit
7f6d813
·
verified ·
1 Parent(s): 0741ae6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md CHANGED
@@ -30,6 +30,14 @@ FP8 (8-bit floating point) quantization of NextCoder-14B, optimized for fast cod
30
  | Quantization Date | 2025-11-22 |
31
  | Hardware Used | NVIDIA RTX 5000 Ada Generation (31.5 GB) |
32
 
 
 
 
 
 
 
 
 
33
  ## Usage
34
 
35
  ### Loading the Model
 
30
  | Quantization Date | 2025-11-22 |
31
  | Hardware Used | NVIDIA RTX 5000 Ada Generation (31.5 GB) |
32
 
33
+ #### Quantization Infrastructure
34
+
35
+ Quantized on professional hardware to ensure quality and reliability:
36
+ - **CPUs:** Dual Intel Xeon Max 9480 (224 threads, 128GB HBM2e)
37
+ - **GPU:** NVIDIA RTX 5000 Ada Generation (32GB VRAM) with native FP8 support
38
+ - **Memory:** 256GB DDR5 + 128GB HBM2e = 384GB total
39
+ - **Software:** Ubuntu 25.10 | Python 3.12 | PyTorch 2.8 | CUDA 13 | llm-compressor
40
+
41
  ## Usage
42
 
43
  ### Loading the Model