Commit 9c00054 by lpalbou (verified) · 1 parent: d303c64

Put validation values directly in comparison table

Files changed (1)
  1. README.md +6 -12
README.md CHANGED
@@ -44,19 +44,13 @@ Bottom line:
  - The BF16 package reduces storage, not runtime memory. It is useful when you want a smaller, uploadable package with byte-identical output to the original source-cache run.
  - The mixed q8/BF16 package reduces both storage and runtime memory. Use it when memory footprint matters.

- | Layout | Disk | Runtime Memory | Improvement |
- | --- | ---: | --- | --- |
- | Original source snapshot | 118 GiB | Baseline | Baseline. |
- | This BF16 package | 64 GiB | Same class as original | Storage only; output was byte-identical. |
- | Mixed q8/BF16 package | 40 GiB | Lower | Storage and memory; side-by-side quality validation passed. |
+ | Layout | Disk | MLX Peak | Physical Peak | Time | Result |
+ | --- | ---: | ---: | ---: | ---: | --- |
+ | Original source snapshot | 118 GiB | 32.99 GiB | 48.90 GiB | 108.31 s | Baseline. |
+ | This BF16 package | 64 GiB | 32.98 GiB | 45.12 GiB | 114.39 s | Storage only; output was byte-identical. |
+ | Mixed q8/BF16 package | 40 GiB | 20.84 GiB | 31.75 GiB | 110.34 s | Storage and memory; side-by-side quality validation passed. |

- Compared with the original source snapshot, this BF16 package cuts disk usage by about 46% but does not materially reduce generation memory. The mixed q8/BF16 package cuts disk usage by about 66% and physical peak memory by about 35% in this validation run.
-
- Raw measurements:
-
- - Original source snapshot: 32.99 GiB MLX peak, 48.90 GiB physical peak, 108.31 s.
- - This BF16 package: 32.98 GiB MLX peak, 45.12 GiB physical peak, 114.39 s.
- - Mixed q8/BF16 package: 20.84 GiB MLX peak, 31.75 GiB physical peak, 110.34 s.
+ Compared with the original source snapshot, this BF16 package cuts disk usage by about 46% but does not materially reduce generation memory. The mixed q8/BF16 package cuts disk usage by about 66%, MLX peak memory by about 37%, and physical peak memory by about 35% in this validation run.

  ## Compatibility
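
The percentages in the updated summary sentence can be re-derived from the values in the new comparison table. The following is a minimal sketch of that arithmetic, not code from the repository: the helper names `reduction_pct` and `summarize` and the dictionary layout are hypothetical, and the inputs are simply the table figures copied by hand.

```python
def reduction_pct(baseline: float, value: float) -> float:
    """Return the percentage by which `value` is smaller than `baseline`."""
    return (1.0 - value / baseline) * 100.0


# Figures copied from the comparison table (all in GiB).
original = {"disk": 118.0, "mlx_peak": 32.99, "physical_peak": 48.90}
bf16 = {"disk": 64.0, "mlx_peak": 32.98, "physical_peak": 45.12}
mixed = {"disk": 40.0, "mlx_peak": 20.84, "physical_peak": 31.75}


def summarize(package: dict) -> dict:
    """Round each metric's reduction versus the original to a whole percent."""
    return {
        metric: round(reduction_pct(original[metric], package[metric]))
        for metric in original
    }


bf16_summary = summarize(bf16)
mixed_summary = summarize(mixed)

print("BF16 package:", bf16_summary)
print("Mixed q8/BF16 package:", mixed_summary)
```

Under these inputs the mixed q8/BF16 package comes out at roughly 66% (disk), 37% (MLX peak), and 35% (physical peak), and the BF16 package at roughly 46% on disk with an essentially unchanged MLX peak, which agrees with the figures quoted in the commit.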