TobDeBer commited on
Commit
9e01e86
·
verified ·
1 Parent(s): dbcf693

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -7
README.md CHANGED
@@ -11,14 +11,31 @@ Future modes will also follow the nautic theme.
11
  - tbd: all OSS models with Apache2.0 and MIT license
12
  - tbd: add larger models using advanced compression (REAP, M8, ...)
13
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
  ## ELO (https://lmarena.ai/leaderboard/text)
15
- - Challenge: Best compressed model in 1/2/4/8 GB size
16
- - Phone 4GB
17
- - Home 8GB
18
- - Game 16GB
19
- - Pro 32GB
20
- - Zero 64GB - 71GB
21
- - Server 128GB+
22
  - Towards Frontier@Phone (within 40 ELO of #1) non plus ultra
23
  - qwen3-vl-235b-a22b-instruct 1415 (-37 ELO)
24
  - https://huggingface.co/Qwen/Qwen3-235B-A22B-Instruct-2507
@@ -58,3 +75,4 @@ Actual bpw are higher for small models and lower for larger models. Similar to J
58
  | Q2_K | T2UD1 | 60 | 0.8 |
59
  | Q2_K | M8HQ | 75 | 0.8 |
60
  | Q2_K | M8LQ | 60 | 0.4 .. 0.6 |
 
 
11
  - tbd: all OSS models with Apache2.0 and MIT license
12
  - tbd: add larger models using advanced compression (REAP, M8, ...)
13
 
14
+ ## Challenge: high quality models in 1/2/4/8/.. GB size
15
+ - Phone 4GB
16
+ - Home 8GB
17
+ - Game 16GB
18
+ - Pro 32GB
19
+ - Zero 64GB - 71GB
20
+ - Server 128GB+
21
+
22
+ | Quality vs. Size | Casual | Premium | Advanced | Frontier |
23
+ | :--- | :--- | :-: | :--- | :--- |
24
+ | 64-71 GB | SOTA | SOTA | SOTA | BETA |
25
+ | 32 GB | SOTA | SOTA | SOTA+ | RESEARCH |
26
+ | 16 GB | SOTA | SOTA+ | BETA | - |
27
+ | 8 GB | SOTA | BETA | BETA | - |
28
+ | 4 GB | SOTA | RESEARCH | - | - |
29
+ | 2 GB | RESEARCH | - | - | - |
30
+ | 1 GB | - | - | - | - |
31
+
32
+ - SOTA: K quants
33
+ - SOTA+: UD quants
34
+ - BETA: REAP + UD
35
+ - RESEARCH: M8 and better
36
+
37
  ## ELO (https://lmarena.ai/leaderboard/text)
38
+
 
 
 
 
 
 
39
  - Towards Frontier@Phone (within 40 ELO of #1) non plus ultra
40
  - qwen3-vl-235b-a22b-instruct 1415 (-37 ELO)
41
  - https://huggingface.co/Qwen/Qwen3-235B-A22B-Instruct-2507
 
75
  | Q2_K | T2UD1 | 60 | 0.8 |
76
  | Q2_K | M8HQ | 75 | 0.8 |
77
  | Q2_K | M8LQ | 60 | 0.4 .. 0.6 |
78
+