Update README.md
README.md
CHANGED
|
@@ -11,14 +11,31 @@ Future modes will also follow the nautic theme.
|
|
| 11 |
- tbd: all OSS models with Apache2.0 and MIT license
|
| 12 |
- tbd: add larger models using advanced compression (REAP, M8, ...)
|
| 13 |
|
| 14 |
## ELO (https://lmarena.ai/leaderboard/text)
|
| 15 |
-
|
| 16 |
-
- Phone 4GB
|
| 17 |
-
- Home 8GB
|
| 18 |
-
- Game 16GB
|
| 19 |
-
- Pro 32GB
|
| 20 |
-
- Zero 64GB - 71GB
|
| 21 |
-
- Server 128GB+
|
| 22 |
- Towards Frontier@Phone (within 40 ELO of #1) non plus ultra
|
| 23 |
- qwen3-vl-235b-a22b-instruct 1415 (-37 ELO)
|
| 24 |
- https://huggingface.co/Qwen/Qwen3-235B-A22B-Instruct-2507
|
|
@@ -58,3 +75,4 @@ Actual bpw are higher for small models and lower for larger models. Similar to J
|
|
| 58 |
| Q2_K | T2UD1 | 60 | 0.8 |
|
| 59 |
| Q2_K | M8HQ | 75 | 0.8 |
|
| 60 |
| Q2_K | M8LQ | 60 | 0.4 .. 0.6 |
|
| 11 |
- tbd: all OSS models with Apache2.0 and MIT license
|
| 12 |
- tbd: add larger models using advanced compression (REAP, M8, ...)
|
| 13 |
|
| 14 |
+
## Challenge: high quality models in 1/2/4/8/.. GB size
|
| 15 |
+
- Phone 4GB
|
| 16 |
+
- Home 8GB
|
| 17 |
+
- Game 16GB
|
| 18 |
+
- Pro 32GB
|
| 19 |
+
- Zero 64GB - 71GB
|
| 20 |
+
- Server 128GB+
|
| 21 |
+
|
| 22 |
+
| Quality vs. Size | Casual | Premium | Advanced | Frontier |
|
| 23 |
+
| :--- | :--- | :-: | :--- | :--- |
|
| 24 |
+
| 64-71 GB | SOTA | SOTA | SOTA | BETA |
|
| 25 |
+
| 32 GB | SOTA | SOTA | SOTA+ | RESEARCH |
|
| 26 |
+
| 16 GB | SOTA | SOTA+ | BETA | - |
|
| 27 |
+
| 8 GB | SOTA | BETA | BETA | - |
|
| 28 |
+
| 4 GB | SOTA | RESEARCH | - | - |
|
| 29 |
+
| 2 GB | RESEARCH | - | - | - |
|
| 30 |
+
| 1 GB | - | - | - | - |
|
| 31 |
+
|
| 32 |
+
- SOTA: K quants
|
| 33 |
+
- SOTA+: UD quants
|
| 34 |
+
- BETA: REAP + UD
|
| 35 |
+
- RESEARCH: M8 and better
|
| 36 |
+
|
| 37 |
## ELO (https://lmarena.ai/leaderboard/text)
|
| 38 |
+
|
| 39 |
- Towards Frontier@Phone (within 40 ELO of #1) non plus ultra
|
| 40 |
- qwen3-vl-235b-a22b-instruct 1415 (-37 ELO)
|
| 41 |
- https://huggingface.co/Qwen/Qwen3-235B-A22B-Instruct-2507
|
| 75 |
| Q2_K | T2UD1 | 60 | 0.8 |
|
| 76 |
| Q2_K | M8HQ | 75 | 0.8 |
|
| 77 |
| Q2_K | M8LQ | 60 | 0.4 .. 0.6 |
|
| 78 |
+
|
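
The "Quality vs. Size" matrix and its legend added in this change can be read as a two-step lookup: size and quality tier give a status, and the status gives a compression approach. A minimal sketch of that reading follows, assuming the cells are transcribed exactly as they appear in the diff. The names `STATUS_BY_SIZE`, `METHOD_BY_STATUS`, and `method_for` are hypothetical and not part of the README.

```python
# Hypothetical lookup built from the README's "Quality vs. Size" table.
# Column order follows the table header; "-" cells are stored as None.
TIERS = ("Casual", "Premium", "Advanced", "Frontier")

STATUS_BY_SIZE = {
    "64-71 GB": ("SOTA", "SOTA", "SOTA", "BETA"),
    "32 GB": ("SOTA", "SOTA", "SOTA+", "RESEARCH"),
    "16 GB": ("SOTA", "SOTA+", "BETA", None),
    "8 GB": ("SOTA", "BETA", "BETA", None),
    "4 GB": ("SOTA", "RESEARCH", None, None),
    "2 GB": ("RESEARCH", None, None, None),
    "1 GB": (None, None, None, None),
}

# Legend from the README: status label -> compression approach.
METHOD_BY_STATUS = {
    "SOTA": "K quants",
    "SOTA+": "UD quants",
    "BETA": "REAP + UD",
    "RESEARCH": "M8 and better",
}


def method_for(size, tier):
    """Return the legend entry for a (size, tier) cell, or None for "-"."""
    status = STATUS_BY_SIZE[size][TIERS.index(tier)]
    return None if status is None else METHOD_BY_STATUS[status]


if __name__ == "__main__":
    for size in STATUS_BY_SIZE:
        print(size, [method_for(size, tier) for tier in TIERS])
```

Under this reading, `method_for("16 GB", "Premium")` yields the "UD quants" entry, while cells marked `-` in the table return `None`.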