Update README.md
README.md
CHANGED
@@ -118,9 +118,7 @@ Each quantized weight tensor has corresponding scale factors:
 | Device | Memory | Notes |
 |--------|--------|-------|
 | Apple M4 Max | 36 GB+ | Via Metal Marlin |
-| Apple M2 Ultra |
-| NVIDIA RTX 3090 | 24 GB | With offloading |
-| NVIDIA RTX 4090 | 24 GB | Native |
+| Apple M2 Ultra | 36 GB+ | Via Metal Marlin |
 
 ## Benchmarks
 