Update README.md
Browse files
README.md
CHANGED
|
@@ -71,7 +71,7 @@ We deploy K2-THINK on Cerebras Wafer-Scale Engine (WSE) systems, leveraging the
|
|
| 71 |
| Platform | Throughput (tokens/sec) | Example: 32k-token response (time) |
|
| 72 |
| --------------------------------- | ----------------------: | ---------------------------------: |
|
| 73 |
| **Cerebras WSE (our deployment)** | **\~2,000** | **\~16 s** |
|
| 74 |
-
| Typical
|
| 75 |
|
| 76 |
---
|
| 77 |
|
|
|
|
| 71 |
| Platform | Throughput (tokens/sec) | Example: 32k-token response (time) |
|
| 72 |
| --------------------------------- | ----------------------: | ---------------------------------: |
|
| 73 |
| **Cerebras WSE (our deployment)** | **\~2,000** | **\~16 s** |
|
| 74 |
+
| Typical Cloud Service setup | \~200 | \~160 s |
|
| 75 |
|
| 76 |
---
|
| 77 |
|