arturo-fredes commited on
Commit
984d231
·
verified ·
1 Parent(s): 050b75c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -5
README.md CHANGED
@@ -232,12 +232,9 @@ Benchmark scores were obtained with the following setups. Methodology varies by
232
  | Metric | GPT-OSS-120B | Hypernova 60B 2605 |
233
  |--------|-------------:|-------------------:|
234
  | Concurrency | 128 | 128 |
235
- | Throughput (tok/s) | 3,821 | 5,210 |
236
- | E2E latency (s) | 24.05 | 14.74 |
237
- | Output speed (tok/s) | 57.79 | 69.31 |
238
  | TTFT (s) | 7.04 | 4.85 |
239
- | Est. total memory (GB) | 123.55 | 38.83 |
240
- | Model weights (GB) | 121.54 | 31.81 |
241
 
242
 
243
  #### Performance evaluation conditions
 
232
  | Metric | GPT-OSS-120B | Hypernova 60B 2605 |
233
  |--------|-------------:|-------------------:|
234
  | Concurrency | 128 | 128 |
235
+ | Throughput (tok/s) | 3,821 | 5,210 ||
 
 
236
  | TTFT (s) | 7.04 | 4.85 |
237
+ | Model weights (GB) | 65 | 32 |
 
238
 
239
 
240
  #### Performance evaluation conditions