Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -57,3 +57,4 @@ curl http://localhost:8000/v1/completions -H "Content-Type: application/json
|
|
| 57 |
"prompt": "San Francisco is a"
|
| 58 |
} '
|
| 59 |
```
|
|
|
|
|
|
| 57 |
"prompt": "San Francisco is a"
|
| 58 |
} '
|
| 59 |
```
|
| 60 |
+
⚡ This model is optimized to handle heavy workloads providing a total throughput of ️**4623 tokens per second** using one NVIDIA L40S ⚡
|