v0.31.0
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.31.0 for changelog.
README.md
CHANGED
|
@@ -39,7 +39,7 @@ Please contact us to purchase this model. More details on model performance acro
|
|
| 39 |
|
| 40 |
| Model | Precision | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds)
|
| 41 |
|---|---|---|---|---|---|
|
| 42 |
-
| Mistral-3B | w4a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile |
|
| 43 |
|
| 44 |
## Deploying Mistral 3B on-device
|
| 45 |
|
|
|
|
| 39 |
|
| 40 |
| Model | Precision | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds)
|
| 41 |
|---|---|---|---|---|---|
|
| 42 |
+
| Mistral-3B | w4a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_CONTEXT_BINARY | 21.05 | 0.092289 - 2.9532736 | -- | -- |
|
| 43 |
|
| 44 |
## Deploying Mistral 3B on-device
|
| 45 |
|