v0.53.0
Browse filesSee https://github.com/qualcomm/ai-hub-models/releases/v0.53.0 for changelog.
README.md
CHANGED
|
@@ -46,6 +46,7 @@ This model is available for purchase. Please [contact us](mailto:ai-hub-support@
|
|
| 46 |
| Model | Runtime | Precision | Chipset | Context Length | Response Rate (tokens per second) | Time To First Token (range, seconds)
|
| 47 |
|---|---|---|---|---|---|---
|
| 48 |
| PLaMo-1B | QNN_CONTEXT_BINARY | w4a16 | Snapdragon® 8 Elite Mobile | 4096 | 68.21 | 0.031448000000000004 - 1.0063360000000001
|
|
|
|
| 49 |
|
| 50 |
|
| 51 |
|
|
|
|
| 46 |
| Model | Runtime | Precision | Chipset | Context Length | Response Rate (tokens per second) | Time To First Token (range, seconds)
|
| 47 |
|---|---|---|---|---|---|---
|
| 48 |
| PLaMo-1B | QNN_CONTEXT_BINARY | w4a16 | Snapdragon® 8 Elite Mobile | 4096 | 68.21 | 0.031448000000000004 - 1.0063360000000001
|
| 49 |
+
| PLaMo-1B | QNN_CONTEXT_BINARY | w4a16 | Snapdragon® 8 Elite Mobile | 4096 | 68.21 | 0.031448000000000004 - 1.0063360000000001
|
| 50 |
|
| 51 |
|
| 52 |
|