v0.53.0
Browse filesSee https://github.com/qualcomm/ai-hub-models/releases/v0.53.0 for changelog.
README.md
CHANGED
|
@@ -47,6 +47,7 @@ Download pre-exported model assets from **[Qwen2-7B-Instruct on Qualcomm® AI Hu
|
|
| 47 |
| Model | Runtime | Precision | Chipset | Context Length | Response Rate (tokens per second) | Time To First Token (range, seconds)
|
| 48 |
|---|---|---|---|---|---|---
|
| 49 |
| Qwen2-7B-Instruct | QNN_CONTEXT_BINARY | w4a16 | Snapdragon® 8 Elite Mobile | 4096 | 13.65 | 0.170593 - 5.458976
|
|
|
|
| 50 |
|
| 51 |
## License
|
| 52 |
* The license for the original implementation of Qwen2-7B-Instruct can be found
|
|
|
|
| 47 |
| Model | Runtime | Precision | Chipset | Context Length | Response Rate (tokens per second) | Time To First Token (range, seconds)
|
| 48 |
|---|---|---|---|---|---|---
|
| 49 |
| Qwen2-7B-Instruct | QNN_CONTEXT_BINARY | w4a16 | Snapdragon® 8 Elite Mobile | 4096 | 13.65 | 0.170593 - 5.458976
|
| 50 |
+
| Qwen2-7B-Instruct | QNN_CONTEXT_BINARY | w4a16 | Snapdragon® 8 Elite Mobile | 4096 | 13.65 | 0.170593 - 5.458976
|
| 51 |
|
| 52 |
## License
|
| 53 |
* The license for the original implementation of Qwen2-7B-Instruct can be found
|