v0.33.0
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.33.0 for changelog.
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
DEPLOYMENT_MODEL_LICENSE.pdf filter=lfs diff=lfs merge=lfs -text
|
Whisper-Medium-En_WhisperDecoderInf.onnx → DEPLOYMENT_MODEL_LICENSE.pdf
RENAMED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4409f93b0e82531303b3e10f52f1fdfb56467a25f05b7441c6bbd8bb8a64b42c
|
| 3 |
+
size 109629
|
LICENSE
ADDED
|
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
|
|
|
| 1 |
+
The license of the original trained model can be found at https://github.com/openai/whisper/blob/main/LICENSE.
|
| 2 |
+
The license for the deployable model files (.tflite, .onnx, .dlc, .bin, etc.) can be found in DEPLOYMENT_MODEL_LICENSE.pdf.
|
README.md
CHANGED
|
@@ -31,20 +31,19 @@ More details on model performance across various devices, can be found
|
|
| 31 |
- Model checkpoint: medium.en
|
| 32 |
- Input resolution: 80x3000 (30 seconds audio)
|
| 33 |
- Mean decoded sequence length: 224 tokens
|
| 34 |
-
- Number of parameters:
|
| 35 |
-
- Model size (
|
| 36 |
-
-
|
|
|
|
| 37 |
|
| 38 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 39 |
|---|---|---|---|---|---|---|---|---|
|
| 40 |
| WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 1969.856 ms | 201 - 251 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 41 |
| WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 1720.841 ms | 60 - 308 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 42 |
| WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 1509.053 ms | 229 - 275 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 43 |
-
| WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 1545.124 ms | 953 - 953 MB | NPU | [Whisper-Medium-En.onnx](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.onnx) |
|
| 44 |
| WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 92.152 ms | 42 - 1250 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 45 |
| WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 91.218 ms | 42 - 1597 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 46 |
| WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 80.416 ms | 43 - 1382 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 47 |
-
| WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 66.789 ms | 566 - 566 MB | NPU | [Whisper-Medium-En.onnx](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.onnx) |
|
| 48 |
|
| 49 |
|
| 50 |
|
|
|
|
| 31 |
- Model checkpoint: medium.en
|
| 32 |
- Input resolution: 80x3000 (30 seconds audio)
|
| 33 |
- Mean decoded sequence length: 224 tokens
|
| 34 |
+
- Number of parameters (WhisperEncoderInf): 358M
|
| 35 |
+
- Model size (WhisperEncoderInf) (float): 1.33 GB
|
| 36 |
+
- Number of parameters (WhisperDecoderInf): 406M
|
| 37 |
+
- Model size (WhisperDecoderInf) (float): 1.51 GB
|
| 38 |
|
| 39 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 40 |
|---|---|---|---|---|---|---|---|---|
|
| 41 |
| WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 1969.856 ms | 201 - 251 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 42 |
| WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 1720.841 ms | 60 - 308 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 43 |
| WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 1509.053 ms | 229 - 275 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
|
|
|
| 44 |
| WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 92.152 ms | 42 - 1250 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 45 |
| WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 91.218 ms | 42 - 1597 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
| 46 |
| WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 80.416 ms | 43 - 1382 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
|
|
|
|
| 47 |
|
| 48 |
|
| 49 |
|
Whisper-Medium-En_WhisperEncoderInf.onnx
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:b86b70e4b238462b25d7c48f68d401ed6e32d6dc72d849019b3b9e09dbfcf2b8
|
| 3 |
-
size 1430779879
|
|
|
|
|
|
|
|
|
|
|
|