qaihm-bot commited on
Commit
faaa7cb
·
verified ·
1 Parent(s): 8babe88

See https://github.com/quic/ai-hub-models/releases/v0.33.0 for changelog.

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ DEPLOYMENT_MODEL_LICENSE.pdf filter=lfs diff=lfs merge=lfs -text
Whisper-Medium-En_WhisperDecoderInf.onnx → DEPLOYMENT_MODEL_LICENSE.pdf RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2bc4614f1f973a7f2b9ed0c4171d032761eac0cba67fc25d421f78cd8fa3275a
3
- size 1838005917
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4409f93b0e82531303b3e10f52f1fdfb56467a25f05b7441c6bbd8bb8a64b42c
3
+ size 109629
LICENSE ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ The license of the original trained model can be found at https://github.com/openai/whisper/blob/main/LICENSE.
2
+ The license for the deployable model files (.tflite, .onnx, .dlc, .bin, etc.) can be found in DEPLOYMENT_MODEL_LICENSE.pdf.
README.md CHANGED
@@ -31,20 +31,19 @@ More details on model performance across various devices, can be found
31
  - Model checkpoint: medium.en
32
  - Input resolution: 80x3000 (30 seconds audio)
33
  - Mean decoded sequence length: 224 tokens
34
- - Number of parameters: 769 M
35
- - Model size (WhisperEncoder): 769 MB
36
- - Model size (WhisperDecoder): 726 MB
 
37
 
38
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
39
  |---|---|---|---|---|---|---|---|---|
40
  | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 1969.856 ms | 201 - 251 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
41
  | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 1720.841 ms | 60 - 308 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
42
  | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 1509.053 ms | 229 - 275 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
43
- | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 1545.124 ms | 953 - 953 MB | NPU | [Whisper-Medium-En.onnx](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.onnx) |
44
  | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 92.152 ms | 42 - 1250 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
45
  | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 91.218 ms | 42 - 1597 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
46
  | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 80.416 ms | 43 - 1382 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
47
- | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 66.789 ms | 566 - 566 MB | NPU | [Whisper-Medium-En.onnx](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.onnx) |
48
 
49
 
50
 
 
31
  - Model checkpoint: medium.en
32
  - Input resolution: 80x3000 (30 seconds audio)
33
  - Mean decoded sequence length: 224 tokens
34
+ - Number of parameters (WhisperEncoderInf): 358M
35
+ - Model size (WhisperEncoderInf) (float): 1.33 GB
36
+ - Number of parameters (WhisperDecoderInf): 406M
37
+ - Model size (WhisperDecoderInf) (float): 1.51 GB
38
 
39
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
40
  |---|---|---|---|---|---|---|---|---|
41
  | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 1969.856 ms | 201 - 251 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
42
  | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 1720.841 ms | 60 - 308 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
43
  | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 1509.053 ms | 229 - 275 MB | GPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
 
44
  | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 92.152 ms | 42 - 1250 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
45
  | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 91.218 ms | 42 - 1597 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
46
  | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 80.416 ms | 43 - 1382 MB | NPU | [Whisper-Medium-En.tflite](https://huggingface.co/qualcomm/Whisper-Medium-En/blob/main/Whisper-Medium-En.tflite) |
 
47
 
48
 
49
 
Whisper-Medium-En_WhisperEncoderInf.onnx DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:b86b70e4b238462b25d7c48f68d401ed6e32d6dc72d849019b3b9e09dbfcf2b8
3
- size 1430779879