Audio Classification
FBAGSTM commited on
Commit
b78758c
·
verified ·
1 Parent(s): ca7b1fc

Update Readme ST Model Zoo

Browse files
Files changed (1) hide show
  1. README.md +6 -12
README.md CHANGED
@@ -1,7 +1,3 @@
1
- ---
2
- license: apache-2.0
3
- pipeline_tag: audio-classification
4
- ---
5
  # Quantized miniresnet
6
 
7
  ## **Use case** : `AED`
@@ -56,17 +52,17 @@ It outputs embedding vectors of size 2048 for the 2 stacks version, and 3548 for
56
 
57
  ## Metrics
58
 
59
- * Measures are done with default STM32Cube.AI configuration with enabled input / output allocated option.
60
 
61
- * `tl` stands for "transfer learning", meaning that the model backbone weights were initialized from a pre-trained model, then only the last layer was unfrozen during the training.
 
62
 
63
  ### Reference MCU memory footprint based on ESC-10 dataset
64
 
65
 
66
  | Model | Format | Resolution | Series | Activation RAM (KiB) | Runtime RAM (KiB)| Weights Flash (KiB) | Code Flash (KiB) | Total RAM (KiB) | Total Flash (KiB)| STM32Cube.AI version |
67
  |-------------------|--------|------------|---------|----------------|-------------|---------------|------------|-------------|-------------|-----------------------|
68
- | [MiniResNet 1stack ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnet/ST_pretrainedmodel_public_dataset/esc10/miniresnet_1stacks_64x50_tl/miniresnet_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 59.89 | 5.38 | 123.6 | 56.9 | 65.27 | 180.5 | 10.0.0 |
69
- | [MiniResNet 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnet/ST_pretrainedmodel_public_dataset/esc10/miniresnet_2stacks_64x50_tl/miniresnet_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 59.89 | 8.37 | 431.1 | 63.69 | 68.26 | 494.9 | 10.0.0 |
70
 
71
 
72
  ### Reference inference time based on ESC-10 dataset
@@ -74,8 +70,8 @@ It outputs embedding vectors of size 2048 for the 2 stacks version, and 3548 for
74
 
75
  | Model | Format | Resolution | Board | Execution Engine | Frequency | Inference time (ms) | STM32Cube.AI version |
76
  |-------------------|--------|------------|------------------|------------------|-------------|-----------------|-----------------------|
77
- | [MiniResNet 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnet/ST_pretrainedmodel_public_dataset/esc10/miniresnet_1stacks_64x50_tl/miniresnet_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 1 CPU | 160 MHz | 92.25 | 10.0.0 |
78
- | [MiniResNet 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnet/ST_pretrainedmodel_public_dataset/esc10/miniresnet_2stacks_64x50_tl/miniresnet_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 1 CPU | 160 MHz | 142.69 | 10.0.0 |
79
 
80
 
81
  ### Accuracy with ESC-10 dataset
@@ -95,5 +91,3 @@ The reason this metric is used instead of patch-level accuracy is because patch-
95
 
96
  Please refer to the stm32ai-modelzoo-services GitHub [here](https://github.com/STMicroelectronics/stm32ai-modelzoo-services)
97
 
98
-
99
-
 
 
 
 
 
1
  # Quantized miniresnet
2
 
3
  ## **Use case** : `AED`
 
52
 
53
  ## Metrics
54
 
 
55
 
56
+ Measures are done with default STM32Cube.AI configuration with enabled input / output allocated option.
57
+
58
 
59
  ### Reference MCU memory footprint based on ESC-10 dataset
60
 
61
 
62
  | Model | Format | Resolution | Series | Activation RAM (KiB) | Runtime RAM (KiB)| Weights Flash (KiB) | Code Flash (KiB) | Total RAM (KiB) | Total Flash (KiB)| STM32Cube.AI version |
63
  |-------------------|--------|------------|---------|----------------|-------------|---------------|------------|-------------|-------------|-----------------------|
64
+ | [MiniResNet 1stack ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnet/ST_pretrainedmodel_public_dataset/esc10/miniresnet_1stacks_64x50_tl/miniresnet_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 59.89 | 5.38 | 123.6 | 55.89 | 65.27 | 179.49 | 10.2.0 |
65
+ | [MiniResNet 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnet/ST_pretrainedmodel_public_dataset/esc10/miniresnet_2stacks_64x50_tl/miniresnet_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 59.89 | 8.37 | 431.1 | 62.68 | 68.26 | 493.78 | 10.2.0 |
66
 
67
 
68
  ### Reference inference time based on ESC-10 dataset
 
70
 
71
  | Model | Format | Resolution | Board | Execution Engine | Frequency | Inference time (ms) | STM32Cube.AI version |
72
  |-------------------|--------|------------|------------------|------------------|-------------|-----------------|-----------------------|
73
+ | [MiniResNet 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnet/ST_pretrainedmodel_public_dataset/esc10/miniresnet_1stacks_64x50_tl/miniresnet_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 1 CPU | 160 MHz | 91.47 | 10.2.0 |
74
+ | [MiniResNet 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnet/ST_pretrainedmodel_public_dataset/esc10/miniresnet_2stacks_64x50_tl/miniresnet_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 1 CPU | 160 MHz | 141.86 | 10.2.0 |
75
 
76
 
77
  ### Accuracy with ESC-10 dataset
 
91
 
92
  Please refer to the stm32ai-modelzoo-services GitHub [here](https://github.com/STMicroelectronics/stm32ai-modelzoo-services)
93