Release AI-ModelZoo-4.0.0
Browse files
README.md
CHANGED
|
@@ -4,13 +4,13 @@ license_name: sla0044
|
|
| 4 |
license_link: >-
|
| 5 |
https://github.com/STMicroelectronics/stm32ai-modelzoo/raw/refs/heads/main/human_activity_recognition/LICENSE.md
|
| 6 |
---
|
| 7 |
-
#
|
| 8 |
|
| 9 |
## **Use case** : `Human activity recognition`
|
| 10 |
|
| 11 |
# Model description
|
| 12 |
|
| 13 |
-
GMP is an acronym for Global Max Pooling. It is a convolutional neural network (CNN) based model that uses Global Max Pooling before feeding the data to the fully-connected (Dense) layer for performing the human activity recognition (HAR) task based on the accelerometer data. It uses the 3D raw data with gravity rotation and supression filter as preprocessing. This is a very light model with very small foot prints in terms of FLASH and RAM as well as computational requirements.
|
| 14 |
|
| 15 |
This network supports any input size greater than (3 x 3 x 1) but we recommend to use at least (24 x 3 x 1), i.e. a window length of 24 samples. In this folder we provide GMP models trained with two different window lenghts [24 and 48].
|
| 16 |
|
|
@@ -54,7 +54,7 @@ For a frame of resolution of (wl x 3) and P classes
|
|
| 54 |
|
| 55 |
## Metrics
|
| 56 |
|
| 57 |
-
Measures are done with default
|
| 58 |
|
| 59 |
The inference time is reported is calculated on STM32 board **B-U585I-IOT02A** running at Frequency of **160 MHz**.
|
| 60 |
|
|
@@ -62,10 +62,10 @@ The inference time is reported is calculated on STM32 board **B-U585I-IOT02A** r
|
|
| 62 |
### Reference memory footprint based on WISDM dataset (see Accuracy for details on dataset)
|
| 63 |
|
| 64 |
|
| 65 |
-
| Model | Format | Input Shape | Target Board | Activation RAM (KiB) | Runtime RAM (KiB) | Weights Flash (KiB) | Code Flash (KiB) | Total RAM (KiB) | Total Flash (KiB) | Inference Time (ms) |
|
| 66 |
|:----------------------------------------------------------------------------:|:------:|:-----------:|:-------:|:--------------------:|:-----------------:|:-------------------:|:----------------:|:-----------------:|:-----------------:|:---------------------:|:---------------------:|
|
| 67 |
-
| [
|
| 68 |
-
| [
|
| 69 |
|
| 70 |
|
| 71 |
|
|
@@ -77,13 +77,13 @@ Dataset details: A custom dataset and not publically available, Number of classe
|
|
| 77 |
|
| 78 |
| Model | Format | Resolution | Accuracy (%) |
|
| 79 |
|:----------------------------------------------------------------------------------------------:|:--------:|:----------:|:-------------:|
|
| 80 |
-
| [
|
| 81 |
-
| [
|
| 82 |
|
| 83 |
|
| 84 |
-
Confusion matrix for
|
| 85 |
|
| 86 |
-
)
|
|
| 92 |
|
| 93 |
| Model | Format | Resolution | Accuracy (%) |
|
| 94 |
|:--------------------------------------------------------------------------------------:|:--------:|:-----------:|:--------------:|
|
| 95 |
-
| [
|
| 96 |
-
| [
|
| 97 |
|
| 98 |
|
| 99 |
## Retraining and Integration in a simple example:
|
|
|
|
| 4 |
license_link: >-
|
| 5 |
https://github.com/STMicroelectronics/stm32ai-modelzoo/raw/refs/heads/main/human_activity_recognition/LICENSE.md
|
| 6 |
---
|
| 7 |
+
# ST_GMP HAR model
|
| 8 |
|
| 9 |
## **Use case** : `Human activity recognition`
|
| 10 |
|
| 11 |
# Model description
|
| 12 |
|
| 13 |
+
GMP is an acronym for Global Max Pooling. It is a convolutional neural network (CNN) based model that uses Global Max Pooling before feeding the data to the fully-connected (Dense) layer for performing the human activity recognition (HAR) task based on the accelerometer data. Prefix `st_` denotes it is a variation of the model built by STMicroelectronics. It uses the 3D raw data with gravity rotation and supression filter as preprocessing. This is a very light model with very small foot prints in terms of FLASH and RAM as well as computational requirements.
|
| 14 |
|
| 15 |
This network supports any input size greater than (3 x 3 x 1) but we recommend to use at least (24 x 3 x 1), i.e. a window length of 24 samples. In this folder we provide GMP models trained with two different window lenghts [24 and 48].
|
| 16 |
|
|
|
|
| 54 |
|
| 55 |
## Metrics
|
| 56 |
|
| 57 |
+
Measures are done with default STEdge AI Dev Cloud version 3.0.0 and for target board B-U585I-IOT02A. In addition the configuration were enabled input / output allocated option and `balanced` as optimization choice.
|
| 58 |
|
| 59 |
The inference time is reported is calculated on STM32 board **B-U585I-IOT02A** running at Frequency of **160 MHz**.
|
| 60 |
|
|
|
|
| 62 |
### Reference memory footprint based on WISDM dataset (see Accuracy for details on dataset)
|
| 63 |
|
| 64 |
|
| 65 |
+
| Model | Format | Input Shape | Target Board | Activation RAM (KiB) | Runtime RAM (KiB) | Weights Flash (KiB) | Code Flash (KiB) | Total RAM (KiB) | Total Flash (KiB) | Inference Time (ms) | STEdge AI Core version |
|
| 66 |
|:----------------------------------------------------------------------------:|:------:|:-----------:|:-------:|:--------------------:|:-----------------:|:-------------------:|:----------------:|:-----------------:|:-----------------:|:---------------------:|:---------------------:|
|
| 67 |
+
| [st_gpm_wl_24](ST_pretrainedmodel_public_dataset/WISDM/st_gmp_wl_24/st_gmp_wl_24.keras) | FLOAT32| 24 x 3 x 1 | B-U585I-IOT02A | 4.25 | 0.28 | 5.70 | 6.08 | 4.53 | 11.78 | 4.29 | 3.0.0 |
|
| 68 |
+
| [st_gmp_wl_48](ST_pretrainedmodel_public_dataset/WISDM/st_gmp_wl_48/st_gmp_wl_48.keras) | FLOAT32| 48 x 3 x 1 | B-U585I-IOT02A | 8.83 | 0.28 | 5.70 | 6.08 | 9.11 | 11.78 | 8.83 | 3.0.0 |
|
| 69 |
|
| 70 |
|
| 71 |
|
|
|
|
| 77 |
|
| 78 |
| Model | Format | Resolution | Accuracy (%) |
|
| 79 |
|:----------------------------------------------------------------------------------------------:|:--------:|:----------:|:-------------:|
|
| 80 |
+
| [st_gmp_wl_24](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/human_activity_recognition/st_gmp/ST_pretrainedmodel_custom_dataset/mobility_v1/st_gmp_wl_24/st_gmp_wl_24.keras) | FLOAT32 | 24 x 3 x 1 | 93.93 |
|
| 81 |
+
| [st_gmp_wl_48](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/human_activity_recognition/st_gmp/ST_pretrainedmodel_custom_dataset/mobility_v1/st_gmp_wl_48/st_gmp_wl_48.keras) | FLOAT32 | 48 x 3 x 1 | 93.71 |
|
| 82 |
|
| 83 |
|
| 84 |
+
Confusion matrix for st_gmp_wl_24 with Float32 weights for mobility_v1 dataset is given below.
|
| 85 |
|
| 86 |
+

|
| 87 |
|
| 88 |
### Accuracy with WISDM dataset
|
| 89 |
|
|
|
|
| 92 |
|
| 93 |
| Model | Format | Resolution | Accuracy (%) |
|
| 94 |
|:--------------------------------------------------------------------------------------:|:--------:|:-----------:|:--------------:|
|
| 95 |
+
| [st_gmp_wl_24](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/human_activity_recognition/st_gmp/ST_pretrainedmodel_public_dataset/WISDM/st_gmp_wl_24/st_gmp_wl_24.keras) | FLOAT32 | 24 x 3 x 1 | 83.54 |
|
| 96 |
+
| [st_gmp_wl_48](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/human_activity_recognition/st_gmp/ST_pretrainedmodel_public_dataset/WISDM/st_gmp_wl_48/st_gmp_wl_48.keras) | FLOAT32 | 48 x 3 x 1 | 86.59 |
|
| 97 |
|
| 98 |
|
| 99 |
## Retraining and Integration in a simple example:
|