STMicroelectronics
/

stft_tcnn

Model card Files Files and versions

xet

Community

FBAGSTM commited on Jan 23

Commit

8b5c9e5

verified ·

1 Parent(s): 0b6b66d

Release AI-ModelZoo-4.0.0

Browse files

Files changed (1) hide show

README.md +15 -10

README.md CHANGED Viewed

@@ -1,3 +1,9 @@
 # STFT-TCNN
 ## **Use case** : `speech enhancement`
@@ -53,9 +59,9 @@ We also provide the original .yaml config file used to train the model. For deta
 Measures are done with default STEDGEAI configuration with enabled input / output allocated option.
 ### Reference **NPU** memory footprint
-|Model      | Dataset       | Format   | Resolution | Series    | Internal RAM | External RAM | Weights Flash | STM32Cube.AI version | STEdgeAI Core version |
-|----------|------------------|--------|-------------|------------------|------------------|---------------------|-------|----------------------|-------------------------|
-| [STFT-TCNN Medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/ST_pretrainedmodel_public_dataset/valentini/stft_tcnn_medium_sigmoid_257x40_qdq_int8.onnx)  | valentini     | Int8     | 257x40  | STM32N6   |     100.09    |   0.0              |    1599.39       |       10.2.0        |     2.2.0   |
 ### Reference **NPU**  inference time
@@ -66,9 +72,9 @@ The figures listed in this table correspond to the version of ST Edge AI with th
 You can expect significant improvements once this issue is resolved.
-| Model  | Dataset          | Format | Resolution  | Board            | Execution Engine | Inference time (ms) | Inf / sec   | STM32Cube.AI version  |  STEdgeAI Core version |
-|--------|------------------|--------|-------------|------------------|------------------|---------------------|-------|----------------------|-------------------------|
-| [STFT-TCNN medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/ST_pretrainedmodel_public_dataset/valentini/stft_tcnn_medium_sigmoid_257x40_qdq_int8.onnx) | valentini     | Int8     | 257x40  | STM32N6570-DK   |   NPU/MCU      |       52.09        |    19.19      |       10.2.0        |     2.2.0   |
 ### Metrics on the Valentini dataset
@@ -83,8 +89,8 @@ We report five metrics :
 | Model | Format | Resolution | PESQ | STOI | SNR | SI-SNR | Waveform MSE |
 |-------|--------|------------|------|------|-----|--------|--------------|
-| [STFT-TCNN Medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/ST_pretrainedmodel_public_dataset/valentini/stft_tcnn_medium_sigmoid_257xsl_float.onnx) | float32 | 257x? | 2.480 | 0.931 | 18.190 | 18.104 | 1.136e-4 |
-| [STFT-TCNN Medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/ST_pretrainedmodel_public_dataset/valentini/stft_tcnn_medium_sigmoid_257xsl_qdq_int8.onnx) | int8 | 257x? | 2.372 | 0.932 | 18.190 | 18.100 | 1.109e-4 |
 ### Limitations
@@ -92,5 +98,4 @@ The models provided here typically have trouble denoising speech at SNRs beyond
 ## Retraining and Integration in a simple example:
-Please refer to the stm32ai-modelzoo-services GitHub [here](https://github.com/STMicroelectronics/stm32ai-modelzoo-services)

+---
+license: other
+license_name: sla0044
+license_link: >-
+  https://github.com/STMicroelectronics/stm32ai-modelzoo/blob/main/speech_enhancement/stft_tcnn/ST_pretrainedmodel_public_dataset/LICENSE.md
+---
 # STFT-TCNN
 ## **Use case** : `speech enhancement`
 Measures are done with default STEDGEAI configuration with enabled input / output allocated option.
 ### Reference **NPU** memory footprint
+|Model      | Dataset       | Format   | Resolution | Series    | Internal RAM | External RAM | Weights Flash | STEdgeAI Core version |
+|----------|------------------|--------|-------------|------------------|------------------|---------------------|-------|-------------------------|
+| [STFT-TCNN Medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/valentini/stft_tcnn_medium_sigmoid_257x40_qdq_int8.onnx)  | valentini     | Int8     | 257x40  | STM32N6   |     100.09    |   0.0              |    1578.39       |     3.0.0   |
 ### Reference **NPU**  inference time
 You can expect significant improvements once this issue is resolved.
+| Model  | Dataset          | Format | Resolution  | Board            | Execution Engine | Inference time (ms) | Inf / sec   |  STEdgeAI Core version |
+|--------|------------------|--------|-------------|------------------|------------------|---------------------|-------|------------------------|
+| [STFT-TCNN medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/valentini/stft_tcnn_medium_sigmoid_257x40_qdq_int8.onnx) | valentini     | Int8     | 257x40  | STM32N6570-DK   |   NPU/MCU      |       51.11       |    19.56      |     3.0.0   |
 ### Metrics on the Valentini dataset
 | Model | Format | Resolution | PESQ | STOI | SNR | SI-SNR | Waveform MSE |
 |-------|--------|------------|------|------|-----|--------|--------------|
+| [STFT-TCNN Medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/valentini/stft_tcnn_medium_sigmoid_257xsl_float.onnx) | float32 | 257x? | 2.480 | 0.932 | 18.190 | 18.104 | 1.136e-4 |
+| [STFT-TCNN Medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/valentini/stft_tcnn_medium_sigmoid_257xsl_qdq_int8.onnx) | int8 | 257x? | 2.372 | 0.932 | 18.190 | 18.100 | 1.109e-4 |
 ### Limitations
 ## Retraining and Integration in a simple example:
+Please refer to the stm32ai-modelzoo-services GitHub [here](https://github.com/STMicroelectronics/stm32ai-modelzoo-services)