Audio Classification
FBAGSTM committed on
Commit dd6ede8 · verified · 1 Parent(s): 3c2d375

Update README.md

Files changed (1)
  1. README.md +8 -8
README.md CHANGED
@@ -66,8 +66,8 @@ Measures are done with default STM32Cube.AI configuration with enabled input / o

  | Model | Format | Resolution | Series | Activation RAM (KiB) | Runtime RAM (KiB) | Weights Flash (KiB) | Code Flash (KiB) | Total RAM (KiB) | Total Flash (kB) | STM32Cube.AI version |
  |-------------------|--------|------------|---------|----------------|-------------|---------------|------------|-------------|-------------|-----------------------|
- | [miniresnet v2 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 59.89 | 7.09 | 123.98 | 61.57 | 66.98 | 185.55 | 10.0.0 |
- | [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 59.89 | 11.28 | 431.98 | 69.86 | 71.17 | 501.84 | 10.0.0 |
+ | [miniresnet v2 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 59.89 | 7.09 | 123.98 | 61.57 | 66.98 | 185.55 | 10.0.0 |
+ | [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 59.89 | 11.28 | 431.98 | 69.86 | 71.17 | 501.84 | 10.0.0 |


  ### Reference inference time based on ESC-10 dataset
@@ -75,8 +75,8 @@ Measures are done with default STM32Cube.AI configuration with enabled input / o

  | Model | Format | Resolution | Board | Execution Engine | Frequency | Inference time (ms) | STM32Cube.AI version |
  |-------------------|--------|------------|------------------|------------------|--------------|-------|-----------------------|
- | [miniresnet v2 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 1 CPU | 160 | 188.36 | 10.0.0 |
- | [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 1 CPU | 160 | 308.81 | 10.0.0 |
+ | [miniresnet v2 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 1 CPU | 160 | 188.36 | 10.0.0 |
+ | [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A | 1 CPU | 160 | 308.81 | 10.0.0 |


  ### Accuracy with ESC-10 dataset
@@ -87,10 +87,10 @@ The reason this metric is used instead of patch-level accuracy is because patch-

  | Model | Format | Resolution | Clip-level Accuracy |
  |-------|--------|------------|----------------|
- | [miniresnet v2 1stack ](https://github.com/STMicroelectronics/stm32ai-modelzoo/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl.h5) | float32 | 64x50x1 | 91.1% |
- | [miniresnet v2 1stack ](https://github.com/STMicroelectronics/stm32ai-modelzoo/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | 91.1% |
- | [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl.h5) | float32 | 64x50x1 | 92.4% |
- | [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | 92.6% |
+ | [miniresnet v2 1stack ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl.h5) | float32 | 64x50x1 | 91.1% |
+ | [miniresnet v2 1stack ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | 91.1% |
+ | [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl.h5) | float32 | 64x50x1 | 92.4% |
+ | [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | 92.6% |