NeMo
CasanovaE commited on
Commit
1d7c8f6
•
1 Parent(s): 2f151d0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md CHANGED
@@ -144,6 +144,18 @@ The Low Frame-rate Speech Codec is trained on a total of 28.7k hrs of speech dat
144
 
145
  - Properties: To assess our models' performance on studio-quality audio, we utilized the F10 and M10 speakers from the DAPS Clear dataset. These speakers were also employed in the evaluation of the [DAC model](https://arxiv.org/abs/2306.06546).
146
 
 
 
 
 
 
 
 
 
 
 
 
 
147
  ## Software Integration
148
 
149
  ### Supported Hardware Microarchitecture Compatibility:
 
144
 
145
  - Properties: To assess our models' performance on studio-quality audio, we utilized the F10 and M10 speakers from the DAPS Clear dataset. These speakers were also employed in the evaluation of the [DAC model](https://arxiv.org/abs/2306.06546).
146
 
147
+
148
+ ## Performance
149
+
150
+ We evaluated our codec using multiple objective audio quality metrics across two distinct test sets. Additionally, we compared our model's performance with state-of-the-art codecs. For further details, please refer to[our paper](https://arxiv.org/abs/2409.12117).
151
+
152
+ | Dataset | Squim MOS (↑) |SI-SDR(↑) |Mel Dist. (↓) |STFT Dist.(↓) |CER (↓)|
153
+ |:-----------:|:----------:|:----------:|:----------:|:-----------:|:-----------:|
154
+ | MLS | 4.43 | 4.46 | 0.147 | 0.061 | 2.09 |
155
+ | DAPS | 4.68 | 6.93 | 0.142 | 0.058 | 0.86 |
156
+
157
+
158
+
159
  ## Software Integration
160
 
161
  ### Supported Hardware Microarchitecture Compatibility: