Update README.md
Browse files
README.md
CHANGED
@@ -144,6 +144,18 @@ The Low Frame-rate Speech Codec is trained on a total of 28.7k hrs of speech dat
|
|
144 |
|
145 |
- Properties: To assess our models' performance on studio-quality audio, we utilized the F10 and M10 speakers from the DAPS Clear dataset. These speakers were also employed in the evaluation of the [DAC model](https://arxiv.org/abs/2306.06546).
|
146 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
147 |
## Software Integration
|
148 |
|
149 |
### Supported Hardware Microarchitecture Compatibility:
|
|
|
144 |
|
145 |
- Properties: To assess our models' performance on studio-quality audio, we utilized the F10 and M10 speakers from the DAPS Clear dataset. These speakers were also employed in the evaluation of the [DAC model](https://arxiv.org/abs/2306.06546).
|
146 |
|
147 |
+
|
148 |
+
## Performance
|
149 |
+
|
150 |
+
We evaluated our codec using multiple objective audio quality metrics across two distinct test sets. Additionally, we compared our model's performance with state-of-the-art codecs. For further details, please refer to[our paper](https://arxiv.org/abs/2409.12117).
|
151 |
+
|
152 |
+
| Dataset | Squim MOS (↑) |SI-SDR(↑) |Mel Dist. (↓) |STFT Dist.(↓) |CER (↓)|
|
153 |
+
|:-----------:|:----------:|:----------:|:----------:|:-----------:|:-----------:|
|
154 |
+
| MLS | 4.43 | 4.46 | 0.147 | 0.061 | 2.09 |
|
155 |
+
| DAPS | 4.68 | 6.93 | 0.142 | 0.058 | 0.86 |
|
156 |
+
|
157 |
+
|
158 |
+
|
159 |
## Software Integration
|
160 |
|
161 |
### Supported Hardware Microarchitecture Compatibility:
|