Spaces:

ttsds
/

README

Running

App Files Files Community

cdminix commited on Jul 19, 2024

Commit

f8d5c98

verified ·

1 Parent(s): 3f277ef

Update README.md

Browse files

Files changed (1) hide show

README.md +17 -26

README.md CHANGED Viewed

@@ -20,29 +20,20 @@ More details can be found in our paper [*TTSDS -- Text-to-Speech Distribution Sc
 ## Reproducibility
 To reproduce our results, check out our repository [here](https://github.com/ttsds/ttsds).
-## Credits
-This benchmark is inspired by [TTS Arena](https://huggingface.co/spaces/TTS-AGI/TTS-Arena) which instead focuses on the subjective evaluation of TTS models.
-Our benchmark would not be possible without the many open-source TTS models on Hugging Face and GitHub.
-Additionally, our benchmark uses the following datasets:
-- [LJSpeech](https://keithito.com/LJ-Speech-Dataset/h)
-- [LibriTTS](https://www.openslr.org/60/)
-- [VCTK](https://datashare.ed.ac.uk/handle/10283/2950)
-- [Common Voice](https://commonvoice.mozilla.org/)
-- [ESC-50](https://github.com/karolpiczak/ESC-50)
-And the following metrics/representations/tools:
-- [Wav2Vec2](https://arxiv.org/abs/2006.11477)
-- [Hubert](https://arxiv.org/abs/2006.11477)
-- [WavLM](https://arxiv.org/abs/2110.13900)
-- [PESQ](https://en.wikipedia.org/wiki/Perceptual_Evaluation_of_Speech_Quality)
-- [VoiceFixer](https://arxiv.org/abs/2204.05841)
-- [WADA SNR](https://www.cs.cmu.edu/~robust/Papers/KimSternIS08.pdf)
-- [Whisper](https://arxiv.org/abs/2212.04356)
-- [Masked Prosody Model](https://huggingface.co/cdminix/masked_prosody_model)
-- [PyWorld](https://github.com/JeremyCCHsu/Python-Wrapper-for-World-Vocoder)
-- [WeSpeaker](https://arxiv.org/abs/2210.17016)
-- [D-Vector](https://github.com/yistLin/dvector)
-Authors: Christoph Minixhofer, Ondřej Klejch, and Peter Bell
-of the University of Edinburgh.

 ## Reproducibility
 To reproduce our results, check out our repository [here](https://github.com/ttsds/ttsds).
+Authors:
+Christoph Minixhofer, Ondřej Klejch, and Peter Bell
+The University of Edinburgh.
+## Citation
+```
+@misc{minixhofer2024ttsdstexttospeechdistribution,
+      title={TTSDS -- Text-to-Speech Distribution Score},
+      author={Christoph Minixhofer and Ondřej Klejch and Peter Bell},
+      year={2024},
+      eprint={2407.12707},
+      archivePrefix={arXiv},
+      primaryClass={eess.AS},
+      url={https://arxiv.org/abs/2407.12707},
+}
+```