mah92/hifigan_for_sherpa

	---
	license: cc0-1.0
	---

	# In the name of God

	1) Converted the universal v1, v2 and v3 22050 Hz HiFi-GAN models from [the jik876/hifi-gan repository](https://github.com/jik876/hifi-gan) and [this Google Drive folder](https://drive.google.com/drive/folders/1-eEYTB5Av9jNql0WGBlRoi-WH2J7bp5Y)

	2) Converted to ONNX using [the icefall `export_onnx_hifigan.py` script](https://github.com/k2-fsa/icefall/blob/master/egs/ljspeech/TTS/matcha/export_onnx_hifigan.py)

	3) Added the following metadata using the same Python script (example shown for v1):
	{'model_type': 'hifigan', 'model_filename': 'hifigan_universal_22050hz_v1', 'sample_rate': 22050, 'version': 1, 'model_author': 'jik876', 'maintainer': 'k2-fsa', 'dataset': 'Universal', 'url1': 'https://github.com/jik876/hifi-gan', 'url2': ''}
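
The metadata step can be sketched roughly as follows. This is a minimal illustration, not the export script itself: `build_metadata` and `add_meta_data` are names chosen here, and the filename pattern is an assumption inferred from the single example above. ONNX stores metadata values as strings, so the integer fields are converted before being written.

```python
from typing import Dict


def build_metadata(version: int, dataset: str = "Universal",
                   sample_rate: int = 22050) -> Dict[str, str]:
    """Build the metadata fields shown above, with every value as a string."""
    fields = {
        "model_type": "hifigan",
        # Assumed naming pattern, inferred from the v1 example above.
        "model_filename": f"hifigan_{dataset.lower()}_{sample_rate}hz_v{version}",
        "sample_rate": sample_rate,
        "version": version,
        "model_author": "jik876",
        "maintainer": "k2-fsa",
        "dataset": dataset,
        "url1": "https://github.com/jik876/hifi-gan",
        "url2": "",
    }
    return {key: str(value) for key, value in fields.items()}


def add_meta_data(onnx_path: str, meta_data: Dict[str, str]) -> None:
    """Replace the metadata of the ONNX file at onnx_path with meta_data."""
    import onnx  # imported lazily so build_metadata works without onnx installed

    model = onnx.load(onnx_path)
    model.ClearField("metadata_props")  # drop any existing entries
    for key, value in meta_data.items():
        entry = model.metadata_props.add()
        entry.key = key
        entry.value = value
    onnx.save(model, onnx_path)
```

For example, `add_meta_data("hifigan_v1.onnx", build_metadata(1))` would write the v1 fields into a file with that (hypothetical) name. The stored values can be read back with ONNX Runtime through `InferenceSession(path).get_modelmeta().custom_metadata_map`.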

	Note 1: HiFi-GAN v2 is much smaller and faster than v1 and needs less memory.

	Note 2: Because LJ Speech is a single-speaker dataset with a female voice, HiFi-GAN models trained on it do not work well on male voices. The VCTK models work well for both genders.

	Conclusion: Use vctk_v2!