classla
/

wav2vec2-xls-r-parlaspeech-hr

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

wav2vec2-xls-r-parlaspeech-hr / README.md

nljubesi's picture

Update README.md

57c08e8 almost 3 years ago

|

1.09 kB

	---
	language: hr
	datasets:
	- parlaspeech
	tags:
	- audio
	- automatic-speech-recognition
	widget:
	- example_title: example 1
	src: https://huggingface.co/classla/wav2vec2-xls-r-sabor-hr/raw/main/00020570a.flac.wav
	- example_title: example 2
	src: https://huggingface.co/classla/wav2vec2-xls-r-sabor-hr/raw/main/00020578b.flac.wav

	---

	# wav2vec2-xls-r-sabor-hr

	This model is based on the [facebook/wav2vec2-xls-r-300m model](https://huggingface.co/facebook/wav2vec2-xls-r-300m) and was fine-tuned over 72 hours of recordings and transcripts from the Croatian parliament. These transcripts are an early result of the second iteration of the [ParlaMint project](https://www.clarin.eu/content/parlamint-towards-comparable-parliamentary-corpora) and will be extended and published under a permissive license.

	These efforts were coordinated by Nikola Ljubešić, the manual data alignment was performed by Ivo-Pavao Jazbec, the method from [Plüss et al](https://arxiv.org/abs/2010.02810) was applied by Vuk Batanović and Lenka Bajčetić, while the final modelling was performed by Peter Rupnik.