|
--- |
|
language: |
|
- la |
|
license: agpl-3.0 |
|
tags: |
|
- robust-speech-event |
|
- hf-asr-leaderboard |
|
datasets: |
|
- lsb/poetaexmachina-mp3-recitations |
|
metrics: |
|
- wer |
|
model-index: |
|
- name: wav2vec2-base-it-latin |
|
results: |
|
- task: |
|
type: automatic-speech-recognition |
|
name: Speech Recognition |
|
dataset: |
|
type: lsb/poetaexmachina-mp3-recitations |
|
name: Poeta Ex Machina mp3 recitations |
|
metrics: |
|
- type: wer |
|
value: 0.398 |
|
name: Test WER |
|
--- |
|
--- |
|
|
|
# wav2vec2-base-it-latin |
|
|
|
This model is a fine-tuned version of [wav2vec2-base-it-voxpopuli](https://huggingface.co/facebook/wav2vec2-base-it-voxpopuli) |
|
|
|
The dataset used is the [poetaexmachina-mp3-recitations](https://github.com/lsb/poetaexmachina-mp3-recitations), |
|
all of the 2-series texts (vergil) and every tenth 1-series text (words from Poeta Ex Machina's [database](https://github.com/lsb/poetaexmachina/blob/master/merged-scansions.db) of words with scansions). |
|
|
|
It achieves the following [results](https://github.com/lsb/tironiculum/blame/trunk/wav2vec2%20base%20it%20latin.ipynb#L1234) on the evaluation set: |
|
|
|
- Loss: 0.1943 |
|
- WER: 0.398 |
|
|