Any training details?
It's good to see a new HuBERT model trained on Korean. Could you share the training details (e.g., hyperparameters, dataset) for this model?
Sorry for the late reply.
The model card has now been updated, so please check it.
I will close the issue.
Thanks for the reply.
Could you explain more about which dataset was used to train the k-means model for the second training iteration?
For example, in the HuBERT paper the authors used a subset of LibriSpeech.
Similarly, we randomly sampled 100 hours from the combination of datasets we used and trained the k-means model on that subset.
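To make that step concrete, below is a minimal sketch of how a k-means target model could be fit on a ~100-hour random subset. This is not the actual pipeline used here: the checkpoint name, feature layer, cluster count, and data paths are placeholders, and it uses the Hugging Face `HubertModel` plus scikit-learn's `MiniBatchKMeans` rather than the original fairseq scripts.

```python
# Sketch: fit k-means targets on a ~100 h random subset of the training audio.
# All names marked "hypothetical" are illustrative, not from this repo.
import glob
import random

import numpy as np
import soundfile as sf
import torch
from sklearn.cluster import MiniBatchKMeans
from transformers import HubertModel, Wav2Vec2FeatureExtractor

SAMPLE_RATE = 16_000   # assumes 16 kHz mono audio
TARGET_HOURS = 100     # ~100 h subset, as mentioned above
FEATURE_LAYER = 6      # hypothetical: intermediate transformer layer
N_CLUSTERS = 500       # hypothetical cluster count

model = HubertModel.from_pretrained("facebook/hubert-base-ls960").eval()
extractor = Wav2Vec2FeatureExtractor.from_pretrained("facebook/hubert-base-ls960")


def sample_subset(wav_paths, target_hours):
    """Randomly pick files until roughly `target_hours` of audio is collected."""
    random.shuffle(wav_paths)
    picked, seconds = [], 0.0
    for path in wav_paths:
        info = sf.info(path)
        picked.append(path)
        seconds += info.frames / info.samplerate
        if seconds >= target_hours * 3600:
            break
    return picked


@torch.no_grad()
def extract_features(path):
    """Frame-level hidden states from one intermediate layer of the model."""
    wav, _ = sf.read(path)
    inputs = extractor(wav, sampling_rate=SAMPLE_RATE, return_tensors="pt")
    hidden = model(**inputs, output_hidden_states=True).hidden_states[FEATURE_LAYER]
    return hidden.squeeze(0).cpu().numpy()


# Fit k-means incrementally so the full 100 h of features never sit in memory.
kmeans = MiniBatchKMeans(n_clusters=N_CLUSTERS, batch_size=10_000, n_init="auto")
all_wavs = glob.glob("data/**/*.wav", recursive=True)  # hypothetical data root
for path in sample_subset(all_wavs, TARGET_HOURS):
    kmeans.partial_fit(extract_features(path))
np.save("kmeans_centroids.npy", kmeans.cluster_centers_)
```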
@hyunwoo3235 Thanks for sharing. What were the final loss and top-k accuracy at the end of training, for both the base and large models?
The final losses for the base and large models were 1.995 and 2.25, respectively.
Unfortunately, the model was not evaluated further, as it was trained as a validation step for another project, so there are no other performance metrics.
Thanks!