nu-wave-x2

Model description

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

This model was trained by contributor Frederico S. Oliveira, who graciously provided the checkpoint in the original author's GitHub repo.

This model was trained using source code written by Junhyeok Lee and Seungu Han under the BSD 3.0 License. All credit goes to them for this work.

This model takes in audio at 24kHz and upsamples it to 48kHz.

Intended uses & limitations

How to use

You can try out this model here: Open In Colab

Limitations and bias

Provide examples of latent issues and potential remediations.

Training data

Describe the data you used to train the model. If you initialized it with pre-trained weights, add a link to the pre-trained model card or repository with description of the pre-training data.

Training procedure

Preprocessing, hardware used, hyperparameters...

Eval results

You can check out the authors' results at their project page. The project page contains many samples of upsampled audio from the authors' models.

BibTeX entry and citation info

@inproceedings{lee21nuwave,
  author={Junhyeok Lee and Seungu Han},
  title={{NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling}},
  year=2021,
  booktitle={Proc. Interspeech 2021},
  pages={1634--1638},
  doi={10.21437/Interspeech.2021-36}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Examples
Inference API (serverless) does not yet support pytorch-lightning models for this pipeline type.

Dataset used to train nateraw/nu-wave-x2