Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,88 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- English
|
4 |
+
tags:
|
5 |
+
- MusicGeneration
|
6 |
+
---
|
7 |
+
|
8 |
+
<!--Copyright 2020 The HuggingFace Team. All rights reserved.
|
9 |
+
|
10 |
+
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
|
11 |
+
the License. You may obtain a copy of the License at
|
12 |
+
|
13 |
+
http://www.apache.org/licenses/LICENSE-2.0
|
14 |
+
|
15 |
+
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
|
16 |
+
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
|
17 |
+
specific language governing permissions and limitations under the License.
|
18 |
+
-->
|
19 |
+
|
20 |
+
# Jukebox
|
21 |
+
|
22 |
+
## Overview
|
23 |
+
|
24 |
+
The Jukebox model was proposed in [Jukebox: A generative model for music](https://arxiv.org/pdf/2005.00341.pdf)
|
25 |
+
by Prafulla Dhariwal, Heewoo Jun, Christine Payne, Jong Wook Kim, Alec Radford,
|
26 |
+
Ilya Sutskever.
|
27 |
+
|
28 |
+
This model proposes a generative music model which can be produce minute long samples which can bne conditionned on
|
29 |
+
artist, genre and lyrics.
|
30 |
+
|
31 |
+
The abstract from the paper is the following:
|
32 |
+
|
33 |
+
We introduce Jukebox, a model that generates
|
34 |
+
music with singing in the raw audio domain. We
|
35 |
+
tackle the long context of raw audio using a multiscale VQ-VAE to compress it to discrete codes,
|
36 |
+
and modeling those using autoregressive Transformers. We show that the combined model at
|
37 |
+
scale can generate high-fidelity and diverse songs
|
38 |
+
with coherence up to multiple minutes. We can
|
39 |
+
condition on artist and genre to steer the musical
|
40 |
+
and vocal style, and on unaligned lyrics to make
|
41 |
+
the singing more controllable. We are releasing
|
42 |
+
thousands of non cherry-picked samples, along
|
43 |
+
with model weights and code.
|
44 |
+
|
45 |
+
Tips:
|
46 |
+
|
47 |
+
This model is very slow for now, and takes 18h to generate a minute long audio.
|
48 |
+
|
49 |
+
This model was contributed by [Arthur Zucker](https://huggingface.co/ArthurZ).
|
50 |
+
The original code can be found [here](https://github.com/openai/jukebox).
|
51 |
+
|
52 |
+
## JukeboxConfig
|
53 |
+
|
54 |
+
[[autodoc]] JukeboxConfig
|
55 |
+
|
56 |
+
## JukeboxTokenizer
|
57 |
+
|
58 |
+
[[autodoc]] JukeboxTokenizer - save_vocabulary
|
59 |
+
|
60 |
+
## JukeboxTokenizerFast
|
61 |
+
|
62 |
+
[[autodoc]] JukeboxTokenizerFast
|
63 |
+
|
64 |
+
## Jukebox specific outputs
|
65 |
+
|
66 |
+
[[autodoc]] models.jukebox.modeling_jukebox.JukeboxDoubleHeadsModelOutput
|
67 |
+
|
68 |
+
[[autodoc]] models.jukebox.modeling_tf_jukebox.TFJukeboxDoubleHeadsModelOutput
|
69 |
+
|
70 |
+
## JukeboxModel
|
71 |
+
|
72 |
+
[[autodoc]] JukeboxModel - forward - parallelize - deparallelize
|
73 |
+
|
74 |
+
## JukeboxLMHeadModel
|
75 |
+
|
76 |
+
[[autodoc]] JukeboxLMHeadModel - forward - parallelize - deparallelize
|
77 |
+
|
78 |
+
## JukeboxDoubleHeadsModel
|
79 |
+
|
80 |
+
[[autodoc]] JukeboxDoubleHeadsModel - forward
|
81 |
+
|
82 |
+
## JukeboxForSequenceClassification
|
83 |
+
|
84 |
+
[[autodoc]] JukeboxForSequenceClassification - forward
|
85 |
+
|
86 |
+
## JukeboxForTokenClassification
|
87 |
+
|
88 |
+
[[autodoc]] JukeboxForTokenClassification - forward
|