added info about new archive formats & prompts
README.md
CHANGED
@@ -7,21 +7,35 @@ On 2024-11-26, temporary access to OpenAI's video generation model Sora (turbo)
|
|
7 |
https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora
|
8 |
|
9 |
After a few hours, OpenAI invalidated the API key used by the repo and removed access to the generated videos.
|
10 |
-
This release consists of 87 videos (~702 MB) and 83 corresponding prompts that were archived, from the publicly displayed generations, in anticipation of that event.
|
11 |
-
|
12 |
Not all videos generated were able to be archived, due to HF load issues. All videos are of MIME type video/mp4 and have a framerate of 30 FPS.
|
13 |
-
The generation
|
14 |
|
15 |
The Sora backend that was used for generation was the following:
|
16 |
`https://sora.openai.com/backend/video_gen`
|
17 |
|
18 |
-
|
19 |
-
|
20 |
-
|
21 |
The "long" prompt threshold was based on the filename length limit (around 255 B) on Windows/macOS/Linux.
|
22 |
All short prompts are used as filenames in the "short_prompts" directory.
|
23 |
The ten longer prompts in "full_long_prompts.txt" were used for the videos in the "long_prompts" directory.
|
24 |
|
25 |
~ desuAnon
|
26 |
|
27 |
https://rentry.org/desuAnon
|
@@ -39,10 +53,6 @@ https://rentry.org/desuAnon
|
|
39 |
|
40 |
[PUBLIC DOMAIN: CC0 1.0 Universal](https://creativecommons.org/publicdomain/zero/1.0/)
|
41 |
|
42 |
-
![img](https://licensebuttons.net/p/zero/1.0/88x31.png)
|
43 |
-
|
44 |
-
---
|
45 |
-
|
46 |
*This public release of content produced by generative ML is intended for educational, artistic, and research purposes.*
|
47 |
*Sora is a pending trademark of OpenAI, Inc, and is used for descriptive purposes only.*
|
48 |
*The original videos were watermarked by OpenAI to reflect the origin of the generated content.*
|
|
|
7 |
https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora
|
8 |
|
9 |
After a few hours, OpenAI invalidated the API key used by the repo and removed access to the generated videos.
|
10 |
+
This release consists of 87 videos (~702 MB) and 83 corresponding prompts that were archived, from the publicly displayed generations, in anticipation of that event.
|
|
|
11 |
Not all videos generated were able to be archived, due to HF load issues. All videos are of MIME type video/mp4 and have a framerate of 30 FPS.
|
12 |
+
The generation parameters may be found in the app.py of the original repo [here](https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora/blob/main/app.py); an archive of this file may be found [here](https://archive.is/r70Ao).
|
13 |
|
14 |
The Sora backend that was used for generation was the following:
|
15 |
`https://sora.openai.com/backend/video_gen`
|
16 |
|
17 |
+
Please note that user prompts are often "augmented" (changed by some LLM) before videos are generated, so the prompts listed may not be the exact ones used by the model.
|
18 |
+
The prompts used for four videos are not known, and these are denoted as [unknown_n].
|
19 |
+
|
20 |
+
---
|
21 |
+
### Archive versions
|
22 |
+
|
23 |
+
|
24 |
+
**sora-turbo-vids.zip**
|
25 |
+
This was the original upload, and had some encoding/compatibility issues for some users.
|
26 |
+
The "short" video filenames are the full original prompts used for the API request for each video.
|
27 |
The "long" prompt threshold was based on the filename length limit (around 255 B) on Windows/macOS/Linux.
|
28 |
All short prompts are used as filenames in the "short_prompts" directory.
|
29 |
The ten longer prompts in "full_long_prompts.txt" were used for the videos in the "long_prompts" directory.
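
The short/long split described above can be illustrated with a byte-length check. This is only a sketch: the exact threshold, the use of UTF-8 byte counts, and whether the `.mp4` extension was counted are assumptions, and the function names are hypothetical. Note also that Windows measures filename length in UTF-16 code units rather than bytes, so the check is approximate there.

```python
FILENAME_LIMIT_BYTES = 255  # "around 255 B" per the text; the exact value is an assumption


def fits_as_filename(prompt, extension=".mp4", limit=FILENAME_LIMIT_BYTES):
    """Return True if prompt + extension fits in `limit` bytes when UTF-8 encoded."""
    return len((prompt + extension).encode("utf-8")) <= limit


def classify_prompt(prompt):
    """Label a prompt 'short' (usable as a filename) or 'long'."""
    return "short" if fits_as_filename(prompt) else "long"


print(classify_prompt("a red fox running through snow"))
print(classify_prompt("x" * 300))
```

Counting encoded bytes rather than characters matters for prompts with non-ASCII text, where one character can take several bytes.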
|
30 |
|
31 |
+
|
32 |
+
**videos_only.zip** and **videos_only.7z**
|
33 |
+
These identical archives (in different compression formats) contain only the original videos, with names such as `video_24.mp4`.
|
34 |
+
The `video_24` part is the video ID, and the prompt used for a specific video ID is listed in the separate CSV and JSONL files (video_id, prompt).
|
35 |
+
Both files can be viewed in a text editor and are easy to import and process in most programming languages.
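
As a rough illustration of looking up the prompt for a given video, the sketch below reads (video_id, prompt) pairs from CSV or JSONL data. It assumes the CSV has a header row named `video_id,prompt` and that each JSONL line is an object with those two keys; the sample contents and the function names are made up for the example, and in-memory strings stand in for the real files.

```python
import csv
import io
import json
from pathlib import Path


def load_prompts_csv(fileobj):
    """Read (video_id, prompt) rows from a CSV file object into a dict."""
    reader = csv.DictReader(fileobj)  # assumes a "video_id,prompt" header row
    return {row["video_id"]: row["prompt"] for row in reader}


def load_prompts_jsonl(fileobj):
    """Read one JSON object per line, each with video_id and prompt keys."""
    prompts = {}
    for line in fileobj:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        prompts[record["video_id"]] = record["prompt"]
    return prompts


def video_id_from_filename(name):
    """Return the file name without its extension, e.g. the ID part of video_24.mp4."""
    return Path(name).stem


# Tiny in-memory stand-ins for the real files (contents are made up).
sample_csv = io.StringIO('video_id,prompt\nvideo_24,"a cat, sleeping"\n')
sample_jsonl = io.StringIO('{"video_id": "video_24", "prompt": "a cat, sleeping"}\n')

csv_prompts = load_prompts_csv(sample_csv)
jsonl_prompts = load_prompts_jsonl(sample_jsonl)
print(csv_prompts[video_id_from_filename("video_24.mp4")])
```

With the real files, the same functions can be given a file opened with `open(path, newline="", encoding="utf-8")` in place of the `io.StringIO` objects.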
|
36 |
+
|
37 |
+
---
|
38 |
+
|
39 |
~ desuAnon
|
40 |
|
41 |
https://rentry.org/desuAnon
|
|
|
53 |
|
54 |
[PUBLIC DOMAIN: CC0 1.0 Universal](https://creativecommons.org/publicdomain/zero/1.0/)
|
55 |
|
56 |
*This public release of content produced by generative ML is intended for educational, artistic, and research purposes.*
|
57 |
*Sora is a pending trademark of OpenAI, Inc, and is used for descriptive purposes only.*
|
58 |
*The original videos were watermarked by OpenAI to reflect the origin of the generated content.*
|