desuAnon commited on
Commit
2d6682d
·
verified ·
1 Parent(s): 74ad958

added info about new archive formats & prompts

Browse files
Files changed (1) hide show
  1. README.md +20 -10
README.md CHANGED
@@ -7,21 +7,35 @@ On 2024-11-26, temporary access to OpenAI's video generation model Sora (turbo)
7
  https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora
8
 
9
  After a few hours, OpenAI invalidated the API key used by the repo and removed access to the generated videos.
10
- This release consists of 87 videos (~702 MB) and 83 corresponding prompts that were archived, from the publicly displayed generations, in anticipation of that event.
11
-
12
  Not all videos generated were able to be archived, due to HF load issues. All videos are of MIME type video/mp4 and have a framerate of 30 FPS.
13
- The generation params may be found in the app.py of the original repo [here](https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora/blob/main/app.py); an archive of this file may be found [here](https://archive.is/r70Ao).
14
 
15
  The Sora backend that was used for generation was the following:
16
  `https://sora.openai.com/backend/video_gen`
17
 
18
- The "short" video filenames are the full original prompts used for the API request for each video.
19
- User prompts are often "augmented" (by some LLM) before generating videos, so it may not be the exact prompt used.
20
- The prompt used for four videos is not known, and these are denoted as [unknown_n].
 
 
 
 
 
 
 
21
  A "long" prompt limit was based off the filename length limit (around 255 B) for Windows/macOS/Linux.
22
  All short prompts are used as filenames in the "short_prompts" directory.
23
  The ten longer prompts in "full_long_prompts.txt" were used for the videos in the "long_prompts" directory.
24
 
 
 
 
 
 
 
 
 
25
  ~ desuAnon
26
 
27
  https://rentry.org/desuAnon
@@ -39,10 +53,6 @@ https://rentry.org/desuAnon
39
 
40
  [PUBLIC DOMAIN: CC0 1.0 Universal](https://creativecommons.org/publicdomain/zero/1.0/)
41
 
42
- ![img](https://licensebuttons.net/p/zero/1.0/88x31.png)
43
-
44
- ---
45
-
46
  *This public release of content produced by generative ML is intended for educational, artistic, and research purposes.*
47
  *Sora is a pending trademark of OpenAI, Inc, and is used for descriptive purposes only.*
48
  *The original videos were watermarked by OpenAI to reflect the origin of the generated content.*
 
7
  https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora
8
 
9
  After a few hours, OpenAI invalidated the API key used by the repo and removed access to the generated videos.
10
+ This release consists of 87 videos (~702 MB) and 83 corresponding prompts that were archived, from the publicly displayed generations, in anticipation of that event.
 
11
  Not all videos generated were able to be archived, due to HF load issues. All videos are of MIME type video/mp4 and have a framerate of 30 FPS.
12
+ The generation parameters may be found in the app.py of the original repo [here](https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora/blob/main/app.py); an archive of this file may be found [here](https://archive.is/r70Ao).
13
 
14
  The Sora backend that was used for generation was the following:
15
  `https://sora.openai.com/backend/video_gen`
16
 
17
+ Please note that user prompts are often "augmented" (changed by some LLM) before generating videos, so the prompts listed may not be the exact one used by the model.
18
+ The prompt used for four videos are not known, and these are denoted as [unknown_n].
19
+
20
+ ---
21
+ ### Archive versions
22
+
23
+
24
+ **sora-turbo-vids.zip**
25
+ This was the original upload, and had some encoding/compatibility issues for some users.
26
+ The "short" video filenames are the full original prompts used for the API request for each video.
27
  A "long" prompt limit was based off the filename length limit (around 255 B) for Windows/macOS/Linux.
28
  All short prompts are used as filenames in the "short_prompts" directory.
29
  The ten longer prompts in "full_long_prompts.txt" were used for the videos in the "long_prompts" directory.
30
 
31
+
32
+ **videos_only.zip** and **videos_only.7z**
33
+ These identical archives (in different compression formats) contain only the original videos, with names such as `video_24.mp4`.
34
+ The `video_24` part is the video ID, and the prompt used for a specific video ID is listed in the separate CSV and JSONL files (video_id, prompt).
35
+ You should be able to easily view both those files in a text editor, and they are easy to import and process in various programming languages.
36
+
37
+ ---
38
+
39
  ~ desuAnon
40
 
41
  https://rentry.org/desuAnon
 
53
 
54
  [PUBLIC DOMAIN: CC0 1.0 Universal](https://creativecommons.org/publicdomain/zero/1.0/)
55
 
 
 
 
 
56
  *This public release of content produced by generative ML is intended for educational, artistic, and research purposes.*
57
  *Sora is a pending trademark of OpenAI, Inc, and is used for descriptive purposes only.*
58
  *The original videos were watermarked by OpenAI to reflect the origin of the generated content.*