File size: 4,203 Bytes
3cf75de
 
 
92c295d
3cf75de
35505f3
3cf75de
096a33f
 
 
 
 
3cf75de
096a33f
 
2d6682d
096a33f
2d6682d
92c295d
 
 
096a33f
2d6682d
 
096a33f
 
3cf75de
92c295d
36206e7
 
92c295d
2d6682d
1687744
096a33f
 
 
1687744
 
 
 
 
2d6682d
92c295d
 
3cf75de
 
 
 
40ee0bd
 
 
 
 
 
 
 
 
3cf75de
 
1bf9000
3cf75de
1687744
bd7283f
 
40ee0bd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
---
license: cc0-1.0
---
### Release Information  

Temporary access to OpenAI's video generation model Sora (turbo) was provided by the HF repo [PR-Puppet-Sora](https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora), on November 26th. After a few hours, OpenAI revoked the API key used by the repo and removed access to the generated videos. In anticipation of that event, the publicly displayed videos and their prompts were archived.  

This release contains 87 archived videos (~702 MB) and 83 of their prompts, and is dedicated to the public domain (CC0 1.0 Universal).  
The generation parameters may be found in the app.py of the original repo [here](https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora/blob/main/app.py).  
An archive of this script is available [here](https://archive.is/r70Ao).  User prompts are often "augmented" (changed by some LLM) before generating videos, and this may be true for these videos as well.  
The Sora backend that was used for generation was:  
`https://sora.openai.com/backend/video_gen`  

Contrary to some rumors online, the generations were *not* uncensored. User prompts, as well as the generated videos, passed through OpenAI's content moderation normally. This is partly the reason why none of the videos in this archive are NSFW, or similar, despite a few *brave attempts* in the prompts.  
It is also incorrect that "Sora leaked", since the model itself (its model parameters) had not been acquired by outsiders. The only thing that "leaked" was previewer/beta tester access to Sora video generation, via a single HF repo - while keeping its API keys secret.  

### Archive Versions  

All videos are `.mp4`, of varying resolutions, and a framerate of 30 FPS.  
Not all of the videos that were generated were able to be archived, due to HF server load issues.  
The prompts used for four videos are not known, and these are denoted as [unknown_n].  
Hugging Face performs *File Security Scans* of uploaded files, and you can click on the icon next to each file to see the result of this scan.

**sora-turbo-vids.zip**  
This is the original archive containing both videos and their prompts, and some users experienced encoding/compatibility issues with it. Consider using the more recent "separated" uploads if you encounter similar issues.  
The filenames in the `short_prompts` directory are the full prompts used for each video generation request. The filenames in the `long_prompts` directory are shortened versions of the long prompts (above 256 chars), and their full versions are found in `full_long_prompts.txt`.

**videos_only.zip** & **videos_only.7z**  
These identical archives (in different compression formats) contain only the original videos, with names such as `vid_24.mp4`.  
The `vid_24` part is the video ID, and the prompt used for a specific video ID is listed in the separate CSV and JSONL files (video_id, prompt).  
You may easily view both those files in a text editor, and they are easy to import and process in various programming languages.


### YouTube Compilations

This collection of videos have been uploaded to YouTube for easy viewing (by someone other than me). You can watch them here:

- [All "Leaked" Sora Videos & Prompts! (No Commentary, just Videos)](https://www.youtube.com/watch?v=FI0wWpmraW0)

- [Sora Leak - all new videos](https://www.youtube.com/watch?v=Gz33LlwsPVM)


Even though this is a *dataset* upload, I went with a *model* repo because a) the URL is shorter, and b) the original upload wasn't compatible with the HF dataset viewer.

~ desuAnon

https://rentry.org/desuAnon

---

![img1](https://files.catbox.moe/zghdas.jpg)

![img2](https://files.catbox.moe/yumx7q.jpg)

![img3](https://files.catbox.moe/q51g6k.jpg)


---

[PUBLIC DOMAIN: CC0 1.0 Universal](https://creativecommons.org/publicdomain/zero/1.0/)

*This public release of content produced by generative ML is intended for educational, artistic, and research purposes.* 
*Sora is a pending trademark of OpenAI, Inc, and is used for descriptive purposes only.*   
*The original videos were watermarked by OpenAI to reflect the origin of the generated content.*   
*This work is not endorsed by, or affiliated with, OpenAI.*