added info about new archive formats & prompts
README.md
CHANGED
|
@@ -7,21 +7,35 @@ On 2024-11-26, temporary access to OpenAI's video generation model Sora (turbo)
|
|
| 7 |
https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora
|
| 8 |
|
| 9 |
After a few hours, OpenAI invalidated the API key used by the repo and removed access to the generated videos.
|
| 10 |
-
This release consists of 87 videos (~702 MB) and 83 corresponding prompts that were archived, from the publicly displayed generations, in anticipation of that event.
|
| 11 |
-
|
| 12 |
Not all videos generated were able to be archived, due to HF load issues. All videos are of MIME type video/mp4 and have a framerate of 30 FPS.
|
| 13 |
-
The generation
|
| 14 |
|
| 15 |
The Sora backend that was used for generation was the following:
|
| 16 |
`https://sora.openai.com/backend/video_gen`
|
| 17 |
|
| 18 |
-
|
| 19 |
-
|
| 20 |
-
|
| 21 |
A "long" prompt limit was based off the filename length limit (around 255 B) for Windows/macOS/Linux.
|
| 22 |
All short prompts are used as filenames in the "short_prompts" directory.
|
| 23 |
The ten longer prompts in "full_long_prompts.txt" were used for the videos in the "long_prompts" directory.
|
| 24 |
|
| 25 |
~ desuAnon
|
| 26 |
|
| 27 |
https://rentry.org/desuAnon
|
|
@@ -39,10 +53,6 @@ https://rentry.org/desuAnon
|
|
| 39 |
|
| 40 |
[PUBLIC DOMAIN: CC0 1.0 Universal](https://creativecommons.org/publicdomain/zero/1.0/)
|
| 41 |
|
| 42 |
-

|
| 43 |
-
|
| 44 |
-
---
|
| 45 |
-
|
| 46 |
*This public release of content produced by generative ML is intended for educational, artistic, and research purposes.*
|
| 47 |
*Sora is a pending trademark of OpenAI, Inc, and is used for descriptive purposes only.*
|
| 48 |
*The original videos were watermarked by OpenAI to reflect the origin of the generated content.*
|
|
|
|
| 7 |
https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora
|
| 8 |
|
| 9 |
After a few hours, OpenAI invalidated the API key used by the repo and removed access to the generated videos.
|
| 10 |
+
This release consists of 87 videos (~702 MB) and 83 corresponding prompts that were archived, from the publicly displayed generations, in anticipation of that event.
|
|
|
|
| 11 |
Not all videos generated were able to be archived, due to HF load issues. All videos are of MIME type video/mp4 and have a framerate of 30 FPS.
|
| 12 |
+
The generation parameters can be found in the original repo's [app.py](https://huggingface.co/spaces/PR-Puppets/PR-Puppet-Sora/blob/main/app.py); an archived copy of this file is available [here](https://archive.is/r70Ao).
|
| 13 |
|
| 14 |
The Sora backend that was used for generation was the following:
|
| 15 |
`https://sora.openai.com/backend/video_gen`
|
| 16 |
|
| 17 |
+
Please note that user prompts are often "augmented" (changed by some LLM) before videos are generated, so the prompts listed may not be the exact ones used by the model.
|
| 18 |
+
The prompts used for four videos are not known; these are denoted as [unknown_n].
|
| 19 |
+
|
| 20 |
+
---
|
| 21 |
+
### Archive versions
|
| 22 |
+
|
| 23 |
+
|
| 24 |
+
**sora-turbo-vids.zip**
|
| 25 |
+
This was the original upload; it had encoding/compatibility issues for some users.
|
| 26 |
+
The "short" video filenames are the full original prompts used for the API request for each video.
|
| 27 |
A "long" prompt limit was based off the filename length limit (around 255 B) for Windows/macOS/Linux.
|
| 28 |
All short prompts are used as filenames in the "short_prompts" directory.
|
| 29 |
The ten longer prompts in "full_long_prompts.txt" were used for the videos in the "long_prompts" directory.
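The short/long split described above can be illustrated with a small Python sketch. It assumes the limit applies to the UTF-8 encoded filename including the `.mp4` extension and uses exactly 255 bytes as the cutoff; the rule actually used for this archive is not stated, and the function name `fits_as_filename` is hypothetical.

```python
# Assumed cutoff: the README only says "around 255 B".
MAX_FILENAME_BYTES = 255


def fits_as_filename(prompt, extension=".mp4", limit=MAX_FILENAME_BYTES):
    """Return True if prompt + extension fits within `limit` bytes as UTF-8."""
    return len((prompt + extension).encode("utf-8")) <= limit


# Illustrative prompts only; these are not taken from the archive.
examples = ["A red fox running through snow", "x" * 300]
for text in examples:
    kind = "short" if fits_as_filename(text) else "long"
    print(kind, len(text.encode("utf-8")), "bytes")
```

Measuring bytes rather than characters matters for prompts with non-ASCII text, where one character can take several bytes.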
|
| 30 |
|
| 31 |
+
|
| 32 |
+
**videos_only.zip** and **videos_only.7z**
|
| 33 |
+
These identical archives (in different compression formats) contain only the original videos, with names such as `video_24.mp4`.
|
| 34 |
+
The `video_24` part is the video ID, and the prompt used for a specific video ID is listed in the separate CSV and JSONL files (video_id, prompt).
|
| 35 |
+
Both files can be viewed in a text editor and are easy to import and process in most programming languages.
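As a rough sketch of how the prompt listings could be used, the Python below reads a CSV or JSONL file with `video_id` and `prompt` fields and looks up the prompt for a given video filename. It assumes the CSV has a header row with those two column names and that each JSONL line is an object with the same two keys; neither detail is confirmed here, and all function names are hypothetical.

```python
import csv
import json
from pathlib import Path


def load_prompts_csv(path):
    """Read a CSV with an assumed header row: video_id,prompt."""
    with open(path, newline="", encoding="utf-8") as f:
        return {row["video_id"]: row["prompt"] for row in csv.DictReader(f)}


def load_prompts_jsonl(path):
    """Read a JSONL file with one {"video_id": ..., "prompt": ...} per line."""
    prompts = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            if line.strip():  # skip blank lines
                record = json.loads(line)
                prompts[record["video_id"]] = record["prompt"]
    return prompts


def prompt_for_video(filename, prompts):
    """Look up a prompt by filename; the stem (e.g. video_24) is the video ID."""
    return prompts.get(Path(filename).stem)


def has_known_prompt(prompt):
    """Return False for missing prompts and [unknown_n] placeholders."""
    return prompt is not None and not prompt.startswith("[unknown_")
```

For example, `prompt_for_video("video_24.mp4", load_prompts_csv("prompts.csv"))` would return the prompt for video ID `video_24`, where `prompts.csv` is a placeholder for the actual CSV filename.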
|
| 36 |
+
|
| 37 |
+
---
|
| 38 |
+
|
| 39 |
~ desuAnon
|
| 40 |
|
| 41 |
https://rentry.org/desuAnon
|
|
|
|
| 53 |
|
| 54 |
[PUBLIC DOMAIN: CC0 1.0 Universal](https://creativecommons.org/publicdomain/zero/1.0/)
|
| 55 |
|
| 56 |
*This public release of content produced by generative ML is intended for educational, artistic, and research purposes.*
|
| 57 |
*Sora is a pending trademark of OpenAI, Inc, and is used for descriptive purposes only.*
|
| 58 |
*The original videos were watermarked by OpenAI to reflect the origin of the generated content.*
|