Spaces:
Sleeping
Sleeping
Update app.py
Browse files
app.py
CHANGED
|
@@ -79,31 +79,27 @@ class YouTubeDownloader:
|
|
| 79 |
- Visual effects, text overlays, or graphics
|
| 80 |
- Mood, tone, and atmosphere
|
| 81 |
- Camera movements or angles (if apparent)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 82 |
|
| 83 |
-
|
| 84 |
- For videos under 1 minute: 2-3 second segments
|
| 85 |
- For videos 1-5 minutes: 3-5 second segments
|
| 86 |
- For videos 5-15 minutes: 5-10 second segments
|
| 87 |
- For videos over 15 minutes: 10-15 second segments
|
| 88 |
- Maximum 20 scenes total for longer videos
|
| 89 |
|
| 90 |
-
|
| 91 |
**[MM:SS-MM:SS]**: Detailed description including who is visible, what they're wearing, what they're doing, what they're saying (if applicable), setting details, objects shown, and any visual elements.
|
| 92 |
|
| 93 |
-
|
| 94 |
-
- Character descriptions (appearance, clothing, expressions)
|
| 95 |
-
- Actions and movements
|
| 96 |
-
- Objects, products, or props being displayed
|
| 97 |
-
- Setting and background details
|
| 98 |
-
- Any text, graphics, or overlays
|
| 99 |
-
- Transitions between scenes
|
| 100 |
-
|
| 101 |
5. Write descriptions as if you're watching the video in real-time, noting everything visible and audible.
|
| 102 |
|
| 103 |
-
|
| 104 |
-
- Include short direct speech/dialogue wherever possible.
|
| 105 |
-
- If no exact lines are known, intelligently infer short probable phrases (e.g., "Let's get started!", "Here's how you do it.", etc.)
|
| 106 |
-
|
| 107 |
Based on the title and description, intelligently infer what would likely happen in each time segment. Consider the video type and create contextually appropriate, detailed descriptions.
|
| 108 |
"""
|
| 109 |
|
|
|
|
| 79 |
- Visual effects, text overlays, or graphics
|
| 80 |
- Mood, tone, and atmosphere
|
| 81 |
- Camera movements or angles (if apparent)
|
| 82 |
+
|
| 83 |
+
2. Dialogue Emphasis:
|
| 84 |
+
- Include short dialogue lines in **every scene** wherever plausible.
|
| 85 |
+
- Write lines like: Character: "Actual or inferred line..."
|
| 86 |
+
- If dialogue is not available, intelligently infer probable phrases (e.g., "Welcome!", "Try this now!", "It feels amazing!").
|
| 87 |
+
- Do NOT skip dialogue unless it’s clearly impossible.
|
| 88 |
|
| 89 |
+
3. Timestamp Guidelines:
|
| 90 |
- For videos under 1 minute: 2-3 second segments
|
| 91 |
- For videos 1-5 minutes: 3-5 second segments
|
| 92 |
- For videos 5-15 minutes: 5-10 second segments
|
| 93 |
- For videos over 15 minutes: 10-15 second segments
|
| 94 |
- Maximum 20 scenes total for longer videos
|
| 95 |
|
| 96 |
+
4. Format each scene EXACTLY like this:
|
| 97 |
**[MM:SS-MM:SS]**: Detailed description including who is visible, what they're wearing, what they're doing, what they're saying (if applicable), setting details, objects shown, and any visual elements.
|
| 98 |
|
| 99 |
+
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 100 |
5. Write descriptions as if you're watching the video in real-time, noting everything visible and audible.
|
| 101 |
|
| 102 |
+
|
|
|
|
|
|
|
|
|
|
| 103 |
Based on the title and description, intelligently infer what would likely happen in each time segment. Consider the video type and create contextually appropriate, detailed descriptions.
|
| 104 |
"""
|
| 105 |
|