Spaces:
Runtime error
Runtime error
Update README.md
Browse files
README.md
CHANGED
|
@@ -15,22 +15,17 @@ tag: agent-demo-track
|
|
| 15 |
|
| 16 |
## Introduction
|
| 17 |
|
| 18 |
-
**Scriptura** is a multi-agent AI
|
| 19 |
|
| 20 |
-
|
| 21 |
-
-
|
| 22 |
-
-
|
| 23 |
-
- **MusicGen (facebook/musicgen-melody)** to create short audio tracks or sound effects, useful for prototyping or presenting.
|
| 24 |
-
- Optional web search (integrated with DuckDuckGo API) to fetch external resources (original scripts, sound effects, reference materials).
|
| 25 |
|
| 26 |
-
|
| 27 |
-
|
| 28 |
-
-
|
| 29 |
-
- **Audio**: MP3, WAV (for transcribing dialogue or analyzing uploaded soundtracks)
|
| 30 |
|
| 31 |
-
|
| 32 |
-
|
| 33 |
-
---
|
| 34 |
|
| 35 |
## Agent Capabilities
|
| 36 |
|
|
|
|
| 15 |
|
| 16 |
## Introduction
|
| 17 |
|
| 18 |
+
**Scriptura** is a multi-agent AI framework based on HF-SmolAgents that streamlines the creation of screenplays, storyboards, and soundtracks by automating the stages of analysis, summarization, and multimodal enrichment—freeing authors to focus on pure creativity.
|
| 19 |
|
| 20 |
+
At its heart:
|
| 21 |
+
• Qwen3-32B serves as the primary orchestrating agent, coordinating workflows and managing high-level reasoning across the system.
|
| 22 |
+
• Gemma-3-27B-IT acts as a specialized assistant for multimodal tasks, supporting both text and audio inputs to refine narrative elements and prepare them for downstream generation.
|
|
|
|
|
|
|
| 23 |
|
| 24 |
+
For media generation, Scriptura integrates:
|
| 25 |
+
• MusicGen models (per the AudioCraft MusicGen specification), deployed via Hugging Face Spaces, enabling the agent to produce original soundtracks and sound effects from text prompts or combined text + audio samples.
|
| 26 |
+
• FLUX (black-forest-labs/FLUX.1-dev) for on-the-fly image creation—ideal for storyboards, concept art, and visual references that seamlessly tie into the narrative flow.
|
|
|
|
| 27 |
|
| 28 |
+
Optionally, Scriptura can query external sources (e.g., via a DuckDuckGo API integration) to pull in reference scripts, sound samples, or research materials, ensuring that every draft is not only creatively rich but also contextually informed.
|
|
|
|
|
|
|
| 29 |
|
| 30 |
## Agent Capabilities
|
| 31 |
|