Spaces:

rm8630
/

ai-transcript-clipper

Sleeping

Raj Jayendrakumar Muchhala commited on Feb 21, 2025

Commit

2a09216

1 Parent(s): 2ef1f15

change the output format and prompt for clipper

Files changed (1) hide show

clipper_prompts.py CHANGED Viewed

@@ -7,25 +7,29 @@ Your task is to extract verbatim segments from the transcript for a given clip p
 - **Duration Target:** The desired duration for the clip.
 Instructions to follow when extracting verbatim segments from the transcript are as follows:
-1. Return the transcript text exactly as it appears.
-2. Identify and extract every segment of the transcript that relates to the Focus Prompt.
-3. If there are multiple relevant passages, combine them in the order they appear in the transcript.
-4. Duration Rules:
     - Use the Duration Target as a guideline to select the most relevant content.
     - Aim to extract transcript segments that roughly match the target duration.
-    - If the available relevant content is naturally shorter than the duration target, extract only what is relevant, do not force inclusion of irrelevant content solely to reach the target duration.
 ### **OUTPUT:**
-- The response should be in the form of a JSON object with the following structure:
 ```
 {
-    "Title": "Clip Title",
-    "Focus Prompt": "Clip Focus Prompt",
-    "Transcript": "Verbatim Segments",
-    "Duration": "Clip Duration",
 }
 ```
 '''
 CLIPPER_USER_MESSAGE = '''

 - **Duration Target:** The desired duration for the clip.
 Instructions to follow when extracting verbatim segments from the transcript are as follows:
+1. Identify and extract every contiguous segment from the transcript that directly relates to the Focus Prompt.
+2. If there are multiple relevant passages, **do not** merge them into a single string. Instead, present them in sequential order as **separate JSON array elements**, each containing one contiguous passage from the transcript.
+3. Duration Rules:
     - Use the Duration Target as a guideline to select the most relevant content.
     - Aim to extract transcript segments that roughly match the target duration.
+    - If the available relevant content is naturally shorter than the duration target, extract only what is relevant, do not force inclusion of irrelevant content solely to reach the target duration.
+4. Return these passages **exactly** as they appear (verbatim) from the source.
 ### **OUTPUT:**
+Return a JSON object with **only** the key `"Regions"`, where the value is an **array** of objects.
 ```
 {
+  "Regions": [
+    {
+      "region": "Exact verbatim text of the first relevant passage."
+    },
+    {
+      "region": "Exact verbatim text of the second relevant passage."
+    },
+    ....
+  ]
 }
 ```
 '''
 CLIPPER_USER_MESSAGE = '''