Dc-4nderson
/

transcript_summarizer_model

transcript-chunking

text-segmentation

topic-detection

Model card Files Files and versions

Dc-4nderson commited on Nov 10, 2025

Commit

83a1a3f

·

verified ·

1 Parent(s): 34e1b2b

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ metrics:
 # 🧠 Mistral LoRA Transcript Chunking Model
 ## Model Overview
-This LoRA adapter was trained on a custom dataset of **1,000 English transcript examples** to teach a **Mistral-7B-v0.2** model how to segment long transcripts into topic-based chunks using `--` as delimiters.
 It enables automated **topic boundary detection** in conversation, meeting, and podcast transcripts — ideal for preprocessing before summarization, classification, or retrieval.
 ---
@@ -78,7 +78,7 @@ model = AutoModelForCausalLM.from_pretrained(base)
 model = PeftModel.from_pretrained(model, adapter)
 text = (
-    "Break this transcript wherever a new topic begins. Use -- as a delimiter.\n"
     "Transcript: Let's start with last week's performance metrics. "
     "Next, we’ll review upcoming campaign deadlines."
 )

 # 🧠 Mistral LoRA Transcript Chunking Model
 ## Model Overview
+This LoRA adapter was trained on a custom dataset of **1,000 English transcript examples** to teach a **Mistral-7B-v0.2** model how to segment long transcripts into topic-based chunks using 'section #:' as delimiters.
 It enables automated **topic boundary detection** in conversation, meeting, and podcast transcripts — ideal for preprocessing before summarization, classification, or retrieval.
 ---
 model = PeftModel.from_pretrained(model, adapter)
 text = (
+    "Break this transcript wherever a new topic begins. Use 'section #:' as a delimiter.\n"
     "Transcript: Let's start with last week's performance metrics. "
     "Next, we’ll review upcoming campaign deadlines."
 )