Update README.md
Browse files
README.md
CHANGED
|
@@ -45,6 +45,42 @@ Evaluated on the held‑out test split of MeetingBank (≈ 600 transcripts), us
|
|
| 45 |
| **ROUGE‑Lsum** | 48.0142 |
|
| 46 |
|
| 47 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 48 |
## Training Data
|
| 49 |
Dataset: MeetingBank
|
| 50 |
Splits: Train (5000+), Validation (600+), Test (600+)
|
|
|
|
| 45 |
| **ROUGE‑Lsum** | 48.0142 |
|
| 46 |
|
| 47 |
---
|
| 48 |
+
|
| 49 |
+
## Usage
|
| 50 |
+
|
| 51 |
+
```python
|
| 52 |
+
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
|
| 53 |
+
import torch
|
| 54 |
+
|
| 55 |
+
# 1) Load from the Hub
|
| 56 |
+
tokenizer = AutoTokenizer.from_pretrained("Shaelois/MeetingScript")
|
| 57 |
+
model = AutoModelForSeq2SeqLM.from_pretrained("Shaelois/MeetingScript")
|
| 58 |
+
|
| 59 |
+
# 2) Summarize a long transcript
|
| 60 |
+
transcript = """
|
| 61 |
+
Alice: Good morning everyone, let’s get started…
|
| 62 |
+
Bob: I updated the design mockups…
|
| 63 |
+
… (thousands of words) …
|
| 64 |
+
"""
|
| 65 |
+
inputs = tokenizer(
|
| 66 |
+
transcript,
|
| 67 |
+
max_length=4096,
|
| 68 |
+
truncation=True,
|
| 69 |
+
return_tensors="pt"
|
| 70 |
+
).to("cuda")
|
| 71 |
+
|
| 72 |
+
summary_ids = model.generate(
|
| 73 |
+
**inputs,
|
| 74 |
+
num_beams=4,
|
| 75 |
+
max_length=150,
|
| 76 |
+
early_stopping=True
|
| 77 |
+
)
|
| 78 |
+
summary = tokenizer.decode(summary_ids[0], skip_special_tokens=True)
|
| 79 |
+
print("📝 Summary:", summary)
```
|
| 80 |
+
|
| 81 |
+
---
|
| 82 |
+
|
| 83 |
+
|
| 84 |
## Training Data
|
| 85 |
Dataset: MeetingBank
|
| 86 |
Splits: Train (5000+), Validation (600+), Test (600+)
|