Shaelois commited on
Commit
2eedb8f
·
verified ·
1 Parent(s): 61e7dec

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -28,7 +28,7 @@ pipeline_tag: summarization
28
  **MeetingScript** is a sequence‑to‑sequence model based on
29
  [google/bigbird-pegasus-large-bigpatent](https://huggingface.co/google/bigbird-pegasus-large-bigpatent)
30
  and fine‑tuned on the [MeetingBank](https://huggingface.co/datasets/huuuyeah/meetingbank) corpus of meeting transcripts paired with human‐written summaries.
31
- It is designed to take long meeting transcripts (up to 4 096 tokens) and produce concise, coherent summaries.
32
 
33
  ---
34
 
@@ -46,6 +46,6 @@ Evaluated on the held‑out test split of MeetingBank (≈ 600 transcripts), us
46
  ---
47
  ## Training Data
48
  Dataset: MeetingBank
49
- Splits: Train (~5 000), Validation (~600), Test (~600)
50
- Preprocessing: Sliding‑window chunking for sequences > 4 096 tokens
51
 
 
28
  **MeetingScript** is a sequence‑to‑sequence model based on
29
  [google/bigbird-pegasus-large-bigpatent](https://huggingface.co/google/bigbird-pegasus-large-bigpatent)
30
  and fine‑tuned on the [MeetingBank](https://huggingface.co/datasets/huuuyeah/meetingbank) corpus of meeting transcripts paired with human‐written summaries.
31
+ It is designed to take long meeting transcripts (up to 4096 tokens) and produce concise, coherent summaries.
32
 
33
  ---
34
 
 
46
  ---
47
  ## Training Data
48
  Dataset: MeetingBank
49
+ Splits: Train (~5000), Validation (~600), Test (~600)
50
+ Preprocessing: Sliding‑window chunking for sequences > 4096 tokens
51