PiotrWarzachowski committed on
Commit
9991c01
verified
1 Parent(s): ef317f2

Add model card

Files changed (1)
  1. README.md +54 -0
README.md ADDED
@@ -0,0 +1,54 @@
+ ---
+ language:
+ - pl
+ license: apache-2.0
+ tags:
+ - summarization
+ - polish
+ - flan-t5
+ - text2text-generation
+ datasets:
+ - allegro/summarization-polish-summaries-corpus
+ pipeline_tag: summarization
+ ---
+
+ # Polish Text Summarizer
+
+ FLAN-T5-base fine-tuned for Polish text summarization.
+
+ ## Model Details
+
+ - **Base model:** google/flan-t5-base (248M parameters)
+ - **Task:** Text summarization
+ - **Language:** Polish
+ - **Dataset:** allegro/summarization-polish-summaries-corpus
+
+ ## Usage
+
+ ```python
+ from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
+
+ # Load the fine-tuned tokenizer and model from the Hugging Face Hub.
+ tokenizer = AutoTokenizer.from_pretrained("PiotrWarzachowski/polish-text-summarizer")
+ model = AutoModelForSeq2SeqLM.from_pretrained("PiotrWarzachowski/polish-text-summarizer")
+
+ article = "Twój długi artykuł po polsku..."
+
+ # Inputs longer than 512 tokens are truncated.
+ inputs = tokenizer(article, max_length=512, truncation=True, return_tensors="pt")
+ # Beam search with 4 beams; no 3-gram may repeat within the summary.
+ outputs = model.generate(**inputs, max_length=128, num_beams=4, no_repeat_ngram_size=3)
+ summary = tokenizer.decode(outputs[0], skip_special_tokens=True)
+
+ print(summary)
+ ```
+
+ ## Limitations
+
+ - Max input: 512 tokens (~2000-3000 characters)
+ - Max output: 128 tokens (~500 characters)
+ - Polish diacritics (ą, ę, ł, etc.) may be simplified to ASCII equivalents
+
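The usage snippet passes `truncation=True`, so anything past the 512-token limit is dropped. One possible workaround is to split a long article into chunks that each fit the limit and summarize them one by one. The sketch below is not part of the model card: `chunk_by_tokens` is a hypothetical helper, the sentence split on `". "` is deliberately naive, and the default whitespace token counter is only a stand-in so the snippet runs without downloading the model.

```python
from typing import Callable, List


def chunk_by_tokens(
    text: str,
    max_tokens: int = 512,
    count_tokens: Callable[[str], int] = lambda s: len(s.split()),
) -> List[str]:
    """Greedily pack sentences into chunks of at most max_tokens tokens.

    Hypothetical helper: a single sentence longer than max_tokens is kept
    as its own chunk and would still be truncated by the tokenizer.
    """
    # Naive sentence split; a real pipeline might use a proper segmenter.
    sentences = [s.strip() for s in text.split(". ") if s.strip()]
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current}. {sentence}" if current else sentence
        if current and count_tokens(candidate) > max_tokens:
            # Adding this sentence would exceed the limit: close the chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


chunks = chunk_by_tokens("a b c. d e f. g h i", max_tokens=6)
print(chunks)
```

To count real model tokens instead of whitespace-separated words, one could pass `count_tokens=lambda s: len(tokenizer(s)["input_ids"])` with the tokenizer loaded in the Usage section, then run `model.generate` on each chunk.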
+ ## Training
+
+ - Optimizer: Adafactor
+ - Batch size: 1 (gradient accumulation over 8 steps, for an effective batch size of 8 per device)
+ - Epochs: 3
+ - Learning rate: 1e-4
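
The batch settings above imply one optimizer update per 8 training examples. The following is a rough sketch of that arithmetic, assuming a single device and that a final partial group of examples still triggers an update. The dataset size of 10,000 is a made-up figure for illustration, not the size of the actual training set, and `optimizer_steps` is a hypothetical helper.

```python
import math

# Values taken from the Training section of the model card.
per_device_batch_size = 1
gradient_accumulation_steps = 8
num_epochs = 3

# Gradients from 8 single-example batches are accumulated before each update.
effective_batch_size = per_device_batch_size * gradient_accumulation_steps


def optimizer_steps(num_examples: int) -> int:
    """Approximate number of optimizer updates over the whole run."""
    updates_per_epoch = math.ceil(num_examples / effective_batch_size)
    return updates_per_epoch * num_epochs


# Hypothetical dataset size, used only to make the arithmetic concrete.
print(effective_batch_size, optimizer_steps(10_000))
```

Accumulating gradients this way trades wall-clock time for memory: each forward and backward pass holds only one example, which helps when fine-tuning on a GPU with limited memory.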