Add eos_token_id, pad_token_id, repetition_penalty, and no_repeat_ngram_size to the text generation pipeline to improve output quality and reduce repetition in the chatbot application.
app.py CHANGED
@@ -24,7 +24,11 @@ text_generator = pipeline(
     device=device,
     max_length=520,
     do_sample=True,
-    temperature=0.7
+    temperature=0.7,
+    eos_token_id=tokenizer.eos_token_id,  # Stop when the EOS token is reached
+    pad_token_id=tokenizer.pad_token_id,  # Use the pad token
+    repetition_penalty=1.2,  # Avoid endless repetition (penalty > 1.0)
+    no_repeat_ngram_size=3,  # Avoid repeating 3-word n-grams
 )
 
 model.eval()
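For readers unfamiliar with the repetition controls added in this commit, the following is a minimal, library-free sketch of what they are generally understood to do at each decoding step. It is not the transformers implementation: the function names (`apply_repetition_penalty`, `banned_next_tokens`, `resolve_pad_token_id`) and the dict-of-scores representation are hypothetical, chosen only for illustration. The last helper assumes the tokenizer may define no pad token, in which case `tokenizer.pad_token_id` would be `None` and falling back to the EOS id is a common choice.

```python
from typing import Dict, List, Optional, Set


def apply_repetition_penalty(
    logits: Dict[int, float], generated: List[int], penalty: float
) -> Dict[int, float]:
    """Down-weight tokens that have already been generated.

    Positive scores are divided by the penalty and negative scores are
    multiplied by it, so any penalty > 1.0 makes a seen token less likely.
    """
    adjusted = dict(logits)  # copy so the caller's scores are not mutated
    for token in set(generated):
        if token in adjusted:
            score = adjusted[token]
            adjusted[token] = score * penalty if score < 0 else score / penalty
    return adjusted


def banned_next_tokens(generated: List[int], ngram_size: int) -> Set[int]:
    """Return tokens that would complete an n-gram already in `generated`."""
    if ngram_size < 1 or len(generated) < ngram_size:
        return set()
    # The last (ngram_size - 1) tokens form the prefix of the next n-gram.
    prefix = tuple(generated[len(generated) - (ngram_size - 1):])
    banned: Set[int] = set()
    for start in range(len(generated) - ngram_size + 1):
        ngram = tuple(generated[start:start + ngram_size])
        if ngram[:-1] == prefix:
            banned.add(ngram[-1])
    return banned


def resolve_pad_token_id(pad_token_id: Optional[int], eos_token_id: int) -> int:
    """Fall back to the EOS id when no pad token is defined (assumption)."""
    return eos_token_id if pad_token_id is None else pad_token_id
```

With `generated = [5, 6, 7, 5, 6]` and `ngram_size=3`, `banned_next_tokens` returns `{7}`, because emitting 7 next would repeat the trigram (5, 6, 7). In the pipeline call above, the same fallback idea could be applied by passing a resolved id instead of `tokenizer.pad_token_id` directly, if the chosen tokenizer turns out to have no pad token.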