Try to improve the inference
Browse files
app.py
CHANGED
|
@@ -46,7 +46,7 @@ def chat(message, history):
|
|
| 46 |
with torch.no_grad():
|
| 47 |
outputs = model.generate(
|
| 48 |
**inputs,
|
| 49 |
-
max_new_tokens=
|
| 50 |
temperature=0.7,
|
| 51 |
do_sample=True,
|
| 52 |
use_cache=True,
|
|
|
|
| 46 |
with torch.no_grad():
|
| 47 |
outputs = model.generate(
|
| 48 |
**inputs,
|
| 49 |
+
max_new_tokens=256,
|
| 50 |
temperature=0.7,
|
| 51 |
do_sample=True,
|
| 52 |
use_cache=True,
|