Fix handler - remove no_repeat_ngram_size parameter that may not be supported by inference endpoints d15fa5d verified 0chanly commited on 11 days ago
Update generation parameters to match local chatbot (max_tokens=180, repetition_penalty=1.2, top_p=0.9, top_k=50) 370c4cf verified 0chanly commited on 12 days ago