Limit of tokens

by francostan - opened Feb 6, 2025

Hi everyone, im facing this problems that all the ai response generated through generate() are limited on 256 tokens:

Prompt: 707 tokens, 180.446 tokens-per-sec
Generation: 256 tokens, 20.440 tokens-per-sec
Peak memory: 4.440 GB
Respuesta generada:
...

Anyone know how to change this limit, should be on load() ?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment