Add think flag: optionally stream reasoning wrapped in <think> f1b8cae Running verified polats commited on 3 days ago
Reasoning model: think in discarded block, stream only clean response code 8fce99a verified polats commited on 3 days ago
Fix: splat chat-template dict into generate (BatchEncoding has no .shape) 8f651ac verified polats commited on 3 days ago