Commit History

Fix history unpacking for Gradio chat interface
9df24f1
verified

Jodaro committed on

Reduce max_new_tokens to 64 for faster replies
e6b8d52
verified

Jodaro committed on

Switch to TinyLlama 1.1B Chat Q4_K_M
398f222
verified

Jodaro committed on

Switch to Llama 3.2 3B Instruct Q4_K_M
863eb49
verified

Jodaro committed on

Set ctransformers model_type to qwen
5db6945
verified

Jodaro committed on

Switch to Qwen2.5-3B-Instruct Q4_K_M (GGUF)
d966321
verified

Jodaro committed on

Fix gradio launch and mistral prompt formatting
807809c
verified

Jodaro committed on

Switch to Mistral 7B GGUF
de96a1d
verified

Jodaro committed on

Use ctransformers qwen model
0e9e41e
verified

Jodaro committed on

Use ctransformers for Qwen
e9ddae9
verified

Jodaro committed on

Switch to Qwen3-4B
4323878
verified

Jodaro committed on

Switch to llama_cpp
689f1fc
verified

Jodaro committed on

Fix model loading (remove hf_model, set model_type)
192caec
verified

Jodaro committed on

Create app.py with ctransformers
cbf8005
verified

Jodaro committed on