openwolf-text / app.py

Commit History

fix: upgrade to Q6_K (600MB) better quality
82ad52b
verified

hugh007 commited on

feat: switch to MiniCPM-V-4.6-Thinking GGUF
0745c6f
verified

hugh007 commited on

fix: use hf_hub_download at runtime like OpenWolf-Agent
0677a3d
verified

hugh007 commited on

fix: pin llama-cpp-python==0.3.23, disable mmap
0bf1c9b
verified

hugh007 commited on

fix: use create_completion instead of create_chat_completion, reduce ctx
1d5ac04
verified

hugh007 commited on

fix: switch to MiniCPM3-4B (official openbmb GGUF)
ada47e2
verified

hugh007 commited on

fix: switch to Qwen2.5-1.5B for better llama.cpp compatibility
c558c22
verified

hugh007 commited on

fix: use pre-compiled llama-cpp-python wheel + model in image
ac7a783
verified

hugh007 commited on

fix: bake GGUF model into Docker image during build
43fe9e7
verified

hugh007 commited on

fix: use pre-compiled llama-server binary (zero compilation)
9d01ea4
verified

hugh007 commited on

fix: use ninja-build + CMAKE_ARGS for llama-cpp-python build
8067761
verified

hugh007 commited on

init: MiniCPM-2B GGUF text Space
37de7ca

Hugh commited on