Refactor the C++ LLM manager into modular components, moves Python modules under python/, and keeps the current control-plane behavior intact. The C++ server now has clearer separation for config, model lifecycle, runtime services, request parsing, HTTP helpers, and server routing, while Docker build/runtime paths were updated to compile multiple C++ files and load Python code from the new package folder
332826f
Dmitry Beresnevcommited on
add new build profile
a97386f
Dmitry Beresnevcommited on
fix model config
057edf0
Dmitry Beresnevcommited on
add cpp server
fc0860f
Dmitry Beresnevcommited on
change llm model
f41621b
Dmitry Beresnevcommited on
change llm model
4f2dffc
Dmitry Beresnevcommited on
change model to Qwen2.5-Math-7B-Instruct-GGUF
cca3c7b
Dmitry Beresnevcommited on
change llm model to qwen2 math
fe7089d
Dmitry Beresnevcommited on
change llm model to mistral
97d9520
Dmitry Beresnevcommited on
fix dockerfile
c33410f
Dmitry Beresnevcommited on
change compilation flags
0e913e4
Dmitry Beresnevcommited on
change compilation flags
1a4efad
Dmitry Beresnevcommited on
reduce context and batch
34775a7
Dmitry Beresnevcommited on
fix repo name of model
dc883f9
Dmitry Beresnevcommited on
fix repo of model
c7c8563
Dmitry Beresnevcommited on
fix cmd in dockerfile
0fbce92
Dmitry Beresnevcommited on
fix dockerfile
c261631
Dmitry Beresnevcommited on
switch to qwen model via cpp server
9a590ac
Dmitry Beresnevcommited on
fix dockerfile
950f41b
Dmitry Beresnevcommited on
fix dockerfile
f64a284
Dmitry Beresnevcommited on
fix gitignore, app and logger, etc
7763bf4
Dmitry Beresnevcommited on
Force Docker rebuild for web search dependencies
9345f95
Dmitry Beresnevcommited on
fix app, dockerfile, pyproject.toml to add web search