Fix error-format wrapping so it applies to /v1/chat/completions and generation stats
470e737
Dmitry Beresnev committed on
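As a rough illustration of the error-format wrapping this commit extends, a minimal sketch (the `wrap_error` helper and field choices are assumptions, not the repo's actual code): every failure path for /v1/chat/completions and the generation-stats endpoint funnels through one function that emits an OpenAI-style `{"error": {...}}` body instead of a bare string.

```python
import json

def wrap_error(message: str, status: int,
               err_type: str = "invalid_request_error") -> str:
    """Wrap a plain error message in an OpenAI-style error body.

    Clients of /v1/chat/completions expect failures shaped as
    {"error": {"message": ..., "type": ..., "code": ...}}, so routing
    every error through one wrapper keeps the format consistent.
    """
    return json.dumps({
        "error": {
            "message": message,
            "type": err_type,
            "code": status,
        }
    })
```

A handler would then return `wrap_error("model not loaded", 503)` rather than the raw string, so the UI can always parse the body the same way.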
Add token generation speed to UI
e8080f5
Dmitry Beresnev committed on
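The token generation speed shown in the UI can be derived from a simple counter plus a monotonic clock; a minimal sketch (the `GenerationStats` class is hypothetical, not taken from the repo):

```python
import time

class GenerationStats:
    """Track generated-token count and elapsed wall time
    so the UI can display tokens per second."""

    def __init__(self) -> None:
        self.start = time.monotonic()  # monotonic clock is immune to wall-clock jumps
        self.tokens = 0

    def on_token(self) -> None:
        """Call once per generated token."""
        self.tokens += 1

    def tokens_per_second(self) -> float:
        elapsed = time.monotonic() - self.start
        return self.tokens / elapsed if elapsed > 0 else 0.0
```

The server would report `tokens_per_second()` alongside the final response (or in each streamed chunk) for the UI to render.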
Log detailed error bodies for UI failures
7caa6ba
Dmitry Beresnev committed on
Fix 400 for llama.cpp web UI completion requests
677456b
Dmitry Beresnev committed on
Fix web UI chat by adding buffered SSE fallback
6379bd0
Dmitry Beresnev committed on
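A buffered SSE fallback of the kind this commit describes can be sketched as follows: parse the `data:` events of an OpenAI-style SSE stream, and when the client (such as the llama.cpp web UI) expects a single non-streamed body, consume the whole stream and merge the deltas into one message. This is a sketch under assumed function names (`sse_events`, `buffered_completion`) and the standard `delta.content` / `[DONE]` streaming conventions, not the repo's actual implementation.

```python
import json
from typing import Iterable, Iterator

def sse_events(lines: Iterable[str]) -> Iterator[dict]:
    """Yield parsed JSON payloads from 'data: {...}' SSE lines,
    stopping at the '[DONE]' sentinel."""
    for line in lines:
        if not line.startswith("data: "):
            continue  # skip comments, blank keep-alives, other fields
        payload = line[len("data: "):].strip()
        if payload == "[DONE]":
            break
        yield json.loads(payload)

def buffered_completion(lines: Iterable[str]) -> dict:
    """Fallback path: drain the entire SSE stream and return one
    merged chat message for clients that cannot consume a stream."""
    parts = []
    for event in sse_events(lines):
        delta = event.get("choices", [{}])[0].get("delta", {})
        parts.append(delta.get("content", ""))
    return {"choices": [{"message": {"role": "assistant",
                                     "content": "".join(parts)}}]}
```

The streaming path stays the default; the buffered path only kicks in when the client signals (or negotiation reveals) that it cannot handle SSE.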
Fix build bugs
acdc6c1
Dmitry Beresnev committed on
Refactor the C++ LLM manager into modular components, move Python modules under python/, and keep the current control-plane behavior intact. The C++ server now has clearer separation between config, model lifecycle, runtime services, request parsing, HTTP helpers, and server routing, while the Docker build/runtime paths were updated to compile multiple C++ files and load Python code from the new package folder.