Merge branch 'main' of https://huggingface.co/spaces/innovatorved/llama-server fa38229 Ved Gupta committed on Apr 26
Initial commit: llama.cpp OpenAI-compatible server for Gemma 4 E2B 82bd3da vedgupta committed on Apr 26
Keep upstream /app layout intact so dlopen finds GGML CPU backend plugin b00df39 innovatorved committed on Apr 26
Fix UID 1000 collision on ubuntu:24.04 (delete default 'ubuntu' user) 3c6e293 innovatorved committed on Apr 26
Use ubuntu:24.04 runtime to match upstream :server glibc/libstdc++ ABI b16d1e7 innovatorved committed on Apr 26
Use upstream ghcr.io/ggml-org/llama.cpp:server image (no source build) 8d5716a innovatorved committed on Apr 26
Fix build: keep LLAMA_BUILD_TOOLS=ON (server target lives under it) 563eb80 innovatorved committed on Apr 26
Slim down Docker build (server-only, no BLAS/curl, strip + symlinks) c8b257f innovatorved committed on Apr 26