feat: minimal FastAPI app for Llama via HF Inference Endpoint; Dockerfile + requirements 02a6500 harismlnaslm committed on Oct 30, 2025