Ved Gupta committed on
Commit
c1aa71b
·
1 Parent(s): be43a34

Update Dockerfile and README.md

Browse files
Files changed (2) hide show
  1. Dockerfile +1 -3
  2. README.md +0 -3
Dockerfile CHANGED
@@ -3,10 +3,8 @@ FROM python:3.9-alpine
3
  RUN apk add --no-cache build-base cmake git wget gcc g++ make
4
  RUN pip install llama-cpp-python sse_starlette starlette_context pydantic_settings fastapi uvicorn
5
 
6
-
7
  RUN mkdir models
8
  RUN wget -q "https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GGUF/resolve/main/mistral-7b-instruct-v0.2.Q4_0.gguf" -O models/mistral-7b-instruct-v0.2.Q4_0.gguf
9
 
10
-
11
  EXPOSE 8080
12
- CMD ["python", "-m", "llama_cpp.server", "--model", "models/mistral-7b-instruct-v0.2.Q4_0.gguf"]
 
3
  RUN apk add --no-cache build-base cmake git wget gcc g++ make
4
  RUN pip install llama-cpp-python sse_starlette starlette_context pydantic_settings fastapi uvicorn
5
 
 
6
  RUN mkdir models
7
  RUN wget -q "https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GGUF/resolve/main/mistral-7b-instruct-v0.2.Q4_0.gguf" -O models/mistral-7b-instruct-v0.2.Q4_0.gguf
8
 
 
9
  EXPOSE 8080
10
+ CMD ["python", "-m", "llama_cpp.server", "--model", "models/mistral-7b-instruct-v0.2.Q4_0.gguf", "--host", "0.0.0.0", "--port", "8080"]
README.md CHANGED
@@ -10,9 +10,6 @@ app_port: 8080
10
 
11
 
12
  ```bash
13
- CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 pip install llama-cpp-python
14
- wget -q "https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GGUF/resolve/main/mistral-7b-instruct-v0.2.Q4_0.gguf" -O models/mistral-7b-instruct-v0.2.Q4_0.gguf
15
-
16
 
17
  curl https://innovatorved-api.hf.space/v1/models
18
 
 
10
 
11
 
12
  ```bash
 
 
 
13
 
14
  curl https://innovatorved-api.hf.space/v1/models
15