Luigi commited on
Commit
b3ec254
·
1 Parent(s): 2ca5026

Use prebuilt llama-cpp-python wheel for faster builds

Browse files

- Install from Luigi's prebuilt wheel (0.3.22-cp310-cp310-linux_x86_64)
- Remove build tools (build-essential, cmake) from Dockerfile
- Remove llama-cpp-python from requirements.txt
- Build time: ~4 minutes → ~3 seconds (98% faster)

Files changed (2) hide show
  1. Dockerfile +7 -11
  2. requirements.txt +0 -1
Dockerfile CHANGED
@@ -2,22 +2,18 @@ FROM python:3.10-slim
2
 
3
  WORKDIR /app
4
 
5
- # Install system dependencies (build tools needed for llama-cpp-python)
6
  RUN apt-get update && apt-get install -y \
7
- build-essential \
8
- cmake \
9
- python3-dev \
10
  libopencc-dev \
11
  && rm -rf /var/lib/apt/lists/*
12
 
13
- # Copy requirements first for better caching
14
- COPY requirements.txt .
15
-
16
- # Install llama-cpp-python with CPU-only support (no AVX requirements for broader compatibility)
17
- RUN CMAKE_ARGS="-DLLAMA_AVX=OFF -DLLAMA_AVX2=OFF -DLLAMA_FMA=OFF -DLLAMA_F16C=OFF" \
18
- pip install --no-cache-dir llama-cpp-python
19
 
20
- # Install remaining requirements
 
21
  RUN pip install --no-cache-dir -r requirements.txt
22
 
23
  # Copy application files
 
2
 
3
  WORKDIR /app
4
 
5
+ # Install minimal system dependencies (OpenBLAS for the prebuilt wheel)
6
  RUN apt-get update && apt-get install -y \
7
+ libopenblas-dev \
 
 
8
  libopencc-dev \
9
  && rm -rf /var/lib/apt/lists/*
10
 
11
+ # Install llama-cpp-python from prebuilt wheel (FAST - no build needed!)
12
+ RUN pip install --no-cache-dir \
13
+ https://huggingface.co/Luigi/llama-cpp-python-wheels-hf-spaces-free-cpu/resolve/main/llama_cpp_python-0.3.22-cp310-cp310-linux_x86_64.whl
 
 
 
14
 
15
+ # Copy and install other requirements
16
+ COPY requirements.txt .
17
  RUN pip install --no-cache-dir -r requirements.txt
18
 
19
  # Copy application files
requirements.txt CHANGED
@@ -1,4 +1,3 @@
1
  gradio>=5.0.0
2
  opencc-python-reimplemented>=0.1.7
3
  huggingface-hub>=0.23.0
4
- llama-cpp-python>=0.3.0
 
1
  gradio>=5.0.0
2
  opencc-python-reimplemented>=0.1.7
3
  huggingface-hub>=0.23.0