James040 committed
Commit ff952d6 · verified · 1 Parent(s): 14da0d0

Update README.md

Files changed (1): README.md (+28 −11)

README.md CHANGED
@@ -8,18 +8,35 @@ tags:
  - python-3.13
  ---
 
- # Llama-CPP-Python Pre-built Wheels (CPU Only)
 
- This repository provides pre-compiled Python wheels for `llama-cpp-python`, specifically optimized for **Hugging Face Spaces (Free CPU Tier)**.
 
- ## 🚀 Key Features
- - **Zero Compilation:** Skips the 15-minute C++ build process on HF Spaces.
- - **Python 3.13 Support:** Built for the latest Python version.
- - **Generic CPU:** Compiled with `GGML_NATIVE=OFF` to ensure compatibility with older cloud processors (no "Illegal Instruction" errors).
- ## 🛠 Usage in HF Spaces
 
- ### Dockerfile (Recommended)
- Add this to your Dockerfile to install the wheel instantly:
- ```dockerfile
- RUN pip install https://huggingface.co/Jameson040/llama-cpp-python-wheels/resolve/main/llama_cpp_python-0.3.16-cp313-cp313-linux_x86_64.whl
+ # 🦙 Llama-CPP-Python Pre-built Wheels (Python 3.13)
+
+ ### The solution for Hugging Face "Build Timeout" errors on the Free CPU Tier.
+
+ If you are using **Python 3.13** on a Hugging Face Free Space, compiling `llama-cpp-python` from source usually crashes or times out. This repository provides pre-compiled **manylinux** wheels that install in seconds.
+
+ ---
+
+ ## 🚀 Why use these wheels?
+
+ * **No Compilation:** Skips the 15+ minute build process.
+ * **Python 3.13 Support:** Specifically built for the latest Python version.
+ * **Generic CPU Optimization:** Compiled with `GGML_NATIVE=OFF`. This ensures the model runs on HF's shared CPUs without "Illegal Instruction" or "Core Dump" errors.
+ * **Lightweight:** Only ~4.3 MB, with no compiler toolchain needed at install time.
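The `cp313-cp313-linux_x86_64` part of the wheel filename is exactly what pip matches against your interpreter. As a rough illustration (this relies only on the standard `{dist}-{version}-{python}-{abi}-{platform}.whl` naming convention, nothing specific to this repo), you can pull those tags out of a filename to confirm a wheel targets your Space's Python 3.13 runtime:

```python
# Illustrative sketch, not part of this repo: extract the compatibility
# tags that pip reads from a wheel filename, which follows the standard
# {dist}-{version}-{python tag}-{abi tag}-{platform tag}.whl convention.

def wheel_tags(filename: str) -> tuple[str, str, str]:
    """Return (python_tag, abi_tag, platform_tag) from a wheel filename."""
    stem = filename.removesuffix(".whl")
    # The last three dash-separated fields are the compatibility tags.
    python_tag, abi_tag, platform_tag = stem.split("-")[-3:]
    return python_tag, abi_tag, platform_tag

wheel = "llama_cpp_python-0.3.16-cp313-cp313-linux_x86_64.whl"
print(wheel_tags(wheel))  # -> ('cp313', 'cp313', 'linux_x86_64')
```

If the python tag were anything other than `cp313`, pip on Python 3.13 would reject the file as not a supported wheel for the platform.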
+
+ ---
+
+ ## 🛠️ How to use in your HF Space
+
+ ### Option A: Using `requirements.txt`
+ Simply paste this direct link into your `requirements.txt` file:
+
+ ```text
+ https://huggingface.co/James040/llama-cpp-python-wheels/resolve/main/llama_cpp_python-0.3.16-cp313-cp313-linux_x86_64.whl
+ ```
+
+ ### Option B: Using a Dockerfile
+ If you are using a custom Docker setup, add this line:
+
+ ```dockerfile
+ RUN pip install https://huggingface.co/James040/llama-cpp-python-wheels/resolve/main/llama_cpp_python-0.3.16-cp313-cp313-linux_x86_64.whl
+ ```
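Whichever option you pick, a quick startup check in your Space's app code can fail fast with a readable message instead of crashing later when the model loads. A minimal sketch (the message strings here are ours, not from this repo):

```python
# Hypothetical startup check for a Space's app.py: verify that the
# pre-built llama_cpp wheel actually installed before loading a model.
import importlib.util

def llama_cpp_installed() -> bool:
    """True if the llama_cpp package can be found by the import system."""
    return importlib.util.find_spec("llama_cpp") is not None

if llama_cpp_installed():
    print("llama_cpp ready")
else:
    print("llama_cpp missing: add the wheel URL to requirements.txt")
```

This only confirms the package is importable; model loading itself still depends on the wheel's CPU flags matching the host, which is what `GGML_NATIVE=OFF` is for.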
+ ## 📦 Build Specifications
+
+ These wheels were built using an automated pipeline on GitHub.
+
+ | Specification | Value |
+ | --- | --- |
+ | Python Version | 3.13 |
+ | Platform | Linux x86_64 (manylinux) |
+ | Build Flags | `GGML_NATIVE=OFF`, `GGML_BLAS=OFF` |
+ | Build Source | Jameson040/my_lama-wheels |