llama-cpp-python
llama-cpp
wheel
windows
cuda-12
blackwell
sm_100
sm_90
sm_89
sm_86
sm_80
sm_75
sm_72
sm_70
sm_62
sm_61
cp312
Instructions to use trajis-tech/llama-cpp-python-trajis-tech-nonavx512-cuda with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use trajis-tech/llama-cpp-python-trajis-tech-nonavx512-cuda with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="trajis-tech/llama-cpp-python-trajis-tech-nonavx512-cuda", filename="{{GGUF_FILE}}", )output = llm( "Once upon a time,", max_tokens=512, echo=True ) print(output)
- Notebooks
- Google Colab
- Kaggle
Upload llama_cpp_python-0.3.16-cp312-cp312-win_amd64.whl
Browse files
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
llama_cpp_python-0.3.16-cp312-cp312-win_amd64.whl filter=lfs diff=lfs merge=lfs -text
|
llama_cpp_python-0.3.16-cp312-cp312-win_amd64.whl
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ff5605e3b812b34b879ee41969cc5d8a199dca5dd4f465825ce8aa20ebcd0f54
|
| 3 |
+
size 479191838
|