Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
prism-ml
/
demo
like
8
Running
on
L40S
App
Files
Files
Fetching metadata from the HF Docker repository...
main
demo
87 MB
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
pashak
Update CUDA binaries with mmq fix (b8190 -> b8191)
10697bd
21 days ago
bin
Update CUDA binaries with mmq fix (b8190 -> b8191)
21 days ago
img
Docker Space: llama.cpp CUDA inference with multi-GPU load balancing
25 days ago
.gitattributes
Safe
1.7 kB
Docker Space: llama.cpp CUDA inference with multi-GPU load balancing
25 days ago
Dockerfile
Safe
642 Bytes
Docker Space: llama.cpp CUDA inference with multi-GPU load balancing
25 days ago
README.md
Safe
1.32 kB
Docker Space: llama.cpp CUDA inference with multi-GPU load balancing
25 days ago
auth_service.py
Safe
8.69 kB
Docker Space: llama.cpp CUDA inference with multi-GPU load balancing
25 days ago
entrypoint.sh
Safe
3.6 kB
Docker Space: llama.cpp CUDA inference with multi-GPU load balancing
25 days ago
nginx.conf
Safe
2.7 kB
Docker Space: llama.cpp CUDA inference with multi-GPU load balancing
25 days ago