Spaces: prism-ml/demo
Running on L40S

demo
87 MB
  • 1 contributor
History: 3 commits
pashak
Update CUDA binaries with mmq fix (b8190 -> b8191)
10697bd 21 days ago
  • bin: Update CUDA binaries with mmq fix (b8190 -> b8191), 21 days ago
  • img: Docker Space: llama.cpp CUDA inference with multi-GPU load balancing, 25 days ago
  • .gitattributes (1.7 kB): Docker Space: llama.cpp CUDA inference with multi-GPU load balancing, 25 days ago
  • Dockerfile (642 Bytes): Docker Space: llama.cpp CUDA inference with multi-GPU load balancing, 25 days ago
  • README.md (1.32 kB): Docker Space: llama.cpp CUDA inference with multi-GPU load balancing, 25 days ago
  • auth_service.py (8.69 kB): Docker Space: llama.cpp CUDA inference with multi-GPU load balancing, 25 days ago
  • entrypoint.sh (3.6 kB): Docker Space: llama.cpp CUDA inference with multi-GPU load balancing, 25 days ago
  • nginx.conf (2.7 kB): Docker Space: llama.cpp CUDA inference with multi-GPU load balancing, 25 days ago