Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
text-generation-inference
Instructions to use ninagroot/Llama-360M-RUN1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ninagroot/Llama-360M-RUN1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="ninagroot/Llama-360M-RUN1")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("ninagroot/Llama-360M-RUN1") model = AutoModelForCausalLM.from_pretrained("ninagroot/Llama-360M-RUN1") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use ninagroot/Llama-360M-RUN1 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "ninagroot/Llama-360M-RUN1" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ninagroot/Llama-360M-RUN1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/ninagroot/Llama-360M-RUN1
- SGLang
How to use ninagroot/Llama-360M-RUN1 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "ninagroot/Llama-360M-RUN1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ninagroot/Llama-360M-RUN1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "ninagroot/Llama-360M-RUN1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ninagroot/Llama-360M-RUN1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use ninagroot/Llama-360M-RUN1 with Docker Model Runner:
docker model run hf.co/ninagroot/Llama-360M-RUN1
Ctrl+K
- Apr02_11-45-25_gcn21.local.snellius.surf.nl
- Apr02_13-40-50_gcn64.local.snellius.surf.nl
- Apr02_13-51-03_gcn32.local.snellius.surf.nl
- Apr02_14-53-04_gcn41.local.snellius.surf.nl
- Apr02_14-56-41_gcn41.local.snellius.surf.nl
- Apr02_15-25-19_gcn25.local.snellius.surf.nl
- Apr02_15-38-36_gcn31.local.snellius.surf.nl
- Apr02_16-38-15_gcn36.local.snellius.surf.nl
- Apr04_11-16-01_gcn68.local.snellius.surf.nl
- Apr04_11-22-32_gcn21.local.snellius.surf.nl
- Apr05_13-26-18_gcn23.local.snellius.surf.nl
- Apr05_13-43-15_gcn31.local.snellius.surf.nl
- Apr05_13-55-20_gcn33.local.snellius.surf.nl
- Apr05_14-39-02_gcn72.local.snellius.surf.nl
- Apr10_09-53-11_gcn37.local.snellius.surf.nl
- Apr10_10-03-18_gcn25.local.snellius.surf.nl
- Apr10_10-05-43_gcn25.local.snellius.surf.nl
- Apr10_10-13-45_gcn68.local.snellius.surf.nl
- Apr10_10-19-18_gcn16.local.snellius.surf.nl
- Apr10_10-27-16_gcn13.local.snellius.surf.nl
- Apr16_10-04-58_gcn61.local.snellius.surf.nl
- Apr16_10-21-50_gcn22.local.snellius.surf.nl
- Apr16_10-38-14_gcn66.local.snellius.surf.nl
- Apr16_10-43-51_gcn8.local.snellius.surf.nl
- Apr16_11-29-24_gcn42.local.snellius.surf.nl
- Apr16_11-35-47_gcn42.local.snellius.surf.nl
- Apr16_11-43-12_gcn29.local.snellius.surf.nl
- Apr16_11-53-20_gcn20.local.snellius.surf.nl
- Apr16_13-14-33_gcn71.local.snellius.surf.nl
- Apr16_13-22-53_gcn20.local.snellius.surf.nl
- Apr16_13-30-13_gcn34.local.snellius.surf.nl
- Apr16_13-36-00_gcn62.local.snellius.surf.nl
- Apr16_13-42-43_gcn62.local.snellius.surf.nl
- Apr16_13-55-22_gcn38.local.snellius.surf.nl
- Apr16_14-02-42_gcn20.local.snellius.surf.nl
- Apr16_14-32-30_gcn20.local.snellius.surf.nl
- Apr16_14-32-34_gcn55.local.snellius.surf.nl
- Apr17_10-07-02_gcn7.local.snellius.surf.nl
- Apr17_11-21-11_gcn40.local.snellius.surf.nl
- Apr17_11-29-46_gcn21.local.snellius.surf.nl
- Apr17_11-43-33_gcn46.local.snellius.surf.nl
- Apr17_12-23-14_gcn20.local.snellius.surf.nl
- Apr17_14-11-12_gcn25.local.snellius.surf.nl
- Mar15_10-30-19_gcn53.local.snellius.surf.nl
- Mar20_14-28-19_gcn59.local.snellius.surf.nl
- Mar20_14-33-09_gcn44.local.snellius.surf.nl
- Mar20_14-44-38_gcn37.local.snellius.surf.nl
- Mar20_14-47-36_gcn28.local.snellius.surf.nl
- Mar20_14-59-31_gcn51.local.snellius.surf.nl
- Mar20_15-08-12_gcn7.local.snellius.surf.nl