Text Generation
Transformers
PyTorch
Safetensors
English
llama
facebook
meta
llama-2
text-generation-inference
Instructions to use Recag/1hf with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Recag/1hf with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-generation", model="Recag/1hf")

    # Load model directly
    from transformers import AutoTokenizer, AutoModelForCausalLM

    tokenizer = AutoTokenizer.from_pretrained("Recag/1hf")
    model = AutoModelForCausalLM.from_pretrained("Recag/1hf")
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use Recag/1hf with vLLM:
Install from pip and serve model
    # Install vLLM from pip:
    pip install vllm

    # Start the vLLM server:
    vllm serve "Recag/1hf"

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/completions" \
      -H "Content-Type: application/json" \
      --data '{
        "model": "Recag/1hf",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5
      }'
Use Docker
    docker model run hf.co/Recag/1hf
- SGLang
How to use Recag/1hf with SGLang:
Install from pip and serve model
    # Install SGLang from pip:
    pip install sglang

    # Start the SGLang server:
    python3 -m sglang.launch_server \
      --model-path "Recag/1hf" \
      --host 0.0.0.0 \
      --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
      -H "Content-Type: application/json" \
      --data '{
        "model": "Recag/1hf",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5
      }'
Use Docker images
    docker run --gpus all \
      --shm-size 32g \
      -p 30000:30000 \
      -v ~/.cache/huggingface:/root/.cache/huggingface \
      --env "HF_TOKEN=<secret>" \
      --ipc=host \
      lmsysorg/sglang:latest \
      python3 -m sglang.launch_server \
        --model-path "Recag/1hf" \
        --host 0.0.0.0 \
        --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
      -H "Content-Type: application/json" \
      --data '{
        "model": "Recag/1hf",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5
      }'
- Docker Model Runner
How to use Recag/1hf with Docker Model Runner:
docker model run hf.co/Recag/1hf
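The curl calls shown for vLLM and SGLang can also be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_completion_request` and `complete` are hypothetical, the ports are the ones from the commands above, and the response is assumed to follow the OpenAI completions schema (generated text at `choices[0].text`).

```python
import json
import urllib.request

# Endpoints taken from the vLLM (port 8000) and SGLang (port 30000) commands.
VLLM_URL = "http://localhost:8000/v1/completions"
SGLANG_URL = "http://localhost:30000/v1/completions"


def build_completion_request(url, prompt, model="Recag/1hf",
                             max_tokens=512, temperature=0.5):
    """Build (but do not send) a POST request mirroring the curl payload."""
    payload = {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(url, prompt, timeout=60.0, **kwargs):
    """Send the request and return the first generated text.

    Assumption: the server replies with an OpenAI-style completions body,
    so the text is read from choices[0]["text"].
    """
    request = build_completion_request(url, prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["text"]
```

With one of the servers running, `complete(VLLM_URL, "Once upon a time,")` or `complete(SGLANG_URL, "Once upon a time,")` should return the continuation as a string. Keeping request construction separate from sending makes the payload easy to inspect without a live server.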
Access Llama 2 on Hugging Face
This is a form to enable access to Llama 2 on Hugging Face after you have been granted access from Meta. Please visit the Meta website and accept our license terms and acceptable use policy before submitting this form. Requests will be processed in 1-2 days.
Your Hugging Face account email address MUST match the email you provide on the Meta website, or your request will not be approved.
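Once access has been granted, downloads of the gated files have to be authenticated. The sketch below shows one way this might look with Transformers. It assumes the access token is exported as `HF_TOKEN` (the same variable passed to the SGLang Docker command) and that the installed Transformers release accepts the `token` keyword on `from_pretrained` (older releases used `use_auth_token` instead). The helper names `resolve_hf_token` and `load_gated_model` are hypothetical.

```python
import os


def resolve_hf_token(env=None):
    """Return the access token from HF_TOKEN, or raise if it is missing."""
    env = os.environ if env is None else env
    token = (env.get("HF_TOKEN") or "").strip()
    if not token:
        raise RuntimeError(
            "HF_TOKEN is not set; create a token for the account "
            "that was granted access and export it first."
        )
    return token


def load_gated_model(repo_id="Recag/1hf"):
    """Load tokenizer and model from a gated repository.

    transformers is imported lazily so the token check above can be
    used without the library installed.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    token = resolve_hf_token()
    tokenizer = AutoTokenizer.from_pretrained(repo_id, token=token)
    model = AutoModelForCausalLM.from_pretrained(repo_id, token=token)
    return tokenizer, model
```

Calling `load_gated_model()` downloads the full set of weight shards, so it is only practical on a machine with enough disk space and memory for the model.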
Gated model: you can list files but not access them.