Text Generation
Transformers
PyTorch
Safetensors
English
German
Spanish
llama
language
instruct
text-generation-inference
Instructions for using FinchResearch/seal-7b-chat with libraries, inference providers, notebooks, and local apps.
- Libraries
- Transformers
How to use FinchResearch/seal-7b-chat with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-generation", model="FinchResearch/seal-7b-chat")

    # Load model directly
    from transformers import AutoTokenizer, AutoModelForCausalLM

    tokenizer = AutoTokenizer.from_pretrained("FinchResearch/seal-7b-chat")
    model = AutoModelForCausalLM.from_pretrained("FinchResearch/seal-7b-chat")
- Notebooks
- Google Colab
- Kaggle
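The Transformers snippet above loads the model but stops before producing any text. As a minimal sketch of the next step, the pipeline can be called with a prompt and a few sampling settings. The helper names `generation_kwargs` and `generate` are hypothetical, and the default values are illustrative rather than settings recommended by the model authors; the temperature simply mirrors the value used in the curl examples on this page.

```python
MODEL_ID = "FinchResearch/seal-7b-chat"


def generation_kwargs(max_new_tokens=128, temperature=0.5):
    """Build keyword arguments for a text-generation call.

    The defaults are illustrative assumptions, not tuned values.
    A temperature of 0 or below falls back to greedy decoding.
    """
    kwargs = {"max_new_tokens": max_new_tokens}
    if temperature > 0:
        kwargs.update(do_sample=True, temperature=temperature)
    else:
        kwargs["do_sample"] = False
    return kwargs


def generate(prompt, model_id=MODEL_ID, **overrides):
    """Load the model through a pipeline and return the generated text."""
    # Imported lazily because transformers is a heavy dependency
    # and loading a 7B model needs substantial memory.
    from transformers import pipeline

    pipe = pipeline("text-generation", model=model_id)
    outputs = pipe(prompt, **generation_kwargs(**overrides))
    # The text-generation pipeline returns a list of dicts,
    # each holding the prompt plus its continuation.
    return outputs[0]["generated_text"]


if __name__ == "__main__":
    print(generate("Once upon a time,"))
```

Running the script downloads the model weights on first use, so it is best tried on a machine with enough disk space and memory for a 7B model.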
- Local Apps
- vLLM
How to use FinchResearch/seal-7b-chat with vLLM:
Install from pip and serve model
    # Install vLLM from pip:
    pip install vllm

    # Start the vLLM server:
    vllm serve "FinchResearch/seal-7b-chat"

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/completions" \
      -H "Content-Type: application/json" \
      --data '{
        "model": "FinchResearch/seal-7b-chat",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5
      }'
Use Docker
docker model run hf.co/FinchResearch/seal-7b-chat
- SGLang
How to use FinchResearch/seal-7b-chat with SGLang:
Install from pip and serve model
    # Install SGLang from pip:
    pip install sglang

    # Start the SGLang server:
    python3 -m sglang.launch_server \
      --model-path "FinchResearch/seal-7b-chat" \
      --host 0.0.0.0 \
      --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
      -H "Content-Type: application/json" \
      --data '{
        "model": "FinchResearch/seal-7b-chat",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5
      }'
Use Docker images
    docker run --gpus all \
      --shm-size 32g \
      -p 30000:30000 \
      -v ~/.cache/huggingface:/root/.cache/huggingface \
      --env "HF_TOKEN=<secret>" \
      --ipc=host \
      lmsysorg/sglang:latest \
      python3 -m sglang.launch_server \
        --model-path "FinchResearch/seal-7b-chat" \
        --host 0.0.0.0 \
        --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
      -H "Content-Type: application/json" \
      --data '{
        "model": "FinchResearch/seal-7b-chat",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5
      }'
- Docker Model Runner
How to use FinchResearch/seal-7b-chat with Docker Model Runner:
docker model run hf.co/FinchResearch/seal-7b-chat
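The vLLM and SGLang curl examples above send the same JSON body to a `/v1/completions` route and differ only in port (8000 and 30000 respectively). A rough Python equivalent using only the standard library might look like the sketch below. The function names `build_completion_request` and `complete` are hypothetical, and the code assumes one of those servers is already running locally and returns the usual OpenAI-style `choices[0].text` field.

```python
import json
import urllib.request

MODEL_ID = "FinchResearch/seal-7b-chat"


def build_completion_request(base_url, prompt, max_tokens=512, temperature=0.5):
    """Build the POST request that the curl examples send."""
    payload = {
        "model": MODEL_ID,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(base_url, prompt, timeout=60.0, **kwargs):
    """Send the request and return the first completion's text."""
    request = build_completion_request(base_url, prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    # Assumes an OpenAI-compatible completions response shape.
    return body["choices"][0]["text"]


if __name__ == "__main__":
    # Port 8000 matches the vLLM example; use 30000 for SGLang.
    print(complete("http://localhost:8000", "Once upon a time,"))
```

Keeping request construction separate from the network call makes the payload easy to inspect or adjust before anything is sent.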
Commit History
Delete adapter_model.bin (d015548), committed by chew
Delete adapter_config.json (dffed7e), committed by chew
Update README.md (16cd0ce), committed by chew
Update README.md (5e1ab48), committed by chew
Upload logo.png (43754f5), committed by chew
Update README.md (7e19284), committed by chew
Upload logo.png (0429526), committed by chew
Delete icon.jpg (64ecb56), committed by chew
Update README.md (c960e8b), committed by chew
Upload icon.jpg (f0c8a2e), committed by chew
Upload folder using huggingface_hub (78ae17f), committed by chew
Update README.md (9e0d363), committed by chew
Update README.md (0953aa7), committed by chew
Upload model (303ab2d), committed by chew