Instructions for using Nitral-Archive/Pasta-PrimaMaid-7b with libraries and local apps.

Libraries

How to use Nitral-Archive/Pasta-PrimaMaid-7b with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Nitral-Archive/Pasta-PrimaMaid-7b")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Nitral-Archive/Pasta-PrimaMaid-7b")
model = AutoModelForCausalLM.from_pretrained("Nitral-Archive/Pasta-PrimaMaid-7b")
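
The snippets above load the model but stop short of generating text. A minimal sketch of a generation call follows, assuming the sampling settings from the vLLM and SGLang curl examples on this page (512 new tokens, temperature 0.5) are a reasonable starting point for local use. The helper names `generation_kwargs` and `generate` are illustrative and not part of Transformers.

```python
def generation_kwargs(max_new_tokens=512, temperature=0.5):
    """Hypothetical helper: sampling settings mirroring the curl examples."""
    return {
        "max_new_tokens": max_new_tokens,
        "do_sample": True,  # temperature only takes effect when sampling is enabled
        "temperature": temperature,
    }


def generate(prompt, **overrides):
    """Hypothetical helper: load the pipeline and return the generated string."""
    # Imported lazily so the settings helper works without transformers installed.
    from transformers import pipeline

    pipe = pipeline("text-generation", model="Nitral-Archive/Pasta-PrimaMaid-7b")
    outputs = pipe(prompt, **generation_kwargs(**overrides))
    # The text-generation pipeline returns a list of dicts keyed by "generated_text".
    return outputs[0]["generated_text"]
```

Calling `generate("Once upon a time,")` downloads the model weights on first use and reloads the pipeline on every call, so a longer-lived script would likely build the pipeline once and reuse it.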

Local Apps

vLLM

How to use Nitral-Archive/Pasta-PrimaMaid-7b with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Nitral-Archive/Pasta-PrimaMaid-7b"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Nitral-Archive/Pasta-PrimaMaid-7b",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'
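
The same request can be sent from Python instead of curl. The sketch below uses only the standard library and assumes the server is reachable at `http://localhost:8000`, as in the curl command above. The helper names `build_completion_request` and `complete` are hypothetical and not part of vLLM.

```python
import json
import urllib.request


def build_completion_request(prompt, base_url="http://localhost:8000",
                             model="Nitral-Archive/Pasta-PrimaMaid-7b",
                             max_tokens=512, temperature=0.5):
    """Hypothetical helper: build the same POST request as the curl command."""
    payload = {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the request and return the parsed JSON response as a dict."""
    request = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

With the server running, `complete("Once upon a time,")` returns the decoded JSON body. Passing `base_url="http://localhost:30000"` should target an SGLang server started with the port used elsewhere on this page.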

Use Docker

docker model run hf.co/Nitral-Archive/Pasta-PrimaMaid-7b

SGLang

How to use Nitral-Archive/Pasta-PrimaMaid-7b with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Nitral-Archive/Pasta-PrimaMaid-7b" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Nitral-Archive/Pasta-PrimaMaid-7b",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Nitral-Archive/Pasta-PrimaMaid-7b" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Nitral-Archive/Pasta-PrimaMaid-7b",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'
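
The curl calls print the raw JSON reply. Assuming the server follows the OpenAI completions response shape (a `choices` list whose entries carry a `text` field), which the "OpenAI-compatible API" wording suggests but this page does not show, the generated text could be extracted roughly as follows. `extract_completion_text` is a hypothetical helper name.

```python
import json


def extract_completion_text(response_body):
    """Hypothetical helper: return the text of the first choice in a completions reply."""
    data = json.loads(response_body)
    choices = data.get("choices") or []
    if not choices:
        # Fail loudly rather than return an empty string for a malformed reply.
        raise ValueError("response contains no choices")
    return choices[0]["text"]
```

The function accepts the response body as a string or bytes, so the output of the curl command can be piped into a script that calls it.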

Docker Model Runner
How to use Nitral-Archive/Pasta-PrimaMaid-7b with Docker Model Runner:
```
docker model run hf.co/Nitral-Archive/Pasta-PrimaMaid-7b
```

Pasta-PrimaMaid-7b

Commit History

Adding Evaluation Results (#1) 4f0dfdb verified (Nitral, leaderboard-pr-bot committed on Mar 4, 2024)

Upload 2 files 1a2a518 verified (Nitral committed on Feb 19, 2024)

Update README.md 08b85c2 verified (Nitral committed on Feb 15, 2024)

Update README.md c00e0ed verified (Nitral committed on Feb 14, 2024)

Update README.md ec23b3d verified (Nitral committed on Feb 10, 2024)

Update README.md 438b9ff verified (Nitral committed on Feb 9, 2024)

Update README.md fa343de verified (Nitral committed on Feb 8, 2024)

Update README.md dfe66e0 verified (Nitral committed on Feb 7, 2024)

Upload folder using huggingface_hub 4302341 verified (Nitral committed on Feb 6, 2024)

initial commit 7890dad verified (Nitral committed on Feb 6, 2024)
