Instructions to use leon-se/Aria-sequential_mlp-FP8-dynamic with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use leon-se/Aria-sequential_mlp-FP8-dynamic with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="leon-se/Aria-sequential_mlp-FP8-dynamic", trust_remote_code=True)
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM

processor = AutoProcessor.from_pretrained("leon-se/Aria-sequential_mlp-FP8-dynamic", trust_remote_code=True)
model = AutoModelForMultimodalLM.from_pretrained("leon-se/Aria-sequential_mlp-FP8-dynamic", trust_remote_code=True)
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use leon-se/Aria-sequential_mlp-FP8-dynamic with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "leon-se/Aria-sequential_mlp-FP8-dynamic"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "leon-se/Aria-sequential_mlp-FP8-dynamic",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker

docker model run hf.co/leon-se/Aria-sequential_mlp-FP8-dynamic

SGLang

How to use leon-se/Aria-sequential_mlp-FP8-dynamic with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "leon-se/Aria-sequential_mlp-FP8-dynamic" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "leon-se/Aria-sequential_mlp-FP8-dynamic",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "leon-se/Aria-sequential_mlp-FP8-dynamic" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "leon-se/Aria-sequential_mlp-FP8-dynamic",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Docker Model Runner
How to use leon-se/Aria-sequential_mlp-FP8-dynamic with Docker Model Runner:
```
docker model run hf.co/leon-se/Aria-sequential_mlp-FP8-dynamic
```

Aria-sequential_mlp-FP8-dynamic

Commit History

Update README.md

35ea031
verified

leon-se commited on Dec 29, 2024

Update README.md

5c82453
verified

Leon commited on Oct 23, 2024

Update README.md

e4dcfa0
verified

Leon commited on Oct 23, 2024

Update README.md

50f77df
verified

Leon commited on Oct 23, 2024

Update README.md

ef43790
verified

Leon commited on Oct 23, 2024

Update README.md

88aa751
verified

Leon commited on Oct 22, 2024

Update README.md

ff24ae3
verified

Leon commited on Oct 22, 2024

Update README.md

23cc088
verified

Leon commited on Oct 20, 2024

Update README.md

e018348
verified

Leon commited on Oct 20, 2024

Update README.md

2cf9ce2
verified

Leon commited on Oct 20, 2024

Update README.md

60fe460
verified

Leon commited on Oct 20, 2024

Update README.md

7652df9
verified

Leon commited on Oct 20, 2024

Update README.md

ee960b9
verified

Leon commited on Oct 20, 2024

Create README.md

579a814
verified

Leon commited on Oct 20, 2024

Upload folder using huggingface_hub

e79c572
verified

Leon commited on Oct 20, 2024

initial commit

de678c4
verified

Leon commited on Oct 20, 2024

Commit History

Update README.md 35ea031 verified

Update README.md 5c82453 verified

Update README.md e4dcfa0 verified

Update README.md 50f77df verified

Update README.md ef43790 verified

Update README.md 88aa751 verified

Update README.md ff24ae3 verified

Update README.md 23cc088 verified

Update README.md e018348 verified

Update README.md 2cf9ce2 verified

Update README.md 60fe460 verified

Update README.md 7652df9 verified

Update README.md ee960b9 verified

Create README.md 579a814 verified

Upload folder using huggingface_hub e79c572 verified

initial commit de678c4 verified

Update README.md

35ea031
verified

Update README.md

5c82453
verified

Update README.md

e4dcfa0
verified

Update README.md

50f77df
verified

Update README.md

ef43790
verified

Update README.md

88aa751
verified

Update README.md

ff24ae3
verified

Update README.md

23cc088
verified

Update README.md

e018348
verified

Update README.md

2cf9ce2
verified

Update README.md

60fe460
verified

Update README.md

7652df9
verified

Update README.md

ee960b9
verified

Create README.md

579a814
verified

Upload folder using huggingface_hub

e79c572
verified

initial commit

de678c4
verified