Instructions to use Pharos-ai/Meridian with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Pharos-ai/Meridian with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Pharos-ai/Meridian")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM

processor = AutoProcessor.from_pretrained("Pharos-ai/Meridian")
model = AutoModelForMultimodalLM.from_pretrained("Pharos-ai/Meridian")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use Pharos-ai/Meridian with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Pharos-ai/Meridian"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Pharos-ai/Meridian",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Pharos-ai/Meridian

SGLang

How to use Pharos-ai/Meridian with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Pharos-ai/Meridian" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Pharos-ai/Meridian",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Pharos-ai/Meridian" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Pharos-ai/Meridian",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use Pharos-ai/Meridian with Docker Model Runner:
```
docker model run hf.co/Pharos-ai/Meridian
```

Meridian

Meridian is a research-focused chat model built to work inside an app with live search, browsing, and source-reading tools.

It is designed for assistants that need to:

search the web before answering current or source-sensitive questions
read retrieved pages or documents
cite sources inline
answer concisely instead of rambling
separate tool calls from final responses

Meridian is not meant to be used as a plain offline chatbot for factual questions. It works best when your application gives it tools such as search, page open, document retrieval, or database lookup.

How it behaves

For questions that need fresh or grounded information, Meridian can emit tool calls like:

<call>
{"tool":"browser.search","input":{"query":"latest OpenAI ChatGPT model announcement"}}
</call>

After your app executes the tool and returns source text, Meridian is expected to answer with citations:

<response>
OpenAI announced GPT-5.2 as a newer frontier model for ChatGPT and API users, highlighting stronger reasoning, long-context understanding, coding, and vision capabilities. [source:1]
</response>

Your app is responsible for actually running the tools. The model only decides when and how to call them.

Recommended app loop

Send the user message and system prompt to Meridian.
Watch the generation for <call>...</call> blocks.
Parse the JSON inside each call.
Execute the matching tool in your app.
Feed the tool result back to the model as source text.
Ask the model for the final <response>.

Example source block:

<tool_result>
{"source_id":1,"title":"Example article","url":"https://example.com","text":"Relevant page text..."}
</tool_result>

Prompt style

A good system prompt is direct and tool-aware:

You are Meridian, a source-grounded research assistant.
Use browser.search when current or factual information is needed.
Use browser.open to read promising sources.
Answer concisely and cite sources inline as [source:1], [source:2].
If sources do not support a claim, say so.
Do not invent citations.

What Meridian is good at

research assistant workflows
live search and browsing agents
citation-style answers
tool-call formatting
short factual summaries from provided sources
app-integrated RAG experiences

What Meridian is not

a search engine by itself
a guarantee of truth without retrieval
a replacement for source filtering or ranking
a model that can browse unless your app gives it browsing tools

Safety and reliability notes

Meridian should be used with source validation in your application. For best results:

prefer trusted domains when ranking search results
open and read sources before asking for the final answer
require citations for factual claims
show source links to users
handle tool errors explicitly
avoid letting the model invent source IDs

Loading

This repository contains a merged Transformers model. Load it with transformers:

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Pharos-ai/Meridian"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",
    device_map="auto",
    trust_remote_code=True,
)

For production use, connect Meridian to your own search, browser, and retrieval tools.

Downloads last month: -

Safetensors

Model size

12B params

Tensor type

BF16