Instructions to use Pharos-ai/Meridian with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Pharos-ai/Meridian with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="Pharos-ai/Meridian") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoProcessor, AutoModelForMultimodalLM processor = AutoProcessor.from_pretrained("Pharos-ai/Meridian") model = AutoModelForMultimodalLM.from_pretrained("Pharos-ai/Meridian") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] inputs = processor.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use Pharos-ai/Meridian with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "Pharos-ai/Meridian" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Pharos-ai/Meridian", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/Pharos-ai/Meridian
- SGLang
How to use Pharos-ai/Meridian with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "Pharos-ai/Meridian" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Pharos-ai/Meridian", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "Pharos-ai/Meridian" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Pharos-ai/Meridian", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use Pharos-ai/Meridian with Docker Model Runner:
docker model run hf.co/Pharos-ai/Meridian
Meridian
Meridian is a research-focused chat model built to work inside an app with live search, browsing, and source-reading tools.
It is designed for assistants that need to:
- search the web before answering current or source-sensitive questions
- read retrieved pages or documents
- cite sources inline
- answer concisely instead of rambling
- separate tool calls from final responses
Meridian is not meant to be used as a plain offline chatbot for factual questions. It works best when your application gives it tools such as search, page open, document retrieval, or database lookup.
How it behaves
For questions that need fresh or grounded information, Meridian can emit tool calls like:
<call>
{"tool":"browser.search","input":{"query":"latest OpenAI ChatGPT model announcement"}}
</call>
After your app executes the tool and returns source text, Meridian is expected to answer with citations:
<response>
OpenAI announced GPT-5.2 as a newer frontier model for ChatGPT and API users, highlighting stronger reasoning, long-context understanding, coding, and vision capabilities. [source:1]
</response>
Your app is responsible for actually running the tools. The model only decides when and how to call them.
Recommended app loop
- Send the user message and system prompt to Meridian.
- Watch the generation for
<call>...</call>blocks. - Parse the JSON inside each call.
- Execute the matching tool in your app.
- Feed the tool result back to the model as source text.
- Ask the model for the final
<response>.
Example source block:
<tool_result>
{"source_id":1,"title":"Example article","url":"https://example.com","text":"Relevant page text..."}
</tool_result>
Prompt style
A good system prompt is direct and tool-aware:
You are Meridian, a source-grounded research assistant.
Use browser.search when current or factual information is needed.
Use browser.open to read promising sources.
Answer concisely and cite sources inline as [source:1], [source:2].
If sources do not support a claim, say so.
Do not invent citations.
What Meridian is good at
- research assistant workflows
- live search and browsing agents
- citation-style answers
- tool-call formatting
- short factual summaries from provided sources
- app-integrated RAG experiences
What Meridian is not
- a search engine by itself
- a guarantee of truth without retrieval
- a replacement for source filtering or ranking
- a model that can browse unless your app gives it browsing tools
Safety and reliability notes
Meridian should be used with source validation in your application. For best results:
- prefer trusted domains when ranking search results
- open and read sources before asking for the final answer
- require citations for factual claims
- show source links to users
- handle tool errors explicitly
- avoid letting the model invent source IDs
Loading
This repository contains a merged Transformers model. Load it with transformers:
from transformers import AutoModelForCausalLM, AutoTokenizer
model_id = "Pharos-ai/Meridian"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
model_id,
torch_dtype="auto",
device_map="auto",
trust_remote_code=True,
)
For production use, connect Meridian to your own search, browser, and retrieval tools.
- Downloads last month
- -