Zen3 Embedding Medium

Medium-sized Zen3 embedding model balancing speed and retrieval accuracy.

Overview

Built on the Zen MoDE (Mixture of Distilled Experts) architecture, with about 0.3B parameters and an 8K-token context window.

Developed by Hanzo AI and the Zoo Labs Foundation.

Quick Start

from sentence_transformers import SentenceTransformer

# Load the model from the Hugging Face Hub (downloaded on first use)
model = SentenceTransformer("zenlm/zen3-embedding-medium")

sentences = [
    "The weather is lovely today.",
    "It's so sunny outside!",
    "He drove to the stadium.",
]
# Encode each sentence into one embedding vector
embeddings = model.encode(sentences)
print(embeddings.shape)  # (number of sentences, embedding dimension)

# Compute cosine similarities
similarities = model.similarity(embeddings, embeddings)
print(similarities)
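
To make the similarity step concrete, here is a minimal sketch of what a cosine similarity score is and how it can be used to rank documents against a query. It uses only the standard library. The helper names cosine_similarity and rank_by_similarity are hypothetical, and the 3-dimensional vectors are invented for illustration; they are not outputs of this model.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, doc_vecs):
    """Return (index, score) pairs sorted from most to least similar."""
    scores = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(doc_vecs)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Toy vectors, invented for illustration only (not real model embeddings).
query = [1.0, 0.0, 0.0]
docs = [[0.0, 1.0, 0.0], [2.0, 0.0, 0.0], [1.0, 1.0, 0.0]]

ranking = rank_by_similarity(query, docs)
print(ranking)
```

In practice the vectors would come from model.encode, and the built-in model.similarity call is the more convenient route; this sketch only spells out the arithmetic behind a single score.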

API Access

from openai import OpenAI

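# Point the OpenAI-compatible client at the Hanzo endpoint
client = OpenAI(base_url="https://api.hanzo.ai/v1", api_key="your-api-key")
# Request an embedding for a single input string
response = client.embeddings.create(model="zen3-embedding-medium", input="Your text here")
print(response.data[0].embedding)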
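
When embedding many texts through the API, it can help to send them in fixed-size batches rather than one request per text. The sketch below shows one possible shape for that. The helper embed_in_batches and the default batch size of 32 are hypothetical, and the embedding call is injected as a function so the example runs without network access; fake_embed is a stand-in, not the real service.

```python
def embed_in_batches(texts, embed_fn, batch_size=32):
    """Embed texts in order, calling embed_fn once per batch.

    embed_fn takes a list of strings and returns one vector per string.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    vectors = []
    for start in range(0, len(texts), batch_size):
        batch = texts[start:start + batch_size]
        batch_vectors = embed_fn(batch)
        if len(batch_vectors) != len(batch):
            raise RuntimeError("embed_fn returned a different number of vectors than inputs")
        vectors.extend(batch_vectors)
    return vectors


def fake_embed(batch):
    """Stand-in embedder for illustration: a 1-d vector holding the text length."""
    return [[float(len(text))] for text in batch]


result = embed_in_batches(["a", "bb", "ccc"], fake_embed, batch_size=2)
print(result)
```

To use this with the client above, embed_fn could wrap client.embeddings.create(model="zen3-embedding-medium", input=batch) and return [item.embedding for item in response.data]. This assumes the endpoint accepts a list of strings as input, as the OpenAI embeddings API does; the source does not confirm this for the Hanzo endpoint.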

Model Details

| Attribute | Value |
|---|---|
| Parameters | 0.3B (medium) |
| Architecture | Zen MoDE (Mixture of Distilled Experts) |
| Context | 8K tokens |
| License | Apache 2.0 |

License

Apache 2.0
