Sentence Similarity
sentence-transformers
ONNX
Safetensors
GGUF
Transformers.js
bert
feature-extraction
mteb
arctic
snowflake-arctic-embed
Eval Results (legacy)
text-embeddings-inference
Instructions to use Snowflake/snowflake-arctic-embed-m-v1.5 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use Snowflake/snowflake-arctic-embed-m-v1.5 with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("Snowflake/snowflake-arctic-embed-m-v1.5") sentences = [ "That is a happy person", "That is a happy dog", "That is a very happy person", "Today is a sunny day" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Transformers.js
How to use Snowflake/snowflake-arctic-embed-m-v1.5 with Transformers.js:
// npm i @huggingface/transformers import { pipeline } from '@huggingface/transformers'; // Allocate pipeline const pipe = await pipeline('sentence-similarity', 'Snowflake/snowflake-arctic-embed-m-v1.5'); - llama-cpp-python
How to use Snowflake/snowflake-arctic-embed-m-v1.5 with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="Snowflake/snowflake-arctic-embed-m-v1.5", filename="gguf/snowflake-arctic-embed-m-v1.5-bf16.gguf", )
output = llm( "Once upon a time,", max_tokens=512, echo=True ) print(output)
- Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use Snowflake/snowflake-arctic-embed-m-v1.5 with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf Snowflake/snowflake-arctic-embed-m-v1.5:BF16 # Run inference directly in the terminal: llama-cli -hf Snowflake/snowflake-arctic-embed-m-v1.5:BF16
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf Snowflake/snowflake-arctic-embed-m-v1.5:BF16 # Run inference directly in the terminal: llama-cli -hf Snowflake/snowflake-arctic-embed-m-v1.5:BF16
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf Snowflake/snowflake-arctic-embed-m-v1.5:BF16 # Run inference directly in the terminal: ./llama-cli -hf Snowflake/snowflake-arctic-embed-m-v1.5:BF16
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf Snowflake/snowflake-arctic-embed-m-v1.5:BF16 # Run inference directly in the terminal: ./build/bin/llama-cli -hf Snowflake/snowflake-arctic-embed-m-v1.5:BF16
Use Docker
docker model run hf.co/Snowflake/snowflake-arctic-embed-m-v1.5:BF16
- LM Studio
- Jan
- Ollama
How to use Snowflake/snowflake-arctic-embed-m-v1.5 with Ollama:
ollama run hf.co/Snowflake/snowflake-arctic-embed-m-v1.5:BF16
- Unsloth Studio new
How to use Snowflake/snowflake-arctic-embed-m-v1.5 with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Snowflake/snowflake-arctic-embed-m-v1.5 to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Snowflake/snowflake-arctic-embed-m-v1.5 to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for Snowflake/snowflake-arctic-embed-m-v1.5 to start chatting
- Docker Model Runner
How to use Snowflake/snowflake-arctic-embed-m-v1.5 with Docker Model Runner:
docker model run hf.co/Snowflake/snowflake-arctic-embed-m-v1.5:BF16
- Lemonade
How to use Snowflake/snowflake-arctic-embed-m-v1.5 with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull Snowflake/snowflake-arctic-embed-m-v1.5:BF16
Run and chat with the model
lemonade run user.snowflake-arctic-embed-m-v1.5-BF16
List all available models
lemonade list
Commit History
Upload README.md 8e4eaca verified
Upload GGUF files (#6) 4d7418a verified
Specify add_pooling_layer=False via configuration instead (#5) 3b5a16e verified
Add news for preprint release d70deb4
Luke Merrick commited on
Add other MTEB eval numbers 2ea2e86
Luke Merrick commited on
Update README.md 97eab2e verified
Remove ` (default)` from MTEB scores caused by an MTEB bug (#3) 93ff640 verified
Add launch blog post link beac42d
Luke Merrick commited on
Upload ONNX weights (#1) 9611734 verified
Update README for int8 quantization tips 1511436
Luke Merrick commited on
Fix usage example 35df7db
Luke Merrick commited on
Phrasing 3063768
Luke Merrick commited on
Fix link a6faa19
Luke Merrick commited on
README phrasing a0165f3
Luke Merrick commited on
Organize readme d3c84a2
Luke Merrick commited on
Add tables and fix whitespace CRLF -> LF 95c5e16
Luke Merrick commited on
Fix MTEB YAML based on existing examples 9be73aa
Luke Merrick commited on
Add MTEB retrieval results to model card 35c2fdb
Luke Merrick commited on
Update readme with usage and details 189504d
Luke Merrick commited on
Integrate sentence transformers f45f026
Luke Merrick commited on
Add model checkpoint 9193753
Luke Merrick commited on