Sentence Similarity
sentence-transformers
ONNX
Safetensors
Transformers.js
bert
feature-extraction
mteb
arctic
snowflake-arctic-embed
Eval Results (legacy)
text-embeddings-inference
Instructions to use Snowflake/snowflake-arctic-embed-s with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use Snowflake/snowflake-arctic-embed-s with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("Snowflake/snowflake-arctic-embed-s") sentences = [ "That is a happy person", "That is a happy dog", "That is a very happy person", "Today is a sunny day" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Transformers.js
How to use Snowflake/snowflake-arctic-embed-s with Transformers.js:
// npm i @huggingface/transformers import { pipeline } from '@huggingface/transformers'; // Allocate pipeline const pipe = await pipeline('sentence-similarity', 'Snowflake/snowflake-arctic-embed-s'); - Inference
- Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -2997,14 +2997,14 @@ document_tokens = tokenizer(documents, padding=True, truncation=True, return_te
|
|
| 2997 |
# Compute token embeddings
|
| 2998 |
with torch.no_grad():
|
| 2999 |
query_embeddings = model(**query_tokens)[0][:, 0]
|
| 3000 |
-
|
| 3001 |
|
| 3002 |
|
| 3003 |
# normalize embeddings
|
| 3004 |
query_embeddings = torch.nn.functional.normalize(query_embeddings, p=2, dim=1)
|
| 3005 |
-
|
| 3006 |
|
| 3007 |
-
scores = torch.mm(query_embeddings,
|
| 3008 |
for query, query_scores in zip(queries, scores):
|
| 3009 |
doc_score_pairs = list(zip(documents, query_scores))
|
| 3010 |
doc_score_pairs = sorted(doc_score_pairs, key=lambda x: x[1], reverse=True)
|
|
|
|
| 2997 |
# Compute token embeddings
|
| 2998 |
with torch.no_grad():
|
| 2999 |
query_embeddings = model(**query_tokens)[0][:, 0]
|
| 3000 |
+
document_embeddings = model(**document_tokens)[0][:, 0]
|
| 3001 |
|
| 3002 |
|
| 3003 |
# normalize embeddings
|
| 3004 |
query_embeddings = torch.nn.functional.normalize(query_embeddings, p=2, dim=1)
|
| 3005 |
+
document_embeddings = torch.nn.functional.normalize(document_embeddings, p=2, dim=1)
|
| 3006 |
|
| 3007 |
+
scores = torch.mm(query_embeddings, document_embeddings.transpose(0, 1))
|
| 3008 |
for query, query_scores in zip(queries, scores):
|
| 3009 |
doc_score_pairs = list(zip(documents, query_scores))
|
| 3010 |
doc_score_pairs = sorted(doc_score_pairs, key=lambda x: x[1], reverse=True)
|