This is a part of the MTEB test.

# !pip install tensorflow_text 

import tensorflow_hub as hub
from tensorflow_text import SentencepieceTokenizer
import tensorflow as tf

embedder=hub.load("https://tfhub.dev/google/universal-sentence-encoder-multilingual-large/3")

class USE():
    def encode(self, sentences, batch_size=32, **kwargs):
        embeddings = []
        for i in range(0, len(sentences), batch_size):
            batch_sentences = sentences[i:i+batch_size]
            batch_embeddings = embedder(batch_sentences)
            embeddings.extend(batch_embeddings)
        return embeddings


model = USE()

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Spaces using vprelovac/universal-sentence-encoder-large-5 11

Evaluation results

accuracy on MTEB AmazonCounterfactualClassification (en)
test set self-reported

76.194
ap on MTEB AmazonCounterfactualClassification (en)
test set self-reported

39.250
f1 on MTEB AmazonCounterfactualClassification (en)
test set self-reported

70.175
accuracy on MTEB AmazonPolarityClassification
test set self-reported

69.629
ap on MTEB AmazonPolarityClassification
test set self-reported

63.973
f1 on MTEB AmazonPolarityClassification
test set self-reported

69.486
accuracy on MTEB AmazonReviewsClassification (en)
test set self-reported

35.534
f1 on MTEB AmazonReviewsClassification (en)
test set self-reported

34.974