all-MiniLM-L12-v2 (ONNX)

ONNX export of all-MiniLM-L12-v2, a sentence-transformers model that maps sentences to a 384-dimensional dense vector space. It is the deeper, 12-layer MiniLM variant, which trades some inference speed for slightly better embedding quality.

Mirrored for use with inference4j, an inference-only AI library for Java.

Original Source

Usage with inference4j

try (SentenceTransformer model = SentenceTransformer.fromPretrained("models/all-MiniLM-L12-v2")) {
    float[] embedding = model.encode("Hello, world!");
    System.out.println("Dimension: " + embedding.length); // 384
}
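
Embeddings like the one returned by `encode` above are usually compared with cosine similarity. The following is a minimal sketch in plain Java, assuming you already hold two `float[]` vectors of equal length. The class name `EmbeddingMath` and the method `cosineSimilarity` are hypothetical helpers for illustration and are not part of inference4j.

```java
// Hypothetical helper, not part of inference4j: compares two embedding vectors.
public final class EmbeddingMath {

    private EmbeddingMath() {
        // Utility class; no instances.
    }

    /**
     * Returns the cosine similarity of two vectors, in the range [-1, 1].
     * Returns 0.0 if either vector has zero magnitude.
     */
    public static double cosineSimilarity(float[] a, float[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException(
                "Vectors must have the same length: " + a.length + " vs " + b.length);
        }
        double dot = 0.0;
        double normA = 0.0;
        double normB = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += (double) a[i] * b[i];
            normA += (double) a[i] * a[i];
            normB += (double) b[i] * b[i];
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0; // Similarity is undefined for a zero vector; 0.0 is a convention.
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }
}
```

With two embeddings `first` and `second` produced by separate `encode` calls, `EmbeddingMath.cosineSimilarity(first, second)` gives a score where values closer to 1 suggest more similar sentences.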

Model Details

| Property | Value |
| --- | --- |
| Architecture | MiniLM-L12 (12 layers, 384 hidden) |
| Task | Sentence embeddings / semantic similarity |
| Output dimension | 384 |
| Max sequence length | 256 |
| Training data | 1B+ sentence pairs |
| Original framework | PyTorch (sentence-transformers) |

License

This model is licensed under the Apache License 2.0. Original model by sentence-transformers.
