Sentence Similarity
sentence-transformers
PyTorch
TensorFlow
Rust
ONNX
Safetensors
OpenVINO
Transformers
English
bert
feature-extraction
Eval Results
text-embeddings-inference
Instructions to use sentence-transformers/all-MiniLM-L6-v2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use sentence-transformers/all-MiniLM-L6-v2 with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2") sentences = [ "That is a happy person", "That is a happy dog", "That is a very happy person", "Today is a sunny day" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Transformers
How to use sentence-transformers/all-MiniLM-L6-v2 with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("sentence-transformers/all-MiniLM-L6-v2") model = AutoModel.from_pretrained("sentence-transformers/all-MiniLM-L6-v2") - Inference
- Notebooks
- Google Colab
- Kaggle
支持中文吗
#88
by hswu - opened
支持中文吗
实测中午效果不好
对
实测中午效果不好
怎么测的?
这个模型确实只针对英文文本进行训练,在中文上表现不佳。
也许您可以使用其中之一,它们经过了包括中文在内的多种语言的培训。
https://huggingface.co/models?library=sentence-transformers&language=zh
例如
https://huggingface.co/DMetaSoul/sbert-chinese-general-v2
https://huggingface.co/maidalun1020/bce-embedding-base_v1
https://huggingface.co/shibing624/text2vec-base-chinese
https://huggingface.co/intfloat/multilingual-e5-small
(此消息是我翻译的,如有错误请见谅)
- Tom Aarsen
中文效果不是很好,有些信息检索不出来。对比了一下,差不多size的模型下面这个中文表现更好
https://huggingface.co/DMetaSoul/sbert-chinese-general-v2-distill