AITeamVN
/

Vietnamese_Embedding

Sentence Similarity

sentence-transformers

text-embeddings-inference

Model card Files Files and versions

AITeamVN commited on Mar 18, 2025

Commit

8df5d5b

·

verified ·

1 Parent(s): 1ffdaf0

Update README.md

Files changed (1) hide show

README.md +22 -1

README.md CHANGED Viewed

@@ -24,4 +24,25 @@ Vietnamese_Embedding is an embedding model fine-tuned from the BGE-M3 model (htt
 |----------------------|------------|------------|------------|-------------|-------------|--------------|
 | Vietnamese_Embedding            | 0.7274     | 0.8992     | 0.9305     | 0.9568      | 0.9922     | 0.8181       |
 | Vietnamese-bi-encoder         | 0.7109     | 0.8680     | 0.9014     | 0.9299      | 0.9772      | 0.7951       |
-| BGE-M3 | 0.5682     | 0.7728     | 0.8382     | 0.8921      | 0.9772      | 0.6822       |

 |----------------------|------------|------------|------------|-------------|-------------|--------------|
 | Vietnamese_Embedding            | 0.7274     | 0.8992     | 0.9305     | 0.9568      | 0.9922     | 0.8181       |
 | Vietnamese-bi-encoder         | 0.7109     | 0.8680     | 0.9014     | 0.9299      | 0.9772      | 0.7951       |
+| BGE-M3 | 0.5682     | 0.7728     | 0.8382     | 0.8921      | 0.9772      | 0.6822       |
+## Usage
+```python
+from sentence_transformers import SentenceTransformer
+import torch
+model = SentenceTransformer("AITeamVN/Vietnamese_Embedding")
+model.max_seq_length = 2048
+sentences_1 = ["Trí tuệ nhân tạo là gì", "Tại sao giấc ngủ quan trọng?"]
+sentences_2 = ["Trí tuệ nhân tạo là công nghệ giúp máy móc suy nghĩ và học hỏi như con người. Nó hoạt động bằng cách thu thập dữ liệu, nhận diện mẫu và đưa ra quyết định.",
+               "Giấc ngủ giúp cơ thể và não bộ nghỉ ngơi, hồi phục năng lượng và cải thiện trí nhớ. Ngủ đủ giấc giúp tinh thần tỉnh táo và làm việc hiệu quả hơn."]
+query_embedding = model.encode(sentences_1)
+doc_embeddings = model.encode(sentences_2)
+similarity = query_embedding @ doc_embeddings.T
+'''
+array([[0.6621206 , 0.33066636],
+       [0.18678051, 0.4875508 ]], dtype=float32)'''
+```