Commit 4108e41 by matsut21 (verified) · 1 Parent(s): a75c940

Create README.md

Files changed (1)
  1. README.md +29 -0
README.md ADDED
@@ -0,0 +1,29 @@
+ ---
+ language:
+ - ka
+ base_model:
+ - intfloat/multilingual-e5-small
+ tags:
+ - text-embeddings
+ - georgian
+ - multilingual-e5
+ ---
+
+ # Georgian E5 Fine-tuned Text Embeddings
+
+ Fine-tuned version of `intfloat/multilingual-e5-small` for Georgian text embeddings using contrastive learning.
+
+ ## Model Performance
+ - Validation Accuracy: 80.06%
+ - Training completed over 3 epochs
+ - Contrastive loss with margin=0.5
+
+ ## Dataset
+ - 12,455+ Georgian text pairs across 9 semantic relationship types
+
+ ## Usage
+ ```python
+ from transformers import AutoTokenizer, AutoModel
+
+ tokenizer = AutoTokenizer.from_pretrained("matsut21/georgian-e5-finetuned")
+ model = AutoModel.from_pretrained("matsut21/georgian-e5-finetuned")
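
Note on the "Contrastive loss with margin=0.5" line in the added README: the commit does not say which formulation was used. A minimal sketch of one common pairwise form follows, in which similar pairs are pulled together and dissimilar pairs are pushed apart until their distance exceeds the margin. The function names, the use of cosine distance, and the 0.5 scaling factor are assumptions for illustration, not taken from the commit.

```python
import math
from typing import Sequence


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def contrastive_loss(
    a: Sequence[float],
    b: Sequence[float],
    is_similar: bool,
    margin: float = 0.5,  # value quoted in the README
) -> float:
    """Pairwise contrastive loss (assumed form, not confirmed by the commit).

    Similar pairs are penalised by their squared distance; dissimilar pairs
    are penalised only while they sit closer together than `margin`.
    """
    d = cosine_distance(a, b)
    if is_similar:
        return 0.5 * d ** 2
    return 0.5 * max(0.0, margin - d) ** 2
```

Under this form, a dissimilar pair contributes nothing once its cosine distance reaches 0.5, so the margin bounds how far apart unrelated texts are pushed.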