Add new SentenceTransformer model.
Browse files- README.md +5 -5
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -7,7 +7,7 @@ tags:
|
|
| 7 |
- sentence-similarity
|
| 8 |
- transformers
|
| 9 |
datasets:
|
| 10 |
-
- COCOTECH-AI/ViNLI-SimCSE-supervised
|
| 11 |
---
|
| 12 |
|
| 13 |
# MiuN2k3/ViWikiSBert-fine-tuning
|
|
@@ -87,7 +87,7 @@ The model was trained with the parameters:
|
|
| 87 |
|
| 88 |
**DataLoader**:
|
| 89 |
|
| 90 |
-
`torch.utils.data.dataloader.DataLoader` of length
|
| 91 |
```
|
| 92 |
{'batch_size': 64, 'sampler': 'torch.utils.data.sampler.RandomSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}
|
| 93 |
```
|
|
@@ -103,8 +103,8 @@ Parameters of the fit()-Method:
|
|
| 103 |
```
|
| 104 |
{
|
| 105 |
"epochs": 5,
|
| 106 |
-
"evaluation_steps":
|
| 107 |
-
"evaluator": "
|
| 108 |
"max_grad_norm": 1,
|
| 109 |
"optimizer_class": "<class 'torch.optim.adamw.AdamW'>",
|
| 110 |
"optimizer_params": {
|
|
@@ -112,7 +112,7 @@ Parameters of the fit()-Method:
|
|
| 112 |
},
|
| 113 |
"scheduler": "WarmupLinear",
|
| 114 |
"steps_per_epoch": null,
|
| 115 |
-
"warmup_steps":
|
| 116 |
"weight_decay": 0.01
|
| 117 |
}
|
| 118 |
```
|
|
|
|
| 7 |
- sentence-similarity
|
| 8 |
- transformers
|
| 9 |
datasets:
|
| 10 |
+
- COCOTECH-AI/ViNLI-SimCSE-supervised-v2.0
|
| 11 |
---
|
| 12 |
|
| 13 |
# MiuN2k3/ViWikiSBert-fine-tuning
|
|
|
|
| 87 |
|
| 88 |
**DataLoader**:
|
| 89 |
|
| 90 |
+
`torch.utils.data.dataloader.DataLoader` of length 1597 with parameters:
|
| 91 |
```
|
| 92 |
{'batch_size': 64, 'sampler': 'torch.utils.data.sampler.RandomSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}
|
| 93 |
```
|
|
|
|
| 103 |
```
|
| 104 |
{
|
| 105 |
"epochs": 5,
|
| 106 |
+
"evaluation_steps": 500,
|
| 107 |
+
"evaluator": "sentence_transformers.evaluation.TripletEvaluator.TripletEvaluator",
|
| 108 |
"max_grad_norm": 1,
|
| 109 |
"optimizer_class": "<class 'torch.optim.adamw.AdamW'>",
|
| 110 |
"optimizer_params": {
|
|
|
|
| 112 |
},
|
| 113 |
"scheduler": "WarmupLinear",
|
| 114 |
"steps_per_epoch": null,
|
| 115 |
+
"warmup_steps": 798,
|
| 116 |
"weight_decay": 0.01
|
| 117 |
}
|
| 118 |
```
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 540015464
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ab3f5d33382ffe1e1115bdd62d84fbc24e5aabd356202ea390804d2cc6890baa
|
| 3 |
size 540015464
|