Sentence Similarity
sentence-transformers
Safetensors
bert
feature-extraction
dense
Generated from Trainer
dataset_size:111470
loss:MultipleNegativesRankingLoss
Eval Results (legacy)
text-embeddings-inference
Instructions to use redis/model-b-structured with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use redis/model-b-structured with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("redis/model-b-structured") sentences = [ "when was the first elephant brought to america", "Old Bet The first elephant brought to the United States was in 1796, aboard the America which set sail from Calcutta for New York on December 3, 1795.[4] However, it is not certain that this was Old Bet.[2] The first references to Old Bet start in 1804 in Boston as part of a menagerie.[1] In 1808, while residing in Somers, New York, Hachaliah Bailey purchased the menagerie elephant for $1,000 and named it \"Old Bet\".[5][6]", "Cronus Rhea secretly gave birth to Zeus in Crete, and handed Cronus a stone wrapped in swaddling clothes, also known as the Omphalos Stone, which he promptly swallowed, thinking that it was his son.", "Renal artery One or two accessory renal arteries are frequently found, especially on the left side since they usually arise from the aorta, and may come off above (more common) or below the main artery. Instead of entering the kidney at the hilus, they usually pierce the upper or lower part of the organ." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Notebooks
- Google Colab
- Kaggle
Add new SentenceTransformer model
Browse files
README.md
CHANGED
|
@@ -5,38 +5,110 @@ tags:
|
|
| 5 |
- feature-extraction
|
| 6 |
- dense
|
| 7 |
- generated_from_trainer
|
| 8 |
-
- dataset_size:
|
| 9 |
- loss:MultipleNegativesRankingLoss
|
| 10 |
base_model: prajjwal1/bert-small
|
| 11 |
widget:
|
| 12 |
-
- source_sentence: How do I
|
|
|
|
| 13 |
sentences:
|
| 14 |
-
-
|
| 15 |
-
- How do
|
| 16 |
-
- What are some
|
| 17 |
-
- source_sentence:
|
| 18 |
sentences:
|
| 19 |
-
-
|
| 20 |
-
|
| 21 |
-
-
|
| 22 |
-
|
| 23 |
-
- source_sentence:
|
|
|
|
| 24 |
sentences:
|
| 25 |
-
-
|
| 26 |
-
-
|
| 27 |
-
|
| 28 |
-
-
|
|
|
|
|
|
|
| 29 |
sentences:
|
| 30 |
-
-
|
| 31 |
-
-
|
| 32 |
-
-
|
| 33 |
-
- source_sentence:
|
| 34 |
sentences:
|
| 35 |
-
-
|
| 36 |
-
|
| 37 |
-
-
|
|
|
|
| 38 |
pipeline_tag: sentence-similarity
|
| 39 |
library_name: sentence-transformers
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 40 |
---
|
| 41 |
|
| 42 |
# SentenceTransformer based on prajjwal1/bert-small
|
|
@@ -85,12 +157,12 @@ Then you can load this model and run inference.
|
|
| 85 |
from sentence_transformers import SentenceTransformer
|
| 86 |
|
| 87 |
# Download from the 🤗 Hub
|
| 88 |
-
model = SentenceTransformer("
|
| 89 |
# Run inference
|
| 90 |
sentences = [
|
| 91 |
-
'
|
| 92 |
-
'
|
| 93 |
-
'
|
| 94 |
]
|
| 95 |
embeddings = model.encode(sentences)
|
| 96 |
print(embeddings.shape)
|
|
@@ -99,9 +171,9 @@ print(embeddings.shape)
|
|
| 99 |
# Get the similarity scores for the embeddings
|
| 100 |
similarities = model.similarity(embeddings, embeddings)
|
| 101 |
print(similarities)
|
| 102 |
-
# tensor([[1.
|
| 103 |
-
# [0.
|
| 104 |
-
# [0.
|
| 105 |
```
|
| 106 |
|
| 107 |
<!--
|
|
@@ -128,6 +200,32 @@ You can finetune this model on your own dataset.
|
|
| 128 |
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
|
| 129 |
-->
|
| 130 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 131 |
<!--
|
| 132 |
## Bias, Risks and Limitations
|
| 133 |
|
|
@@ -146,23 +244,49 @@ You can finetune this model on your own dataset.
|
|
| 146 |
|
| 147 |
#### Unnamed Dataset
|
| 148 |
|
| 149 |
-
* Size:
|
| 150 |
-
* Columns: <code>
|
| 151 |
* Approximate statistics based on the first 1000 samples:
|
| 152 |
-
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 153 |
|:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
|
| 154 |
| type | string | string | string |
|
| 155 |
-
| details | <ul><li>min: 6 tokens</li><li>mean: 15.
|
| 156 |
* Samples:
|
| 157 |
-
|
|
| 158 |
-
|:-----------------------------------------------------------------|:-----------------------------------------------------------------|:----------------------------------------------------------------------------------|
|
| 159 |
-
| <code>
|
| 160 |
-
| <code>
|
| 161 |
-
| <code>
|
| 162 |
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
|
| 163 |
```json
|
| 164 |
{
|
| 165 |
-
"scale":
|
| 166 |
"similarity_fct": "cos_sim",
|
| 167 |
"gather_across_devices": false
|
| 168 |
}
|
|
@@ -171,36 +295,49 @@ You can finetune this model on your own dataset.
|
|
| 171 |
### Training Hyperparameters
|
| 172 |
#### Non-Default Hyperparameters
|
| 173 |
|
| 174 |
-
- `
|
| 175 |
-
- `
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 176 |
- `fp16`: True
|
| 177 |
-
- `
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 178 |
|
| 179 |
#### All Hyperparameters
|
| 180 |
<details><summary>Click to expand</summary>
|
| 181 |
|
| 182 |
- `overwrite_output_dir`: False
|
| 183 |
- `do_predict`: False
|
| 184 |
-
- `eval_strategy`:
|
| 185 |
- `prediction_loss_only`: True
|
| 186 |
-
- `per_device_train_batch_size`:
|
| 187 |
-
- `per_device_eval_batch_size`:
|
| 188 |
- `per_gpu_train_batch_size`: None
|
| 189 |
- `per_gpu_eval_batch_size`: None
|
| 190 |
- `gradient_accumulation_steps`: 1
|
| 191 |
- `eval_accumulation_steps`: None
|
| 192 |
- `torch_empty_cache_steps`: None
|
| 193 |
-
- `learning_rate`:
|
| 194 |
-
- `weight_decay`: 0.
|
| 195 |
- `adam_beta1`: 0.9
|
| 196 |
- `adam_beta2`: 0.999
|
| 197 |
- `adam_epsilon`: 1e-08
|
| 198 |
-
- `max_grad_norm`: 1
|
| 199 |
-
- `num_train_epochs`: 3
|
| 200 |
-
- `max_steps`:
|
| 201 |
- `lr_scheduler_type`: linear
|
| 202 |
- `lr_scheduler_kwargs`: {}
|
| 203 |
-
- `warmup_ratio`: 0.
|
| 204 |
- `warmup_steps`: 0
|
| 205 |
- `log_level`: passive
|
| 206 |
- `log_level_replica`: warning
|
|
@@ -228,14 +365,14 @@ You can finetune this model on your own dataset.
|
|
| 228 |
- `tpu_num_cores`: None
|
| 229 |
- `tpu_metrics_debug`: False
|
| 230 |
- `debug`: []
|
| 231 |
-
- `dataloader_drop_last`:
|
| 232 |
-
- `dataloader_num_workers`:
|
| 233 |
-
- `dataloader_prefetch_factor`:
|
| 234 |
- `past_index`: -1
|
| 235 |
- `disable_tqdm`: False
|
| 236 |
- `remove_unused_columns`: True
|
| 237 |
- `label_names`: None
|
| 238 |
-
- `load_best_model_at_end`:
|
| 239 |
- `ignore_data_skip`: False
|
| 240 |
- `fsdp`: []
|
| 241 |
- `fsdp_min_num_params`: 0
|
|
@@ -245,23 +382,23 @@ You can finetune this model on your own dataset.
|
|
| 245 |
- `parallelism_config`: None
|
| 246 |
- `deepspeed`: None
|
| 247 |
- `label_smoothing_factor`: 0.0
|
| 248 |
-
- `optim`:
|
| 249 |
- `optim_args`: None
|
| 250 |
- `adafactor`: False
|
| 251 |
- `group_by_length`: False
|
| 252 |
- `length_column_name`: length
|
| 253 |
- `project`: huggingface
|
| 254 |
- `trackio_space_id`: trackio
|
| 255 |
-
- `ddp_find_unused_parameters`:
|
| 256 |
- `ddp_bucket_cap_mb`: None
|
| 257 |
- `ddp_broadcast_buffers`: False
|
| 258 |
- `dataloader_pin_memory`: True
|
| 259 |
- `dataloader_persistent_workers`: False
|
| 260 |
- `skip_memory_metrics`: True
|
| 261 |
- `use_legacy_prediction_loop`: False
|
| 262 |
-
- `push_to_hub`:
|
| 263 |
- `resume_from_checkpoint`: None
|
| 264 |
-
- `hub_model_id`:
|
| 265 |
- `hub_strategy`: every_save
|
| 266 |
- `hub_private_repo`: None
|
| 267 |
- `hub_always_push`: False
|
|
@@ -288,32 +425,167 @@ You can finetune this model on your own dataset.
|
|
| 288 |
- `neftune_noise_alpha`: None
|
| 289 |
- `optim_target_modules`: None
|
| 290 |
- `batch_eval_metrics`: False
|
| 291 |
-
- `eval_on_start`:
|
| 292 |
- `use_liger_kernel`: False
|
| 293 |
- `liger_kernel_config`: None
|
| 294 |
- `eval_use_gather_object`: False
|
| 295 |
- `average_tokens_across_devices`: True
|
| 296 |
- `prompts`: None
|
| 297 |
- `batch_sampler`: batch_sampler
|
| 298 |
-
- `multi_dataset_batch_sampler`:
|
| 299 |
- `router_mapping`: {}
|
| 300 |
- `learning_rate_mapping`: {}
|
| 301 |
|
| 302 |
</details>
|
| 303 |
|
| 304 |
### Training Logs
|
| 305 |
-
|
| 306 |
-
|:------:|:----:|:-------------:|
|
| 307 |
-
| 0.3199 | 500 | 0.4294 |
|
| 308 |
-
| 0.6398 | 1000 | 0.1268 |
|
| 309 |
-
| 0.9597 | 1500 | 0.1 |
|
| 310 |
-
| 1.2796 | 2000 | 0.0792 |
|
| 311 |
-
| 1.5995 | 2500 | 0.0706 |
|
| 312 |
-
| 1.9194 | 3000 | 0.0687 |
|
| 313 |
-
| 2.2393 | 3500 | 0.0584 |
|
| 314 |
-
| 2.5592 | 4000 | 0.057 |
|
| 315 |
-
| 2.8791 | 4500 | 0.0581 |
|
| 316 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 317 |
|
| 318 |
### Framework Versions
|
| 319 |
- Python: 3.10.18
|
|
|
|
| 5 |
- feature-extraction
|
| 6 |
- dense
|
| 7 |
- generated_from_trainer
|
| 8 |
+
- dataset_size:359999
|
| 9 |
- loss:MultipleNegativesRankingLoss
|
| 10 |
base_model: prajjwal1/bert-small
|
| 11 |
widget:
|
| 12 |
+
- source_sentence: Someone blocked me on Instagram. How do I unblock myself from their
|
| 13 |
+
account?
|
| 14 |
sentences:
|
| 15 |
+
- Someone blocked me on Instagram. How do I unblock myself from their account?
|
| 16 |
+
- Someone blocked me on Instagram. How do myself unblock Ifrom their account?
|
| 17 |
+
- What are some good tips for dealing with a very easily frustrated 1 year old?
|
| 18 |
+
- source_sentence: Do you love the life you live?
|
| 19 |
sentences:
|
| 20 |
+
- What is Jakob Nowell, Bradley Nowell's son, up to and will he pursue a career
|
| 21 |
+
in music?
|
| 22 |
+
- Do you love the life you're living?
|
| 23 |
+
- Do you love not the life you live ?
|
| 24 |
+
- source_sentence: I had sex on the 9th and my period started on the 11th. Could I
|
| 25 |
+
still get pregnant?
|
| 26 |
sentences:
|
| 27 |
+
- How can I earn money easily online?
|
| 28 |
+
- If I have sex on the day of my ovulation and I get my period two weeks later,
|
| 29 |
+
can I still be pregnant?
|
| 30 |
+
- I did not have sex on the 9th and my period started on the 11th . Could I still
|
| 31 |
+
get pregnant ?
|
| 32 |
+
- source_sentence: Would you read book at your office?
|
| 33 |
sentences:
|
| 34 |
+
- Would book read youat your office?
|
| 35 |
+
- I am a married woman and I'm in love with married man. what should I do?
|
| 36 |
+
- Would you read book at your office?
|
| 37 |
+
- source_sentence: How do you earn money on Quora?
|
| 38 |
sentences:
|
| 39 |
+
- Ordered food on Swiggy 3 days ago.After accepting my money, said no more on Menu!
|
| 40 |
+
When if ever will I atleast get refund in cr card a/c?
|
| 41 |
+
- How do you earn not money on Quora ?
|
| 42 |
+
- What is the best way to make money on Quora?
|
| 43 |
pipeline_tag: sentence-similarity
|
| 44 |
library_name: sentence-transformers
|
| 45 |
+
metrics:
|
| 46 |
+
- cosine_accuracy@1
|
| 47 |
+
- cosine_accuracy@3
|
| 48 |
+
- cosine_accuracy@5
|
| 49 |
+
- cosine_precision@1
|
| 50 |
+
- cosine_precision@3
|
| 51 |
+
- cosine_precision@5
|
| 52 |
+
- cosine_recall@1
|
| 53 |
+
- cosine_recall@3
|
| 54 |
+
- cosine_recall@5
|
| 55 |
+
- cosine_ndcg@10
|
| 56 |
+
- cosine_mrr@1
|
| 57 |
+
- cosine_mrr@5
|
| 58 |
+
- cosine_mrr@10
|
| 59 |
+
- cosine_map@100
|
| 60 |
+
model-index:
|
| 61 |
+
- name: SentenceTransformer based on prajjwal1/bert-small
|
| 62 |
+
results:
|
| 63 |
+
- task:
|
| 64 |
+
type: information-retrieval
|
| 65 |
+
name: Information Retrieval
|
| 66 |
+
dataset:
|
| 67 |
+
name: val
|
| 68 |
+
type: val
|
| 69 |
+
metrics:
|
| 70 |
+
- type: cosine_accuracy@1
|
| 71 |
+
value: 0.761525
|
| 72 |
+
name: Cosine Accuracy@1
|
| 73 |
+
- type: cosine_accuracy@3
|
| 74 |
+
value: 0.826125
|
| 75 |
+
name: Cosine Accuracy@3
|
| 76 |
+
- type: cosine_accuracy@5
|
| 77 |
+
value: 0.85095
|
| 78 |
+
name: Cosine Accuracy@5
|
| 79 |
+
- type: cosine_precision@1
|
| 80 |
+
value: 0.761525
|
| 81 |
+
name: Cosine Precision@1
|
| 82 |
+
- type: cosine_precision@3
|
| 83 |
+
value: 0.275375
|
| 84 |
+
name: Cosine Precision@3
|
| 85 |
+
- type: cosine_precision@5
|
| 86 |
+
value: 0.17019000000000004
|
| 87 |
+
name: Cosine Precision@5
|
| 88 |
+
- type: cosine_recall@1
|
| 89 |
+
value: 0.761525
|
| 90 |
+
name: Cosine Recall@1
|
| 91 |
+
- type: cosine_recall@3
|
| 92 |
+
value: 0.826125
|
| 93 |
+
name: Cosine Recall@3
|
| 94 |
+
- type: cosine_recall@5
|
| 95 |
+
value: 0.85095
|
| 96 |
+
name: Cosine Recall@5
|
| 97 |
+
- type: cosine_ndcg@10
|
| 98 |
+
value: 0.8202534934281767
|
| 99 |
+
name: Cosine Ndcg@10
|
| 100 |
+
- type: cosine_mrr@1
|
| 101 |
+
value: 0.761525
|
| 102 |
+
name: Cosine Mrr@1
|
| 103 |
+
- type: cosine_mrr@5
|
| 104 |
+
value: 0.7961479166666627
|
| 105 |
+
name: Cosine Mrr@5
|
| 106 |
+
- type: cosine_mrr@10
|
| 107 |
+
value: 0.8004402281746008
|
| 108 |
+
name: Cosine Mrr@10
|
| 109 |
+
- type: cosine_map@100
|
| 110 |
+
value: 0.8038638243708912
|
| 111 |
+
name: Cosine Map@100
|
| 112 |
---
|
| 113 |
|
| 114 |
# SentenceTransformer based on prajjwal1/bert-small
|
|
|
|
| 157 |
from sentence_transformers import SentenceTransformer
|
| 158 |
|
| 159 |
# Download from the 🤗 Hub
|
| 160 |
+
model = SentenceTransformer("redis/model-b-structured")
|
| 161 |
# Run inference
|
| 162 |
sentences = [
|
| 163 |
+
'How do you earn money on Quora?',
|
| 164 |
+
'What is the best way to make money on Quora?',
|
| 165 |
+
'How do you earn not money on Quora ?',
|
| 166 |
]
|
| 167 |
embeddings = model.encode(sentences)
|
| 168 |
print(embeddings.shape)
|
|
|
|
| 171 |
# Get the similarity scores for the embeddings
|
| 172 |
similarities = model.similarity(embeddings, embeddings)
|
| 173 |
print(similarities)
|
| 174 |
+
# tensor([[ 1.0001, 0.9996, -0.3001],
|
| 175 |
+
# [ 0.9996, 1.0000, -0.3007],
|
| 176 |
+
# [-0.3001, -0.3007, 1.0003]])
|
| 177 |
```
|
| 178 |
|
| 179 |
<!--
|
|
|
|
| 200 |
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
|
| 201 |
-->
|
| 202 |
|
| 203 |
+
## Evaluation
|
| 204 |
+
|
| 205 |
+
### Metrics
|
| 206 |
+
|
| 207 |
+
#### Information Retrieval
|
| 208 |
+
|
| 209 |
+
* Dataset: `val`
|
| 210 |
+
* Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
|
| 211 |
+
|
| 212 |
+
| Metric | Value |
|
| 213 |
+
|:-------------------|:-----------|
|
| 214 |
+
| cosine_accuracy@1 | 0.7615 |
|
| 215 |
+
| cosine_accuracy@3 | 0.8261 |
|
| 216 |
+
| cosine_accuracy@5 | 0.8509 |
|
| 217 |
+
| cosine_precision@1 | 0.7615 |
|
| 218 |
+
| cosine_precision@3 | 0.2754 |
|
| 219 |
+
| cosine_precision@5 | 0.1702 |
|
| 220 |
+
| cosine_recall@1 | 0.7615 |
|
| 221 |
+
| cosine_recall@3 | 0.8261 |
|
| 222 |
+
| cosine_recall@5 | 0.8509 |
|
| 223 |
+
| **cosine_ndcg@10** | **0.8203** |
|
| 224 |
+
| cosine_mrr@1 | 0.7615 |
|
| 225 |
+
| cosine_mrr@5 | 0.7961 |
|
| 226 |
+
| cosine_mrr@10 | 0.8004 |
|
| 227 |
+
| cosine_map@100 | 0.8039 |
|
| 228 |
+
|
| 229 |
<!--
|
| 230 |
## Bias, Risks and Limitations
|
| 231 |
|
|
|
|
| 244 |
|
| 245 |
#### Unnamed Dataset
|
| 246 |
|
| 247 |
+
* Size: 359,999 training samples
|
| 248 |
+
* Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
|
| 249 |
* Approximate statistics based on the first 1000 samples:
|
| 250 |
+
| | anchor | positive | negative |
|
| 251 |
+
|:--------|:---------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
|
| 252 |
+
| type | string | string | string |
|
| 253 |
+
| details | <ul><li>min: 6 tokens</li><li>mean: 15.4 tokens</li><li>max: 60 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 15.45 tokens</li><li>max: 78 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 16.07 tokens</li><li>max: 62 tokens</li></ul> |
|
| 254 |
+
* Samples:
|
| 255 |
+
| anchor | positive | negative |
|
| 256 |
+
|:--------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------|
|
| 257 |
+
| <code>Shall I upgrade my iPhone 5s to iOS 10 final version?</code> | <code>Should I upgrade an iPhone 5s to iOS 10?</code> | <code>Shall I upgrade not my iPhone 5s to iOS 10 final version ?</code> |
|
| 258 |
+
| <code>Do Census Bureau income figures count sources of unearned income, or do they just count earned income?</code> | <code>Do Census Bureau income figures count sources of unearned income, or do they just count earned income?</code> | <code>Do Census Bureau income figures count sources of unearned income, or do income just count earned they?</code> |
|
| 259 |
+
| <code>Who has the highest IQ?</code> | <code>Who has the highest IQ?</code> | <code>the highest IQ has Who?</code> |
|
| 260 |
+
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
|
| 261 |
+
```json
|
| 262 |
+
{
|
| 263 |
+
"scale": 1.0,
|
| 264 |
+
"similarity_fct": "cos_sim",
|
| 265 |
+
"gather_across_devices": false
|
| 266 |
+
}
|
| 267 |
+
```
|
| 268 |
+
|
| 269 |
+
### Evaluation Dataset
|
| 270 |
+
|
| 271 |
+
#### Unnamed Dataset
|
| 272 |
+
|
| 273 |
+
* Size: 40,000 evaluation samples
|
| 274 |
+
* Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
|
| 275 |
+
* Approximate statistics based on the first 1000 samples:
|
| 276 |
+
| | anchor | positive | negative |
|
| 277 |
|:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
|
| 278 |
| type | string | string | string |
|
| 279 |
+
| details | <ul><li>min: 6 tokens</li><li>mean: 15.86 tokens</li><li>max: 65 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 15.94 tokens</li><li>max: 65 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 16.46 tokens</li><li>max: 66 tokens</li></ul> |
|
| 280 |
* Samples:
|
| 281 |
+
| anchor | positive | negative |
|
| 282 |
+
|:------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------|
|
| 283 |
+
| <code>What are some mind-blowing Iphone gadgets and tools that most people don't know about?</code> | <code>What are some mind-blowing iphone tools that most people don't know about?</code> | <code>most people are some mind-blowing Iphone gadgets and tools that Whatdon't know about?</code> |
|
| 284 |
+
| <code>If FOX News is the conservative news station, which cable news network is for liberals/progressives?</code> | <code>If FOX News is the conservative news station, which cable news network is for liberals/progressives?</code> | <code>If FOX News is not the conservative news station , which cable news network is for liberals / progressives ?</code> |
|
| 285 |
+
| <code>How can guys last longer during sex?</code> | <code>How do I last longer in sex?</code> | <code>How can guys last not longer during sex ?</code> |
|
| 286 |
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
|
| 287 |
```json
|
| 288 |
{
|
| 289 |
+
"scale": 1.0,
|
| 290 |
"similarity_fct": "cos_sim",
|
| 291 |
"gather_across_devices": false
|
| 292 |
}
|
|
|
|
| 295 |
### Training Hyperparameters
|
| 296 |
#### Non-Default Hyperparameters
|
| 297 |
|
| 298 |
+
- `eval_strategy`: steps
|
| 299 |
+
- `per_device_train_batch_size`: 256
|
| 300 |
+
- `per_device_eval_batch_size`: 256
|
| 301 |
+
- `learning_rate`: 2e-05
|
| 302 |
+
- `weight_decay`: 0.001
|
| 303 |
+
- `max_steps`: 14060
|
| 304 |
+
- `warmup_ratio`: 0.1
|
| 305 |
- `fp16`: True
|
| 306 |
+
- `dataloader_drop_last`: True
|
| 307 |
+
- `dataloader_num_workers`: 1
|
| 308 |
+
- `dataloader_prefetch_factor`: 1
|
| 309 |
+
- `load_best_model_at_end`: True
|
| 310 |
+
- `optim`: adamw_torch
|
| 311 |
+
- `ddp_find_unused_parameters`: False
|
| 312 |
+
- `push_to_hub`: True
|
| 313 |
+
- `hub_model_id`: redis/model-b-structured
|
| 314 |
+
- `eval_on_start`: True
|
| 315 |
|
| 316 |
#### All Hyperparameters
|
| 317 |
<details><summary>Click to expand</summary>
|
| 318 |
|
| 319 |
- `overwrite_output_dir`: False
|
| 320 |
- `do_predict`: False
|
| 321 |
+
- `eval_strategy`: steps
|
| 322 |
- `prediction_loss_only`: True
|
| 323 |
+
- `per_device_train_batch_size`: 256
|
| 324 |
+
- `per_device_eval_batch_size`: 256
|
| 325 |
- `per_gpu_train_batch_size`: None
|
| 326 |
- `per_gpu_eval_batch_size`: None
|
| 327 |
- `gradient_accumulation_steps`: 1
|
| 328 |
- `eval_accumulation_steps`: None
|
| 329 |
- `torch_empty_cache_steps`: None
|
| 330 |
+
- `learning_rate`: 2e-05
|
| 331 |
+
- `weight_decay`: 0.001
|
| 332 |
- `adam_beta1`: 0.9
|
| 333 |
- `adam_beta2`: 0.999
|
| 334 |
- `adam_epsilon`: 1e-08
|
| 335 |
+
- `max_grad_norm`: 1.0
|
| 336 |
+
- `num_train_epochs`: 3.0
|
| 337 |
+
- `max_steps`: 14060
|
| 338 |
- `lr_scheduler_type`: linear
|
| 339 |
- `lr_scheduler_kwargs`: {}
|
| 340 |
+
- `warmup_ratio`: 0.1
|
| 341 |
- `warmup_steps`: 0
|
| 342 |
- `log_level`: passive
|
| 343 |
- `log_level_replica`: warning
|
|
|
|
| 365 |
- `tpu_num_cores`: None
|
| 366 |
- `tpu_metrics_debug`: False
|
| 367 |
- `debug`: []
|
| 368 |
+
- `dataloader_drop_last`: True
|
| 369 |
+
- `dataloader_num_workers`: 1
|
| 370 |
+
- `dataloader_prefetch_factor`: 1
|
| 371 |
- `past_index`: -1
|
| 372 |
- `disable_tqdm`: False
|
| 373 |
- `remove_unused_columns`: True
|
| 374 |
- `label_names`: None
|
| 375 |
+
- `load_best_model_at_end`: True
|
| 376 |
- `ignore_data_skip`: False
|
| 377 |
- `fsdp`: []
|
| 378 |
- `fsdp_min_num_params`: 0
|
|
|
|
| 382 |
- `parallelism_config`: None
|
| 383 |
- `deepspeed`: None
|
| 384 |
- `label_smoothing_factor`: 0.0
|
| 385 |
+
- `optim`: adamw_torch
|
| 386 |
- `optim_args`: None
|
| 387 |
- `adafactor`: False
|
| 388 |
- `group_by_length`: False
|
| 389 |
- `length_column_name`: length
|
| 390 |
- `project`: huggingface
|
| 391 |
- `trackio_space_id`: trackio
|
| 392 |
+
- `ddp_find_unused_parameters`: False
|
| 393 |
- `ddp_bucket_cap_mb`: None
|
| 394 |
- `ddp_broadcast_buffers`: False
|
| 395 |
- `dataloader_pin_memory`: True
|
| 396 |
- `dataloader_persistent_workers`: False
|
| 397 |
- `skip_memory_metrics`: True
|
| 398 |
- `use_legacy_prediction_loop`: False
|
| 399 |
+
- `push_to_hub`: True
|
| 400 |
- `resume_from_checkpoint`: None
|
| 401 |
+
- `hub_model_id`: redis/model-b-structured
|
| 402 |
- `hub_strategy`: every_save
|
| 403 |
- `hub_private_repo`: None
|
| 404 |
- `hub_always_push`: False
|
|
|
|
| 425 |
- `neftune_noise_alpha`: None
|
| 426 |
- `optim_target_modules`: None
|
| 427 |
- `batch_eval_metrics`: False
|
| 428 |
+
- `eval_on_start`: True
|
| 429 |
- `use_liger_kernel`: False
|
| 430 |
- `liger_kernel_config`: None
|
| 431 |
- `eval_use_gather_object`: False
|
| 432 |
- `average_tokens_across_devices`: True
|
| 433 |
- `prompts`: None
|
| 434 |
- `batch_sampler`: batch_sampler
|
| 435 |
+
- `multi_dataset_batch_sampler`: proportional
|
| 436 |
- `router_mapping`: {}
|
| 437 |
- `learning_rate_mapping`: {}
|
| 438 |
|
| 439 |
</details>
|
| 440 |
|
| 441 |
### Training Logs
|
| 442 |
+
<details><summary>Click to expand</summary>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 443 |
|
| 444 |
+
| Epoch | Step | Training Loss | Validation Loss | val_cosine_ndcg@10 |
|
| 445 |
+
|:------:|:-----:|:-------------:|:---------------:|:------------------:|
|
| 446 |
+
| 0 | 0 | - | 5.9531 | 0.7603 |
|
| 447 |
+
| 0.0711 | 100 | 5.9694 | 5.7072 | 0.7792 |
|
| 448 |
+
| 0.1422 | 200 | 5.7181 | 5.4263 | 0.7865 |
|
| 449 |
+
| 0.2134 | 300 | 5.5628 | 5.3443 | 0.7829 |
|
| 450 |
+
| 0.2845 | 400 | 5.4947 | 5.3221 | 0.7774 |
|
| 451 |
+
| 0.3556 | 500 | 5.4597 | 5.3180 | 0.7741 |
|
| 452 |
+
| 0.4267 | 600 | 5.4387 | 5.3158 | 0.7737 |
|
| 453 |
+
| 0.4979 | 700 | 5.423 | 5.3141 | 0.7751 |
|
| 454 |
+
| 0.5690 | 800 | 5.4108 | 5.3109 | 0.7848 |
|
| 455 |
+
| 0.6401 | 900 | 5.397 | 5.2923 | 0.8008 |
|
| 456 |
+
| 0.7112 | 1000 | 5.3724 | 5.2839 | 0.8004 |
|
| 457 |
+
| 0.7824 | 1100 | 5.3625 | 5.2804 | 0.8007 |
|
| 458 |
+
| 0.8535 | 1200 | 5.355 | 5.2777 | 0.8013 |
|
| 459 |
+
| 0.9246 | 1300 | 5.3499 | 5.2748 | 0.8030 |
|
| 460 |
+
| 0.9957 | 1400 | 5.3442 | 5.2729 | 0.8067 |
|
| 461 |
+
| 1.0669 | 1500 | 5.3382 | 5.2624 | 0.8103 |
|
| 462 |
+
| 1.1380 | 1600 | 5.3254 | 5.2557 | 0.8138 |
|
| 463 |
+
| 1.2091 | 1700 | 5.3159 | 5.2441 | 0.8163 |
|
| 464 |
+
| 1.2802 | 1800 | 5.3035 | 5.2350 | 0.8180 |
|
| 465 |
+
| 1.3514 | 1900 | 5.295 | 5.2303 | 0.8179 |
|
| 466 |
+
| 1.4225 | 2000 | 5.2925 | 5.2292 | 0.8182 |
|
| 467 |
+
| 1.4936 | 2100 | 5.2881 | 5.2271 | 0.8187 |
|
| 468 |
+
| 1.5647 | 2200 | 5.2854 | 5.2258 | 0.8187 |
|
| 469 |
+
| 1.6358 | 2300 | 5.2831 | 5.2258 | 0.8189 |
|
| 470 |
+
| 1.7070 | 2400 | 5.2805 | 5.2247 | 0.8192 |
|
| 471 |
+
| 1.7781 | 2500 | 5.278 | 5.2247 | 0.8186 |
|
| 472 |
+
| 1.8492 | 2600 | 5.2761 | 5.2230 | 0.8184 |
|
| 473 |
+
| 1.9203 | 2700 | 5.2754 | 5.2221 | 0.8185 |
|
| 474 |
+
| 1.9915 | 2800 | 5.274 | 5.2228 | 0.8185 |
|
| 475 |
+
| 2.0626 | 2900 | 5.2722 | 5.2209 | 0.8175 |
|
| 476 |
+
| 2.1337 | 3000 | 5.2708 | 5.2206 | 0.8182 |
|
| 477 |
+
| 2.2048 | 3100 | 5.2686 | 5.2211 | 0.8194 |
|
| 478 |
+
| 2.2760 | 3200 | 5.2666 | 5.2204 | 0.8186 |
|
| 479 |
+
| 2.3471 | 3300 | 5.2671 | 5.2192 | 0.8191 |
|
| 480 |
+
| 2.4182 | 3400 | 5.2657 | 5.2200 | 0.8188 |
|
| 481 |
+
| 2.4893 | 3500 | 5.2638 | 5.2188 | 0.8184 |
|
| 482 |
+
| 2.5605 | 3600 | 5.2635 | 5.2189 | 0.8188 |
|
| 483 |
+
| 2.6316 | 3700 | 5.2624 | 5.2187 | 0.8192 |
|
| 484 |
+
| 2.7027 | 3800 | 5.262 | 5.2178 | 0.8182 |
|
| 485 |
+
| 2.7738 | 3900 | 5.2608 | 5.2175 | 0.8188 |
|
| 486 |
+
| 2.8450 | 4000 | 5.2595 | 5.2179 | 0.8189 |
|
| 487 |
+
| 2.9161 | 4100 | 5.2599 | 5.2163 | 0.8191 |
|
| 488 |
+
| 2.9872 | 4200 | 5.2587 | 5.2162 | 0.8184 |
|
| 489 |
+
| 3.0583 | 4300 | 5.2574 | 5.2168 | 0.8193 |
|
| 490 |
+
| 3.1294 | 4400 | 5.256 | 5.2165 | 0.8197 |
|
| 491 |
+
| 3.2006 | 4500 | 5.2551 | 5.2158 | 0.8188 |
|
| 492 |
+
| 3.2717 | 4600 | 5.2552 | 5.2159 | 0.8188 |
|
| 493 |
+
| 3.3428 | 4700 | 5.2549 | 5.2157 | 0.8192 |
|
| 494 |
+
| 3.4139 | 4800 | 5.2531 | 5.2154 | 0.8192 |
|
| 495 |
+
| 3.4851 | 4900 | 5.2534 | 5.2152 | 0.8191 |
|
| 496 |
+
| 3.5562 | 5000 | 5.2528 | 5.2146 | 0.8197 |
|
| 497 |
+
| 3.6273 | 5100 | 5.2521 | 5.2149 | 0.8193 |
|
| 498 |
+
| 3.6984 | 5200 | 5.2509 | 5.2145 | 0.8199 |
|
| 499 |
+
| 3.7696 | 5300 | 5.2509 | 5.2144 | 0.8189 |
|
| 500 |
+
| 3.8407 | 5400 | 5.2495 | 5.2139 | 0.8195 |
|
| 501 |
+
| 3.9118 | 5500 | 5.2496 | 5.2140 | 0.8195 |
|
| 502 |
+
| 3.9829 | 5600 | 5.2505 | 5.2135 | 0.8193 |
|
| 503 |
+
| 4.0541 | 5700 | 5.2478 | 5.2140 | 0.8197 |
|
| 504 |
+
| 4.1252 | 5800 | 5.2476 | 5.2136 | 0.8196 |
|
| 505 |
+
| 4.1963 | 5900 | 5.248 | 5.2130 | 0.8199 |
|
| 506 |
+
| 4.2674 | 6000 | 5.2482 | 5.2129 | 0.8196 |
|
| 507 |
+
| 4.3385 | 6100 | 5.2466 | 5.2135 | 0.8196 |
|
| 508 |
+
| 4.4097 | 6200 | 5.2461 | 5.2126 | 0.8196 |
|
| 509 |
+
| 4.4808 | 6300 | 5.2453 | 5.2124 | 0.8196 |
|
| 510 |
+
| 4.5519 | 6400 | 5.2448 | 5.2128 | 0.8197 |
|
| 511 |
+
| 4.6230 | 6500 | 5.2439 | 5.2124 | 0.8193 |
|
| 512 |
+
| 4.6942 | 6600 | 5.244 | 5.2123 | 0.8192 |
|
| 513 |
+
| 4.7653 | 6700 | 5.2428 | 5.2114 | 0.8192 |
|
| 514 |
+
| 4.8364 | 6800 | 5.2433 | 5.2112 | 0.8197 |
|
| 515 |
+
| 4.9075 | 6900 | 5.2439 | 5.2117 | 0.8194 |
|
| 516 |
+
| 4.9787 | 7000 | 5.2422 | 5.2121 | 0.8204 |
|
| 517 |
+
| 5.0498 | 7100 | 5.2425 | 5.2114 | 0.8198 |
|
| 518 |
+
| 5.1209 | 7200 | 5.2418 | 5.2113 | 0.8201 |
|
| 519 |
+
| 5.1920 | 7300 | 5.2416 | 5.2113 | 0.8200 |
|
| 520 |
+
| 5.2632 | 7400 | 5.2405 | 5.2109 | 0.8199 |
|
| 521 |
+
| 5.3343 | 7500 | 5.242 | 5.2106 | 0.8197 |
|
| 522 |
+
| 5.4054 | 7600 | 5.2402 | 5.2105 | 0.8199 |
|
| 523 |
+
| 5.4765 | 7700 | 5.2393 | 5.2108 | 0.8203 |
|
| 524 |
+
| 5.5477 | 7800 | 5.24 | 5.2104 | 0.8198 |
|
| 525 |
+
| 5.6188 | 7900 | 5.2395 | 5.2103 | 0.8201 |
|
| 526 |
+
| 5.6899 | 8000 | 5.2381 | 5.2102 | 0.8198 |
|
| 527 |
+
| 5.7610 | 8100 | 5.2399 | 5.2102 | 0.8195 |
|
| 528 |
+
| 5.8321 | 8200 | 5.2395 | 5.2100 | 0.8195 |
|
| 529 |
+
| 5.9033 | 8300 | 5.2377 | 5.2100 | 0.8197 |
|
| 530 |
+
| 5.9744 | 8400 | 5.238 | 5.2097 | 0.8198 |
|
| 531 |
+
| 6.0455 | 8500 | 5.2372 | 5.2097 | 0.8200 |
|
| 532 |
+
| 6.1166 | 8600 | 5.2368 | 5.2095 | 0.8200 |
|
| 533 |
+
| 6.1878 | 8700 | 5.2378 | 5.2096 | 0.8201 |
|
| 534 |
+
| 6.2589 | 8800 | 5.2372 | 5.2097 | 0.8197 |
|
| 535 |
+
| 6.3300 | 8900 | 5.2365 | 5.2098 | 0.8197 |
|
| 536 |
+
| 6.4011 | 9000 | 5.2367 | 5.2092 | 0.8199 |
|
| 537 |
+
| 6.4723 | 9100 | 5.2364 | 5.2093 | 0.8197 |
|
| 538 |
+
| 6.5434 | 9200 | 5.2362 | 5.2095 | 0.8202 |
|
| 539 |
+
| 6.6145 | 9300 | 5.2359 | 5.2096 | 0.8199 |
|
| 540 |
+
| 6.6856 | 9400 | 5.2345 | 5.2095 | 0.8203 |
|
| 541 |
+
| 6.7568 | 9500 | 5.2362 | 5.2090 | 0.8202 |
|
| 542 |
+
| 6.8279 | 9600 | 5.2353 | 5.2089 | 0.8201 |
|
| 543 |
+
| 6.8990 | 9700 | 5.2346 | 5.2090 | 0.8203 |
|
| 544 |
+
| 6.9701 | 9800 | 5.2354 | 5.2090 | 0.8202 |
|
| 545 |
+
| 7.0413 | 9900 | 5.234 | 5.2089 | 0.8202 |
|
| 546 |
+
| 7.1124 | 10000 | 5.2334 | 5.2087 | 0.8202 |
|
| 547 |
+
| 7.1835 | 10100 | 5.2342 | 5.2089 | 0.8204 |
|
| 548 |
+
| 7.2546 | 10200 | 5.2342 | 5.2089 | 0.8204 |
|
| 549 |
+
| 7.3257 | 10300 | 5.2336 | 5.2085 | 0.8203 |
|
| 550 |
+
| 7.3969 | 10400 | 5.2347 | 5.2086 | 0.8206 |
|
| 551 |
+
| 7.4680 | 10500 | 5.2326 | 5.2086 | 0.8203 |
|
| 552 |
+
| 7.5391 | 10600 | 5.2336 | 5.2082 | 0.8201 |
|
| 553 |
+
| 7.6102 | 10700 | 5.2328 | 5.2084 | 0.8202 |
|
| 554 |
+
| 7.6814 | 10800 | 5.2328 | 5.2085 | 0.8203 |
|
| 555 |
+
| 7.7525 | 10900 | 5.2321 | 5.2083 | 0.8201 |
|
| 556 |
+
| 7.8236 | 11000 | 5.2332 | 5.2082 | 0.8202 |
|
| 557 |
+
| 7.8947 | 11100 | 5.2325 | 5.2082 | 0.8202 |
|
| 558 |
+
| 7.9659 | 11200 | 5.2331 | 5.2082 | 0.8200 |
|
| 559 |
+
| 8.0370 | 11300 | 5.2322 | 5.2081 | 0.8202 |
|
| 560 |
+
| 8.1081 | 11400 | 5.2324 | 5.2082 | 0.8206 |
|
| 561 |
+
| 8.1792 | 11500 | 5.2318 | 5.2080 | 0.8200 |
|
| 562 |
+
| 8.2504 | 11600 | 5.2314 | 5.2082 | 0.8202 |
|
| 563 |
+
| 8.3215 | 11700 | 5.2318 | 5.2082 | 0.8202 |
|
| 564 |
+
| 8.3926 | 11800 | 5.2317 | 5.2078 | 0.8203 |
|
| 565 |
+
| 8.4637 | 11900 | 5.2312 | 5.2078 | 0.8202 |
|
| 566 |
+
| 8.5349 | 12000 | 5.2327 | 5.2079 | 0.8201 |
|
| 567 |
+
| 8.6060 | 12100 | 5.2316 | 5.2077 | 0.8203 |
|
| 568 |
+
| 8.6771 | 12200 | 5.2317 | 5.2078 | 0.8204 |
|
| 569 |
+
| 8.7482 | 12300 | 5.2301 | 5.2079 | 0.8202 |
|
| 570 |
+
| 8.8193 | 12400 | 5.2308 | 5.2077 | 0.8201 |
|
| 571 |
+
| 8.8905 | 12500 | 5.2306 | 5.2078 | 0.8200 |
|
| 572 |
+
| 8.9616 | 12600 | 5.231 | 5.2077 | 0.8200 |
|
| 573 |
+
| 9.0327 | 12700 | 5.2307 | 5.2076 | 0.8199 |
|
| 574 |
+
| 9.1038 | 12800 | 5.2309 | 5.2076 | 0.8201 |
|
| 575 |
+
| 9.1750 | 12900 | 5.2301 | 5.2076 | 0.8200 |
|
| 576 |
+
| 9.2461 | 13000 | 5.231 | 5.2076 | 0.8202 |
|
| 577 |
+
| 9.3172 | 13100 | 5.2312 | 5.2075 | 0.8201 |
|
| 578 |
+
| 9.3883 | 13200 | 5.2304 | 5.2077 | 0.8204 |
|
| 579 |
+
| 9.4595 | 13300 | 5.2304 | 5.2075 | 0.8202 |
|
| 580 |
+
| 9.5306 | 13400 | 5.2312 | 5.2076 | 0.8203 |
|
| 581 |
+
| 9.6017 | 13500 | 5.2304 | 5.2076 | 0.8204 |
|
| 582 |
+
| 9.6728 | 13600 | 5.2309 | 5.2076 | 0.8203 |
|
| 583 |
+
| 9.7440 | 13700 | 5.23 | 5.2075 | 0.8202 |
|
| 584 |
+
| 9.8151 | 13800 | 5.2301 | 5.2075 | 0.8201 |
|
| 585 |
+
| 9.8862 | 13900 | 5.231 | 5.2075 | 0.8203 |
|
| 586 |
+
| 9.9573 | 14000 | 5.2303 | 5.2075 | 0.8203 |
|
| 587 |
+
|
| 588 |
+
</details>
|
| 589 |
|
| 590 |
### Framework Versions
|
| 591 |
- Python: 3.10.18
|