Yash911 commited on
Commit
b0fb143
·
1 Parent(s): 4b02610

Upload folder using huggingface_hub

Browse files
1_Pooling/config.json CHANGED
@@ -1,10 +1,10 @@
1
  {
2
- "word_embedding_dimension": 1024,
3
- "pooling_mode_cls_token": false,
4
- "pooling_mode_mean_tokens": true,
5
- "pooling_mode_max_tokens": false,
6
- "pooling_mode_mean_sqrt_len_tokens": false,
7
- "pooling_mode_weightedmean_tokens": false,
8
- "pooling_mode_lasttoken": false,
9
- "include_prompt": true
10
  }
 
1
  {
2
+ "word_embedding_dimension": 1024,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
  }
README.md CHANGED
@@ -3,10 +3,11 @@ tags:
3
  - sentence-transformers
4
  - sentence-similarity
5
  - feature-extraction
 
6
  - generated_from_trainer
7
  - dataset_size:50000
8
  - loss:MultipleNegativesRankingLoss
9
- base_model: intfloat/e5-large-v2
10
  widget:
11
  - source_sentence: AVS Video Editor AVS Video Editor is a video editing software published
12
  by Online Media Technologies Ltd. It is a part of AVS4YOU software suite which
@@ -143,15 +144,15 @@ pipeline_tag: sentence-similarity
143
  library_name: sentence-transformers
144
  ---
145
 
146
- # SentenceTransformer based on intfloat/e5-large-v2
147
 
148
- This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [intfloat/e5-large-v2](https://huggingface.co/intfloat/e5-large-v2). It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
149
 
150
  ## Model Details
151
 
152
  ### Model Description
153
  - **Model Type:** Sentence Transformer
154
- - **Base model:** [intfloat/e5-large-v2](https://huggingface.co/intfloat/e5-large-v2) <!-- at revision f169b11e22de13617baa190a028a32f3493550b6 -->
155
  - **Maximum Sequence Length:** 512 tokens
156
  - **Output Dimensionality:** 1024 dimensions
157
  - **Similarity Function:** Cosine Similarity
@@ -169,7 +170,7 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [i
169
 
170
  ```
171
  SentenceTransformer(
172
- (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
173
  (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
174
  (2): Normalize()
175
  )
@@ -203,8 +204,10 @@ print(embeddings.shape)
203
 
204
  # Get the similarity scores for the embeddings
205
  similarities = model.similarity(embeddings, embeddings)
206
- print(similarities.shape)
207
- # [3, 3]
 
 
208
  ```
209
 
210
  <!--
@@ -266,7 +269,8 @@ You can finetune this model on your own dataset.
266
  ```json
267
  {
268
  "scale": 20.0,
269
- "similarity_fct": "cos_sim"
 
270
  }
271
  ```
272
 
@@ -393,29 +397,31 @@ You can finetune this model on your own dataset.
393
  - `prompts`: None
394
  - `batch_sampler`: batch_sampler
395
  - `multi_dataset_batch_sampler`: round_robin
 
 
396
 
397
  </details>
398
 
399
  ### Training Logs
400
  | Epoch | Step | Training Loss |
401
  |:-----:|:----:|:-------------:|
402
- | 0.08 | 500 | 0.3751 |
403
- | 0.16 | 1000 | 0.1414 |
404
- | 0.24 | 1500 | 0.1219 |
405
- | 0.32 | 2000 | 0.0979 |
406
- | 0.4 | 2500 | 0.083 |
407
- | 0.48 | 3000 | 0.067 |
408
- | 0.56 | 3500 | 0.0645 |
409
- | 0.64 | 4000 | 0.0578 |
410
- | 0.72 | 4500 | 0.0454 |
411
- | 0.8 | 5000 | 0.0404 |
412
- | 0.88 | 5500 | 0.0419 |
413
- | 0.96 | 6000 | 0.0402 |
414
 
415
 
416
  ### Framework Versions
417
  - Python: 3.11.13
418
- - Sentence Transformers: 4.1.0
419
  - Transformers: 4.52.4
420
  - PyTorch: 2.6.0+cu124
421
  - Accelerate: 1.8.1
 
3
  - sentence-transformers
4
  - sentence-similarity
5
  - feature-extraction
6
+ - dense
7
  - generated_from_trainer
8
  - dataset_size:50000
9
  - loss:MultipleNegativesRankingLoss
10
+ base_model: Yash911/e5-finetuned
11
  widget:
12
  - source_sentence: AVS Video Editor AVS Video Editor is a video editing software published
13
  by Online Media Technologies Ltd. It is a part of AVS4YOU software suite which
 
144
  library_name: sentence-transformers
145
  ---
146
 
147
+ # SentenceTransformer based on Yash911/e5-finetuned
148
 
149
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Yash911/e5-finetuned](https://huggingface.co/Yash911/e5-finetuned). It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
150
 
151
  ## Model Details
152
 
153
  ### Model Description
154
  - **Model Type:** Sentence Transformer
155
+ - **Base model:** [Yash911/e5-finetuned](https://huggingface.co/Yash911/e5-finetuned) <!-- at revision 3225106ac46f6cb6475281b1d428aad055924db7 -->
156
  - **Maximum Sequence Length:** 512 tokens
157
  - **Output Dimensionality:** 1024 dimensions
158
  - **Similarity Function:** Cosine Similarity
 
170
 
171
  ```
172
  SentenceTransformer(
173
+ (0): Transformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'BertModel'})
174
  (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
175
  (2): Normalize()
176
  )
 
204
 
205
  # Get the similarity scores for the embeddings
206
  similarities = model.similarity(embeddings, embeddings)
207
+ print(similarities)
208
+ # tensor([[ 1.0000, 0.6894, -0.0088],
209
+ # [ 0.6894, 1.0000, -0.0543],
210
+ # [-0.0088, -0.0543, 1.0000]])
211
  ```
212
 
213
  <!--
 
269
  ```json
270
  {
271
  "scale": 20.0,
272
+ "similarity_fct": "cos_sim",
273
+ "gather_across_devices": false
274
  }
275
  ```
276
 
 
397
  - `prompts`: None
398
  - `batch_sampler`: batch_sampler
399
  - `multi_dataset_batch_sampler`: round_robin
400
+ - `router_mapping`: {}
401
+ - `learning_rate_mapping`: {}
402
 
403
  </details>
404
 
405
  ### Training Logs
406
  | Epoch | Step | Training Loss |
407
  |:-----:|:----:|:-------------:|
408
+ | 0.08 | 500 | 0.0323 |
409
+ | 0.16 | 1000 | 0.0356 |
410
+ | 0.24 | 1500 | 0.0426 |
411
+ | 0.32 | 2000 | 0.0451 |
412
+ | 0.4 | 2500 | 0.0306 |
413
+ | 0.48 | 3000 | 0.0341 |
414
+ | 0.56 | 3500 | 0.0374 |
415
+ | 0.64 | 4000 | 0.0291 |
416
+ | 0.72 | 4500 | 0.0266 |
417
+ | 0.8 | 5000 | 0.0214 |
418
+ | 0.88 | 5500 | 0.0331 |
419
+ | 0.96 | 6000 | 0.0281 |
420
 
421
 
422
  ### Framework Versions
423
  - Python: 3.11.13
424
+ - Sentence Transformers: 5.1.0
425
  - Transformers: 4.52.4
426
  - PyTorch: 2.6.0+cu124
427
  - Accelerate: 1.8.1
config_sentence_transformers.json CHANGED
@@ -1,10 +1,14 @@
1
  {
2
  "__version__": {
3
- "sentence_transformers": "4.1.0",
4
  "transformers": "4.52.4",
5
  "pytorch": "2.6.0+cu124"
6
  },
7
- "prompts": {},
 
 
 
8
  "default_prompt_name": null,
9
- "similarity_fn_name": "cosine"
 
10
  }
 
1
  {
2
  "__version__": {
3
+ "sentence_transformers": "5.1.0",
4
  "transformers": "4.52.4",
5
  "pytorch": "2.6.0+cu124"
6
  },
7
+ "prompts": {
8
+ "query": "",
9
+ "document": ""
10
+ },
11
  "default_prompt_name": null,
12
+ "similarity_fn_name": "cosine",
13
+ "model_type": "SentenceTransformer"
14
  }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5e9435891aaf138c9519d918f0dfd6b9d607abfeafec2575a06d3137bd292a0c
3
+ size 1340612432
sentence_bert_config.json CHANGED
@@ -1,4 +1,4 @@
1
  {
2
- "max_seq_length": 512,
3
- "do_lower_case": false
4
  }
 
1
  {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
  }
tokenizer_config.json CHANGED
@@ -46,11 +46,18 @@
46
  "do_lower_case": true,
47
  "extra_special_tokens": {},
48
  "mask_token": "[MASK]",
 
49
  "model_max_length": 512,
 
50
  "pad_token": "[PAD]",
 
 
51
  "sep_token": "[SEP]",
 
52
  "strip_accents": null,
53
  "tokenize_chinese_chars": true,
54
  "tokenizer_class": "BertTokenizer",
 
 
55
  "unk_token": "[UNK]"
56
  }
 
46
  "do_lower_case": true,
47
  "extra_special_tokens": {},
48
  "mask_token": "[MASK]",
49
+ "max_length": 512,
50
  "model_max_length": 512,
51
+ "pad_to_multiple_of": null,
52
  "pad_token": "[PAD]",
53
+ "pad_token_type_id": 0,
54
+ "padding_side": "right",
55
  "sep_token": "[SEP]",
56
+ "stride": 0,
57
  "strip_accents": null,
58
  "tokenize_chinese_chars": true,
59
  "tokenizer_class": "BertTokenizer",
60
+ "truncation_side": "right",
61
+ "truncation_strategy": "longest_first",
62
  "unk_token": "[UNK]"
63
  }