Syldehayem commited on
Commit
7b6614e
·
verified ·
1 Parent(s): 5895642

Add new SentenceTransformer model

Browse files
Files changed (2) hide show
  1. README.md +106 -59
  2. model.safetensors +1 -1
README.md CHANGED
@@ -4,48 +4,52 @@ tags:
4
  - sentence-similarity
5
  - feature-extraction
6
  - generated_from_trainer
7
- - dataset_size:13657
8
  - loss:TripletLoss
9
- base_model: sentence-transformers/all-MiniLM-L12-v2
10
  widget:
11
- - source_sentence: AN ISLAND DRIFTS | Omeleto Drama
12
  sentences:
13
- - 'The DUST Files: Awesome Aliens Vol. 1'
14
- - Doshti Putuler Galpo | দশটি পুতুলের গল্প | Bangla Telefilm | Part - 2
15
- - Mon Shunte Ki Chay | Hingsha | Bengali Movie Song | Kumar Sanu, Kavita
16
- - source_sentence: 'CGI 3D Animated Short: "Knight GYM" - by Alexis Dumortier | TheCGBros'
 
 
17
  sentences:
18
- - 'Zoom Cloud Hack #91784 The Tribe Murders | Short Horror Film | Screamfest'
19
- - '**Caution** CGI 3D Animated Spot : "#feelingnuts" - by Studio AKA'
20
- - 'CGI VFX Breakdowns : "Dust - Creature Shot VFX" by Ember Lab'
21
- - source_sentence: CGI Animated Short Film HD "Scarlett " by The STUDIO NYC | CGMeetup
 
22
  sentences:
23
- - 'CGI 3D Animated Short: "The Song of the Rain" - by Hezmon Animation | TheCGBros'
24
- - Ghum Ghum Chand | Sabar Oparey | Bengali Movie Song | Sandhya Mukherjee
25
- - 'CGI 3D Showreel : "Architectural 2012" by - ALLCGSTUDIO'
26
- - source_sentence: Chucky | Halloween Horror Nights 2023
27
  sentences:
28
- - Bhalobasa Bhalobasa | ভালবাসা ভালবাসা | Bengali Movie - 3/13
29
- - 'CGI & VFX Breakdowns: "After Earth Breakdown" - by Tippett Studio | TheCGBros'
30
- - Horror Short Film "The World Over" | ALTER
31
- - source_sentence: Horror Short Film “The Guest” | ALTER
 
32
  sentences:
33
- - Sci-Fi Short Film "Who Among Us" | DUST
34
- - '"The Amazing SpiderDad Trailer" - Mike Wilson'
35
- - Horror Short Film "Peter the Penguin" | ALTER
36
  pipeline_tag: sentence-similarity
37
  library_name: sentence-transformers
38
  ---
39
 
40
- # SentenceTransformer based on sentence-transformers/all-MiniLM-L12-v2
41
 
42
- This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/all-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
43
 
44
  ## Model Details
45
 
46
  ### Model Description
47
  - **Model Type:** Sentence Transformer
48
- - **Base model:** [sentence-transformers/all-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2) <!-- at revision c004d8e3e901237d8fa7e9fff12774962e391ce5 -->
49
  - **Maximum Sequence Length:** 128 tokens
50
  - **Output Dimensionality:** 384 dimensions
51
  - **Similarity Function:** Cosine Similarity
@@ -87,9 +91,9 @@ from sentence_transformers import SentenceTransformer
87
  model = SentenceTransformer("Syldehayem/all-MiniLM-L12-v2_embedder_train")
88
  # Run inference
89
  sentences = [
90
- 'Horror Short Film “The Guest” | ALTER',
91
- 'Sci-Fi Short Film "Who Among Us" | DUST',
92
- 'Horror Short Film "Peter the Penguin" | ALTER',
93
  ]
94
  embeddings = model.encode(sentences)
95
  print(embeddings.shape)
@@ -143,19 +147,19 @@ You can finetune this model on your own dataset.
143
 
144
  #### Unnamed Dataset
145
 
146
- * Size: 13,657 training samples
147
  * Columns: <code>sentence_0</code>, <code>sentence_1</code>, and <code>sentence_2</code>
148
  * Approximate statistics based on the first 1000 samples:
149
- | | sentence_0 | sentence_1 | sentence_2 |
150
- |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
151
- | type | string | string | string |
152
- | details | <ul><li>min: 3 tokens</li><li>mean: 19.84 tokens</li><li>max: 69 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 19.98 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 19.89 tokens</li><li>max: 46 tokens</li></ul> |
153
  * Samples:
154
- | sentence_0 | sentence_1 | sentence_2 |
155
- |:--------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
156
- | <code>CGI 3D Animated Promo Short : "EFdeN: Where N is Nature" - by UmbrellaFX</code> | <code>A Sci-Fi Short Film: "The Last Star" by Dimitri Vallein | TheCGBros</code> | <code>CGI & VFX Showreels: "Character reel" - by Dario Triglia | TheCGBros</code> |
157
- | <code>**Award Winning** Sci-Fi Short Film: "The Developer" - by Robert Odegnal | TheCGBros</code> | <code>Vonnis | Short Horror Film | Screamfest</code> | <code>Adobe and the Frog BTS - Day 3 & 4!</code> |
158
- | <code>CGI 3D Animated Short "Olrik" - by Philip Harris-Genois and Marilyn Marcotte</code> | <code>CGI & VFX Showreels: "VFX Compositing Showreel" - by Ameya More | TheCGBros</code> | <code>CGI 3D Making Of : "Project 4450" - by The Animation Workshop</code> |
159
  * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
160
  ```json
161
  {
@@ -169,7 +173,7 @@ You can finetune this model on your own dataset.
169
 
170
  - `per_device_train_batch_size`: 16
171
  - `per_device_eval_batch_size`: 16
172
- - `num_train_epochs`: 10
173
  - `multi_dataset_batch_sampler`: round_robin
174
 
175
  #### All Hyperparameters
@@ -192,7 +196,7 @@ You can finetune this model on your own dataset.
192
  - `adam_beta2`: 0.999
193
  - `adam_epsilon`: 1e-08
194
  - `max_grad_norm`: 1
195
- - `num_train_epochs`: 10
196
  - `max_steps`: -1
197
  - `lr_scheduler_type`: linear
198
  - `lr_scheduler_kwargs`: {}
@@ -293,25 +297,68 @@ You can finetune this model on your own dataset.
293
  </details>
294
 
295
  ### Training Logs
296
- | Epoch | Step | Training Loss |
297
- |:------:|:----:|:-------------:|
298
- | 0.5855 | 500 | 4.9981 |
299
- | 1.1710 | 1000 | 4.997 |
300
- | 1.7564 | 1500 | 4.9753 |
301
- | 2.3419 | 2000 | 4.9609 |
302
- | 2.9274 | 2500 | 4.9416 |
303
- | 3.5129 | 3000 | 4.8768 |
304
- | 4.0984 | 3500 | 4.8283 |
305
- | 4.6838 | 4000 | 4.7853 |
306
- | 5.2693 | 4500 | 4.7767 |
307
- | 5.8548 | 5000 | 4.7234 |
308
- | 6.4403 | 5500 | 4.7153 |
309
- | 7.0258 | 6000 | 4.6914 |
310
- | 7.6112 | 6500 | 4.6429 |
311
- | 8.1967 | 7000 | 4.6607 |
312
- | 8.7822 | 7500 | 4.6422 |
313
- | 9.3677 | 8000 | 4.613 |
314
- | 9.9532 | 8500 | 4.6118 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
315
 
316
 
317
  ### Framework Versions
 
4
  - sentence-similarity
5
  - feature-extraction
6
  - generated_from_trainer
7
+ - dataset_size:9712
8
  - loss:TripletLoss
9
+ base_model: Syldehayem/all-MiniLM-L12-v2_embedder_train
10
  widget:
11
+ - source_sentence: CGI 3D Animated Short "The Scarf" - by Team The Scarf
12
  sentences:
13
+ - 'CGI 3D Short: "Lenovo Legion: Turning Point" - by Audis Huang & Moonshine Animation
14
+ | TheCGBros'
15
+ - 'CGI Animated Trailers : "Dropzone" - by RealtimeUK'
16
+ - 'CGI 3D Animated Short: "SOLVIVAL" - by Pixelhunters | TheCGBros'
17
+ - source_sentence: CGI Animated Short Film HD "Terazia's Zoo " by Alison Dulou & Estelle
18
+ Lefebvre | CGMeetup
19
  sentences:
20
+ - A comedian puppet decides to branch out on his own / You're The Puppet
21
+ - Horror Short Film Series “The Outer Darkness” Part 1 | ALTER
22
+ - ERNIE | Omeleto
23
+ - source_sentence: Kenneth Branagh in the thriller "Schneider's 2nd Stage" - Short
24
+ film by Phil Stoole
25
  sentences:
26
+ - 'CGI 3D Animated Short Film: "Fish in LOVE" by ISArt Digital | @CGMeetup'
27
+ - Cookies By The Fire Short Horror Film | Screamfest | Merry Christmas
28
+ - 'CGI 3D Animated Spot: "Mantse Palm Wine" - by Arnold Bannerman | TheCGBros'
29
+ - source_sentence: The Portrait
30
  sentences:
31
+ - A teenage girl must quickly adapt to a radically different urban environment |
32
+ Barrio Frontera
33
+ - Queen of Meatloaf | Short film tease
34
+ - 'CGI 3D Tutorial : "Using Zapplink in Zbrush" - by 3dmotive'
35
+ - source_sentence: Horror Short Film "Nice to Finally Meet You" | ALTER | Online Premiere
36
  sentences:
37
+ - 'Mondays: The Spielberg Challenge Winner!'
38
+ - 'The Curse of Pandora''s Box Returns to #UniversalHHN 2021'
39
+ - SONS OF APRIL | Omeleto
40
  pipeline_tag: sentence-similarity
41
  library_name: sentence-transformers
42
  ---
43
 
44
+ # SentenceTransformer based on Syldehayem/all-MiniLM-L12-v2_embedder_train
45
 
46
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Syldehayem/all-MiniLM-L12-v2_embedder_train](https://huggingface.co/Syldehayem/all-MiniLM-L12-v2_embedder_train). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
47
 
48
  ## Model Details
49
 
50
  ### Model Description
51
  - **Model Type:** Sentence Transformer
52
+ - **Base model:** [Syldehayem/all-MiniLM-L12-v2_embedder_train](https://huggingface.co/Syldehayem/all-MiniLM-L12-v2_embedder_train) <!-- at revision 58956428f2d485efdf2697a1a2cc793795e25057 -->
53
  - **Maximum Sequence Length:** 128 tokens
54
  - **Output Dimensionality:** 384 dimensions
55
  - **Similarity Function:** Cosine Similarity
 
91
  model = SentenceTransformer("Syldehayem/all-MiniLM-L12-v2_embedder_train")
92
  # Run inference
93
  sentences = [
94
+ 'Horror Short Film "Nice to Finally Meet You" | ALTER | Online Premiere',
95
+ "The Curse of Pandora's Box Returns to #UniversalHHN 2021",
96
+ 'Mondays: The Spielberg Challenge Winner!',
97
  ]
98
  embeddings = model.encode(sentences)
99
  print(embeddings.shape)
 
147
 
148
  #### Unnamed Dataset
149
 
150
+ * Size: 9,712 training samples
151
  * Columns: <code>sentence_0</code>, <code>sentence_1</code>, and <code>sentence_2</code>
152
  * Approximate statistics based on the first 1000 samples:
153
+ | | sentence_0 | sentence_1 | sentence_2 |
154
+ |:--------|:---------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
155
+ | type | string | string | string |
156
+ | details | <ul><li>min: 3 tokens</li><li>mean: 19.7 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 19.91 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 20.27 tokens</li><li>max: 50 tokens</li></ul> |
157
  * Samples:
158
+ | sentence_0 | sentence_1 | sentence_2 |
159
+ |:---------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------|:---------------------------------------------------------------------|
160
+ | <code>মেয়ে যখন মায়ের মতন | Bidhilipi | #Shorts | Bengali Family Drama</code> | <code>CGI 3D Animated Shorts: "Rust" - by Matthieu Druaud</code> | <code>Mukhyamantri | মুখ্যমন্ত্রী | Bengali Movie Part 3/12</code> |
161
+ | <code>A Sci-Fi Short Film: "Voltok" - by Jonathan Vleeschower | TheCGBros</code> | <code>CGI MoCap Demo : "Finger Mocap Without Any Post Animation" by the MocapLab</code> | <code>A MAN DEPARTED | Omeleto Drama</code> |
162
+ | <code>LEAKY PIPES</code> | <code>Taking care of a baby at 15 | "Fifteen" - Short film by Sameh Alaa</code> | <code>CGI VFX Spot : "Black Beetle" by - The MILL</code> |
163
  * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
164
  ```json
165
  {
 
173
 
174
  - `per_device_train_batch_size`: 16
175
  - `per_device_eval_batch_size`: 16
176
+ - `num_train_epochs`: 50
177
  - `multi_dataset_batch_sampler`: round_robin
178
 
179
  #### All Hyperparameters
 
196
  - `adam_beta2`: 0.999
197
  - `adam_epsilon`: 1e-08
198
  - `max_grad_norm`: 1
199
+ - `num_train_epochs`: 50
200
  - `max_steps`: -1
201
  - `lr_scheduler_type`: linear
202
  - `lr_scheduler_kwargs`: {}
 
297
  </details>
298
 
299
  ### Training Logs
300
+ | Epoch | Step | Training Loss |
301
+ |:-------:|:-----:|:-------------:|
302
+ | 0.8237 | 500 | 5.0075 |
303
+ | 1.6474 | 1000 | 4.9816 |
304
+ | 2.4712 | 1500 | 5.013 |
305
+ | 3.2949 | 2000 | 4.981 |
306
+ | 4.1186 | 2500 | 4.9981 |
307
+ | 4.9423 | 3000 | 4.9727 |
308
+ | 5.7661 | 3500 | 4.9698 |
309
+ | 6.5898 | 4000 | 4.9839 |
310
+ | 7.4135 | 4500 | 5.0001 |
311
+ | 8.2372 | 5000 | 4.9996 |
312
+ | 9.0610 | 5500 | 4.9993 |
313
+ | 9.8847 | 6000 | 4.9999 |
314
+ | 10.7084 | 6500 | 5.0015 |
315
+ | 11.5321 | 7000 | 4.9934 |
316
+ | 12.3558 | 7500 | 4.9903 |
317
+ | 13.1796 | 8000 | 4.9875 |
318
+ | 14.0033 | 8500 | 5.0018 |
319
+ | 14.8270 | 9000 | 5.0088 |
320
+ | 15.6507 | 9500 | 4.9643 |
321
+ | 16.4745 | 10000 | 4.9447 |
322
+ | 17.2982 | 10500 | 4.8911 |
323
+ | 18.1219 | 11000 | 4.8719 |
324
+ | 18.9456 | 11500 | 4.8671 |
325
+ | 19.7694 | 12000 | 4.8268 |
326
+ | 20.5931 | 12500 | 4.8195 |
327
+ | 21.4168 | 13000 | 4.7726 |
328
+ | 22.2405 | 13500 | 4.7479 |
329
+ | 23.0643 | 14000 | 4.7465 |
330
+ | 23.8880 | 14500 | 4.7776 |
331
+ | 24.7117 | 15000 | 4.7366 |
332
+ | 25.5354 | 15500 | 4.7076 |
333
+ | 26.3591 | 16000 | 4.74 |
334
+ | 27.1829 | 16500 | 4.7118 |
335
+ | 28.0066 | 17000 | 4.6797 |
336
+ | 28.8303 | 17500 | 4.7144 |
337
+ | 29.6540 | 18000 | 4.662 |
338
+ | 30.4778 | 18500 | 4.6849 |
339
+ | 31.3015 | 19000 | 4.6608 |
340
+ | 32.1252 | 19500 | 4.6844 |
341
+ | 32.9489 | 20000 | 4.6561 |
342
+ | 33.7727 | 20500 | 4.6513 |
343
+ | 34.5964 | 21000 | 4.6418 |
344
+ | 35.4201 | 21500 | 4.635 |
345
+ | 36.2438 | 22000 | 4.6418 |
346
+ | 37.0675 | 22500 | 4.62 |
347
+ | 37.8913 | 23000 | 4.615 |
348
+ | 38.7150 | 23500 | 4.6189 |
349
+ | 39.5387 | 24000 | 4.6113 |
350
+ | 40.3624 | 24500 | 4.6054 |
351
+ | 41.1862 | 25000 | 4.5824 |
352
+ | 42.0099 | 25500 | 4.5907 |
353
+ | 42.8336 | 26000 | 4.5949 |
354
+ | 43.6573 | 26500 | 4.5769 |
355
+ | 44.4811 | 27000 | 4.5758 |
356
+ | 45.3048 | 27500 | 4.5613 |
357
+ | 46.1285 | 28000 | 4.5816 |
358
+ | 46.9522 | 28500 | 4.5538 |
359
+ | 47.7759 | 29000 | 4.5645 |
360
+ | 48.5997 | 29500 | 4.5653 |
361
+ | 49.4234 | 30000 | 4.5494 |
362
 
363
 
364
  ### Framework Versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a918830d1c43472f6d370ac225943262274803f3e69eb4e4035a76cfeb339374
3
  size 133462128
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8a0ef17f513afbe54faae7df152aa8782ec9c31ce60484187db0018d367f169a
3
  size 133462128