Add new SentenceTransformer model
Browse files- README.md +106 -59
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -4,48 +4,52 @@ tags:
|
|
| 4 |
- sentence-similarity
|
| 5 |
- feature-extraction
|
| 6 |
- generated_from_trainer
|
| 7 |
-
- dataset_size:
|
| 8 |
- loss:TripletLoss
|
| 9 |
-
base_model:
|
| 10 |
widget:
|
| 11 |
-
- source_sentence:
|
| 12 |
sentences:
|
| 13 |
-
- '
|
| 14 |
-
|
| 15 |
-
-
|
| 16 |
-
-
|
|
|
|
|
|
|
| 17 |
sentences:
|
| 18 |
-
-
|
| 19 |
-
-
|
| 20 |
-
-
|
| 21 |
-
- source_sentence:
|
|
|
|
| 22 |
sentences:
|
| 23 |
-
- 'CGI 3D Animated Short: "
|
| 24 |
-
-
|
| 25 |
-
- 'CGI 3D
|
| 26 |
-
- source_sentence:
|
| 27 |
sentences:
|
| 28 |
-
-
|
| 29 |
-
|
| 30 |
-
-
|
| 31 |
-
-
|
|
|
|
| 32 |
sentences:
|
| 33 |
-
-
|
| 34 |
-
- '
|
| 35 |
-
-
|
| 36 |
pipeline_tag: sentence-similarity
|
| 37 |
library_name: sentence-transformers
|
| 38 |
---
|
| 39 |
|
| 40 |
-
# SentenceTransformer based on
|
| 41 |
|
| 42 |
-
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [
|
| 43 |
|
| 44 |
## Model Details
|
| 45 |
|
| 46 |
### Model Description
|
| 47 |
- **Model Type:** Sentence Transformer
|
| 48 |
-
- **Base model:** [
|
| 49 |
- **Maximum Sequence Length:** 128 tokens
|
| 50 |
- **Output Dimensionality:** 384 dimensions
|
| 51 |
- **Similarity Function:** Cosine Similarity
|
|
@@ -87,9 +91,9 @@ from sentence_transformers import SentenceTransformer
|
|
| 87 |
model = SentenceTransformer("Syldehayem/all-MiniLM-L12-v2_embedder_train")
|
| 88 |
# Run inference
|
| 89 |
sentences = [
|
| 90 |
-
'Horror Short Film
|
| 91 |
-
|
| 92 |
-
'
|
| 93 |
]
|
| 94 |
embeddings = model.encode(sentences)
|
| 95 |
print(embeddings.shape)
|
|
@@ -143,19 +147,19 @@ You can finetune this model on your own dataset.
|
|
| 143 |
|
| 144 |
#### Unnamed Dataset
|
| 145 |
|
| 146 |
-
* Size:
|
| 147 |
* Columns: <code>sentence_0</code>, <code>sentence_1</code>, and <code>sentence_2</code>
|
| 148 |
* Approximate statistics based on the first 1000 samples:
|
| 149 |
-
| | sentence_0
|
| 150 |
-
|:--------|:---------------------------------------------------------------------------------
|
| 151 |
-
| type | string
|
| 152 |
-
| details | <ul><li>min: 3 tokens</li><li>mean: 19.
|
| 153 |
* Samples:
|
| 154 |
-
| sentence_0
|
| 155 |
-
|:---------------------------------------------------------------------------------
|
| 156 |
-
| <code>
|
| 157 |
-
| <code>
|
| 158 |
-
| <code>
|
| 159 |
* Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
|
| 160 |
```json
|
| 161 |
{
|
|
@@ -169,7 +173,7 @@ You can finetune this model on your own dataset.
|
|
| 169 |
|
| 170 |
- `per_device_train_batch_size`: 16
|
| 171 |
- `per_device_eval_batch_size`: 16
|
| 172 |
-
- `num_train_epochs`:
|
| 173 |
- `multi_dataset_batch_sampler`: round_robin
|
| 174 |
|
| 175 |
#### All Hyperparameters
|
|
@@ -192,7 +196,7 @@ You can finetune this model on your own dataset.
|
|
| 192 |
- `adam_beta2`: 0.999
|
| 193 |
- `adam_epsilon`: 1e-08
|
| 194 |
- `max_grad_norm`: 1
|
| 195 |
-
- `num_train_epochs`:
|
| 196 |
- `max_steps`: -1
|
| 197 |
- `lr_scheduler_type`: linear
|
| 198 |
- `lr_scheduler_kwargs`: {}
|
|
@@ -293,25 +297,68 @@ You can finetune this model on your own dataset.
|
|
| 293 |
</details>
|
| 294 |
|
| 295 |
### Training Logs
|
| 296 |
-
| Epoch
|
| 297 |
-
|:------:|:----:|:-------------:|
|
| 298 |
-
| 0.
|
| 299 |
-
| 1.
|
| 300 |
-
|
|
| 301 |
-
|
|
| 302 |
-
|
|
| 303 |
-
|
|
| 304 |
-
|
|
| 305 |
-
|
|
| 306 |
-
|
|
| 307 |
-
|
|
| 308 |
-
|
|
| 309 |
-
|
|
| 310 |
-
|
|
| 311 |
-
|
|
| 312 |
-
|
|
| 313 |
-
|
|
| 314 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 315 |
|
| 316 |
|
| 317 |
### Framework Versions
|
|
|
|
| 4 |
- sentence-similarity
|
| 5 |
- feature-extraction
|
| 6 |
- generated_from_trainer
|
| 7 |
+
- dataset_size:9712
|
| 8 |
- loss:TripletLoss
|
| 9 |
+
base_model: Syldehayem/all-MiniLM-L12-v2_embedder_train
|
| 10 |
widget:
|
| 11 |
+
- source_sentence: CGI 3D Animated Short "The Scarf" - by Team The Scarf
|
| 12 |
sentences:
|
| 13 |
+
- 'CGI 3D Short: "Lenovo Legion: Turning Point" - by Audis Huang & Moonshine Animation
|
| 14 |
+
| TheCGBros'
|
| 15 |
+
- 'CGI Animated Trailers : "Dropzone" - by RealtimeUK'
|
| 16 |
+
- 'CGI 3D Animated Short: "SOLVIVAL" - by Pixelhunters | TheCGBros'
|
| 17 |
+
- source_sentence: CGI Animated Short Film HD "Terazia's Zoo " by Alison Dulou & Estelle
|
| 18 |
+
Lefebvre | CGMeetup
|
| 19 |
sentences:
|
| 20 |
+
- A comedian puppet decides to branch out on his own / You're The Puppet
|
| 21 |
+
- Horror Short Film Series “The Outer Darkness” Part 1 | ALTER
|
| 22 |
+
- ERNIE | Omeleto
|
| 23 |
+
- source_sentence: Kenneth Branagh in the thriller "Schneider's 2nd Stage" - Short
|
| 24 |
+
film by Phil Stoole
|
| 25 |
sentences:
|
| 26 |
+
- 'CGI 3D Animated Short Film: "Fish in LOVE" by ISArt Digital | @CGMeetup'
|
| 27 |
+
- Cookies By The Fire Short Horror Film | Screamfest | Merry Christmas
|
| 28 |
+
- 'CGI 3D Animated Spot: "Mantse Palm Wine" - by Arnold Bannerman | TheCGBros'
|
| 29 |
+
- source_sentence: The Portrait
|
| 30 |
sentences:
|
| 31 |
+
- A teenage girl must quickly adapt to a radically different urban environment |
|
| 32 |
+
Barrio Frontera
|
| 33 |
+
- Queen of Meatloaf | Short film tease
|
| 34 |
+
- 'CGI 3D Tutorial : "Using Zapplink in Zbrush" - by 3dmotive'
|
| 35 |
+
- source_sentence: Horror Short Film "Nice to Finally Meet You" | ALTER | Online Premiere
|
| 36 |
sentences:
|
| 37 |
+
- 'Mondays: The Spielberg Challenge Winner!'
|
| 38 |
+
- 'The Curse of Pandora''s Box Returns to #UniversalHHN 2021'
|
| 39 |
+
- SONS OF APRIL | Omeleto
|
| 40 |
pipeline_tag: sentence-similarity
|
| 41 |
library_name: sentence-transformers
|
| 42 |
---
|
| 43 |
|
| 44 |
+
# SentenceTransformer based on Syldehayem/all-MiniLM-L12-v2_embedder_train
|
| 45 |
|
| 46 |
+
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Syldehayem/all-MiniLM-L12-v2_embedder_train](https://huggingface.co/Syldehayem/all-MiniLM-L12-v2_embedder_train). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
|
| 47 |
|
| 48 |
## Model Details
|
| 49 |
|
| 50 |
### Model Description
|
| 51 |
- **Model Type:** Sentence Transformer
|
| 52 |
+
- **Base model:** [Syldehayem/all-MiniLM-L12-v2_embedder_train](https://huggingface.co/Syldehayem/all-MiniLM-L12-v2_embedder_train) <!-- at revision 58956428f2d485efdf2697a1a2cc793795e25057 -->
|
| 53 |
- **Maximum Sequence Length:** 128 tokens
|
| 54 |
- **Output Dimensionality:** 384 dimensions
|
| 55 |
- **Similarity Function:** Cosine Similarity
|
|
|
|
| 91 |
model = SentenceTransformer("Syldehayem/all-MiniLM-L12-v2_embedder_train")
|
| 92 |
# Run inference
|
| 93 |
sentences = [
|
| 94 |
+
'Horror Short Film "Nice to Finally Meet You" | ALTER | Online Premiere',
|
| 95 |
+
"The Curse of Pandora's Box Returns to #UniversalHHN 2021",
|
| 96 |
+
'Mondays: The Spielberg Challenge Winner!',
|
| 97 |
]
|
| 98 |
embeddings = model.encode(sentences)
|
| 99 |
print(embeddings.shape)
|
|
|
|
| 147 |
|
| 148 |
#### Unnamed Dataset
|
| 149 |
|
| 150 |
+
* Size: 9,712 training samples
|
| 151 |
* Columns: <code>sentence_0</code>, <code>sentence_1</code>, and <code>sentence_2</code>
|
| 152 |
* Approximate statistics based on the first 1000 samples:
|
| 153 |
+
| | sentence_0 | sentence_1 | sentence_2 |
|
| 154 |
+
|:--------|:---------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
|
| 155 |
+
| type | string | string | string |
|
| 156 |
+
| details | <ul><li>min: 3 tokens</li><li>mean: 19.7 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 19.91 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 20.27 tokens</li><li>max: 50 tokens</li></ul> |
|
| 157 |
* Samples:
|
| 158 |
+
| sentence_0 | sentence_1 | sentence_2 |
|
| 159 |
+
|:---------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------|:---------------------------------------------------------------------|
|
| 160 |
+
| <code>মেয়ে যখন মায়ের মতন | Bidhilipi | #Shorts | Bengali Family Drama</code> | <code>CGI 3D Animated Shorts: "Rust" - by Matthieu Druaud</code> | <code>Mukhyamantri | মুখ্যমন্ত্রী | Bengali Movie Part – 3/12</code> |
|
| 161 |
+
| <code>A Sci-Fi Short Film: "Voltok" - by Jonathan Vleeschower | TheCGBros</code> | <code>CGI MoCap Demo : "Finger Mocap Without Any Post Animation" by the MocapLab</code> | <code>A MAN DEPARTED | Omeleto Drama</code> |
|
| 162 |
+
| <code>LEAKY PIPES</code> | <code>Taking care of a baby at 15 | "Fifteen" - Short film by Sameh Alaa</code> | <code>CGI VFX Spot : "Black Beetle" by - The MILL</code> |
|
| 163 |
* Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
|
| 164 |
```json
|
| 165 |
{
|
|
|
|
| 173 |
|
| 174 |
- `per_device_train_batch_size`: 16
|
| 175 |
- `per_device_eval_batch_size`: 16
|
| 176 |
+
- `num_train_epochs`: 50
|
| 177 |
- `multi_dataset_batch_sampler`: round_robin
|
| 178 |
|
| 179 |
#### All Hyperparameters
|
|
|
|
| 196 |
- `adam_beta2`: 0.999
|
| 197 |
- `adam_epsilon`: 1e-08
|
| 198 |
- `max_grad_norm`: 1
|
| 199 |
+
- `num_train_epochs`: 50
|
| 200 |
- `max_steps`: -1
|
| 201 |
- `lr_scheduler_type`: linear
|
| 202 |
- `lr_scheduler_kwargs`: {}
|
|
|
|
| 297 |
</details>
|
| 298 |
|
| 299 |
### Training Logs
|
| 300 |
+
| Epoch | Step | Training Loss |
|
| 301 |
+
|:-------:|:-----:|:-------------:|
|
| 302 |
+
| 0.8237 | 500 | 5.0075 |
|
| 303 |
+
| 1.6474 | 1000 | 4.9816 |
|
| 304 |
+
| 2.4712 | 1500 | 5.013 |
|
| 305 |
+
| 3.2949 | 2000 | 4.981 |
|
| 306 |
+
| 4.1186 | 2500 | 4.9981 |
|
| 307 |
+
| 4.9423 | 3000 | 4.9727 |
|
| 308 |
+
| 5.7661 | 3500 | 4.9698 |
|
| 309 |
+
| 6.5898 | 4000 | 4.9839 |
|
| 310 |
+
| 7.4135 | 4500 | 5.0001 |
|
| 311 |
+
| 8.2372 | 5000 | 4.9996 |
|
| 312 |
+
| 9.0610 | 5500 | 4.9993 |
|
| 313 |
+
| 9.8847 | 6000 | 4.9999 |
|
| 314 |
+
| 10.7084 | 6500 | 5.0015 |
|
| 315 |
+
| 11.5321 | 7000 | 4.9934 |
|
| 316 |
+
| 12.3558 | 7500 | 4.9903 |
|
| 317 |
+
| 13.1796 | 8000 | 4.9875 |
|
| 318 |
+
| 14.0033 | 8500 | 5.0018 |
|
| 319 |
+
| 14.8270 | 9000 | 5.0088 |
|
| 320 |
+
| 15.6507 | 9500 | 4.9643 |
|
| 321 |
+
| 16.4745 | 10000 | 4.9447 |
|
| 322 |
+
| 17.2982 | 10500 | 4.8911 |
|
| 323 |
+
| 18.1219 | 11000 | 4.8719 |
|
| 324 |
+
| 18.9456 | 11500 | 4.8671 |
|
| 325 |
+
| 19.7694 | 12000 | 4.8268 |
|
| 326 |
+
| 20.5931 | 12500 | 4.8195 |
|
| 327 |
+
| 21.4168 | 13000 | 4.7726 |
|
| 328 |
+
| 22.2405 | 13500 | 4.7479 |
|
| 329 |
+
| 23.0643 | 14000 | 4.7465 |
|
| 330 |
+
| 23.8880 | 14500 | 4.7776 |
|
| 331 |
+
| 24.7117 | 15000 | 4.7366 |
|
| 332 |
+
| 25.5354 | 15500 | 4.7076 |
|
| 333 |
+
| 26.3591 | 16000 | 4.74 |
|
| 334 |
+
| 27.1829 | 16500 | 4.7118 |
|
| 335 |
+
| 28.0066 | 17000 | 4.6797 |
|
| 336 |
+
| 28.8303 | 17500 | 4.7144 |
|
| 337 |
+
| 29.6540 | 18000 | 4.662 |
|
| 338 |
+
| 30.4778 | 18500 | 4.6849 |
|
| 339 |
+
| 31.3015 | 19000 | 4.6608 |
|
| 340 |
+
| 32.1252 | 19500 | 4.6844 |
|
| 341 |
+
| 32.9489 | 20000 | 4.6561 |
|
| 342 |
+
| 33.7727 | 20500 | 4.6513 |
|
| 343 |
+
| 34.5964 | 21000 | 4.6418 |
|
| 344 |
+
| 35.4201 | 21500 | 4.635 |
|
| 345 |
+
| 36.2438 | 22000 | 4.6418 |
|
| 346 |
+
| 37.0675 | 22500 | 4.62 |
|
| 347 |
+
| 37.8913 | 23000 | 4.615 |
|
| 348 |
+
| 38.7150 | 23500 | 4.6189 |
|
| 349 |
+
| 39.5387 | 24000 | 4.6113 |
|
| 350 |
+
| 40.3624 | 24500 | 4.6054 |
|
| 351 |
+
| 41.1862 | 25000 | 4.5824 |
|
| 352 |
+
| 42.0099 | 25500 | 4.5907 |
|
| 353 |
+
| 42.8336 | 26000 | 4.5949 |
|
| 354 |
+
| 43.6573 | 26500 | 4.5769 |
|
| 355 |
+
| 44.4811 | 27000 | 4.5758 |
|
| 356 |
+
| 45.3048 | 27500 | 4.5613 |
|
| 357 |
+
| 46.1285 | 28000 | 4.5816 |
|
| 358 |
+
| 46.9522 | 28500 | 4.5538 |
|
| 359 |
+
| 47.7759 | 29000 | 4.5645 |
|
| 360 |
+
| 48.5997 | 29500 | 4.5653 |
|
| 361 |
+
| 49.4234 | 30000 | 4.5494 |
|
| 362 |
|
| 363 |
|
| 364 |
### Framework Versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 133462128
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8a0ef17f513afbe54faae7df152aa8782ec9c31ce60484187db0018d367f169a
|
| 3 |
size 133462128
|