x2bee
/

ModernBERT-SimCSE-multitask_v04

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:952a6c22e6fd47eb3c9872be6da5ff1152332bd8f6c51082eed8e3eb73962f49
 size 2362528

 version https://git-lfs.github.com/spec/v1
+oid sha256:31635e07aba0bf9ff1e49bb5cec91388f57ad0a789dbc32c0b7987315304f442
 size 2362528

README.md CHANGED Viewed

@@ -6,7 +6,7 @@ tags:
 - generated_from_trainer
 - dataset_size:5749
 - loss:CosineSimilarityLoss
-base_model: CocoRoF/ModernBERT-SimCSE_v02
 widget:
 - source_sentence: 우리는 움직이는 동행 우주 정지 좌표계에 비례하여 이동하고 있습니다 ... 약 371km / s에서 별자리 leo
     쪽으로. "
@@ -48,7 +48,7 @@ metrics:
 - pearson_max
 - spearman_max
 model-index:
-- name: SentenceTransformer based on CocoRoF/ModernBERT-SimCSE_v02
   results:
   - task:
       type: semantic-similarity
@@ -58,46 +58,46 @@ model-index:
       type: sts_dev
     metrics:
     - type: pearson_cosine
-      value: 0.8223949445074785
       name: Pearson Cosine
     - type: spearman_cosine
-      value: 0.8220107207834706
       name: Spearman Cosine
     - type: pearson_euclidean
-      value: 0.7785831525283676
       name: Pearson Euclidean
     - type: spearman_euclidean
-      value: 0.7815628643916452
       name: Spearman Euclidean
     - type: pearson_manhattan
-      value: 0.7809119630672191
       name: Pearson Manhattan
     - type: spearman_manhattan
-      value: 0.7846536514745763
       name: Spearman Manhattan
     - type: pearson_dot
-      value: 0.7543765794886113
       name: Pearson Dot
     - type: spearman_dot
-      value: 0.7434525191412167
       name: Spearman Dot
     - type: pearson_max
-      value: 0.8223949445074785
       name: Pearson Max
     - type: spearman_max
-      value: 0.8220107207834706
       name: Spearman Max
 ---
-# SentenceTransformer based on CocoRoF/ModernBERT-SimCSE_v02
-This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [CocoRoF/ModernBERT-SimCSE_v02](https://huggingface.co/CocoRoF/ModernBERT-SimCSE_v02). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
 ## Model Details
 ### Model Description
 - **Model Type:** Sentence Transformer
-- **Base model:** [CocoRoF/ModernBERT-SimCSE_v02](https://huggingface.co/CocoRoF/ModernBERT-SimCSE_v02) <!-- at revision de4148c764893843e15a4e0b241fe308147a9aaa -->
 - **Maximum Sequence Length:** 512 tokens
 - **Output Dimensionality:** 768 dimensions
 - **Similarity Function:** Cosine Similarity
@@ -136,7 +136,7 @@ Then you can load this model and run inference.
 from sentence_transformers import SentenceTransformer
 # Download from the 🤗 Hub
-model = SentenceTransformer("CocoRoF/ModernBERT-SimCSE-multitask_v03")
 # Run inference
 sentences = [
     '버스가 바쁜 길을 따라 운전한다.',
@@ -186,18 +186,18 @@ You can finetune this model on your own dataset.
 * Dataset: `sts_dev`
 * Evaluated with [<code>EmbeddingSimilarityEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.EmbeddingSimilarityEvaluator)
-| Metric             | Value     |
-|:-------------------|:----------|
-| pearson_cosine     | 0.8224    |
-| spearman_cosine    | 0.822     |
-| pearson_euclidean  | 0.7786    |
-| spearman_euclidean | 0.7816    |
-| pearson_manhattan  | 0.7809    |
-| spearman_manhattan | 0.7847    |
-| pearson_dot        | 0.7544    |
-| spearman_dot       | 0.7435    |
-| pearson_max        | 0.8224    |
-| **spearman_max**   | **0.822** |
 <!--
 ## Bias, Risks and Limitations
@@ -224,7 +224,7 @@ You can finetune this model on your own dataset.
   |         | sentence1                                                                         | sentence2                                                                         | score                                                          |
   |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------|
   | type    | string                                                                            | string                                                                            | float                                                          |
-  | details | <ul><li>min: 7 tokens</li><li>mean: 13.52 tokens</li><li>max: 36 tokens</li></ul> | <ul><li>min: 7 tokens</li><li>mean: 13.41 tokens</li><li>max: 32 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.45</li><li>max: 1.0</li></ul> |
 * Samples:
   | sentence1                           | sentence2                                 | score             |
   |:------------------------------------|:------------------------------------------|:------------------|
@@ -249,7 +249,7 @@ You can finetune this model on your own dataset.
   |         | sentence1                                                                         | sentence2                                                                         | score                                                          |
   |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------|
   | type    | string                                                                            | string                                                                            | float                                                          |
-  | details | <ul><li>min: 7 tokens</li><li>mean: 20.38 tokens</li><li>max: 52 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 20.52 tokens</li><li>max: 54 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.42</li><li>max: 1.0</li></ul> |
 * Samples:
   | sentence1                            | sentence2                           | score             |
   |:-------------------------------------|:------------------------------------|:------------------|
@@ -275,7 +275,7 @@ You can finetune this model on your own dataset.
 - `num_train_epochs`: 10.0
 - `warmup_ratio`: 0.1
 - `push_to_hub`: True
-- `hub_model_id`: CocoRoF/ModernBERT-SimCSE-multitask_v03
 - `hub_strategy`: checkpoint
 - `batch_sampler`: no_duplicates
@@ -362,7 +362,7 @@ You can finetune this model on your own dataset.
 - `use_legacy_prediction_loop`: False
 - `push_to_hub`: True
 - `resume_from_checkpoint`: None
-- `hub_model_id`: CocoRoF/ModernBERT-SimCSE-multitask_v03
 - `hub_strategy`: checkpoint
 - `hub_private_repo`: None
 - `hub_always_push`: False
@@ -403,50 +403,50 @@ You can finetune this model on your own dataset.
 ### Training Logs
 | Epoch  | Step | Training Loss | Validation Loss | sts_dev_spearman_max |
 |:------:|:----:|:-------------:|:---------------:|:--------------------:|
-| 0.2228 | 10   | 0.0283        | -               | -                    |
-| 0.4457 | 20   | 0.0344        | -               | -                    |
-| 0.6685 | 30   | 0.0305        | 0.0310          | 0.7939               |
-| 0.8914 | 40   | 0.0489        | -               | -                    |
-| 1.1337 | 50   | 0.0382        | -               | -                    |
-| 1.3565 | 60   | 0.0271        | 0.0293          | 0.7994               |
-| 1.5794 | 70   | 0.0344        | -               | -                    |
-| 1.8022 | 80   | 0.0382        | -               | -                    |
-| 2.0446 | 90   | 0.0419        | 0.0280          | 0.8059               |
-| 2.2674 | 100  | 0.0244        | -               | -                    |
-| 2.4903 | 110  | 0.0307        | -               | -                    |
-| 2.7131 | 120  | 0.0291        | 0.0269          | 0.8108               |
-| 2.9359 | 130  | 0.038         | -               | -                    |
-| 3.1783 | 140  | 0.0269        | -               | -                    |
-| 3.4011 | 150  | 0.0268        | 0.0262          | 0.8155               |
-| 3.6240 | 160  | 0.0246        | -               | -                    |
-| 3.8468 | 170  | 0.0313        | -               | -                    |
-| 4.0891 | 180  | 0.0303        | 0.0259          | 0.8185               |
-| 4.3120 | 190  | 0.0198        | -               | -                    |
-| 4.5348 | 200  | 0.0257        | -               | -                    |
-| 4.7577 | 210  | 0.0242        | 0.0255          | 0.8202               |
-| 4.9805 | 220  | 0.0293        | -               | -                    |
-| 5.2228 | 230  | 0.0193        | -               | -                    |
-| 5.4457 | 240  | 0.0222        | 0.0254          | 0.8222               |
-| 5.6685 | 250  | 0.0184        | -               | -                    |
-| 5.8914 | 260  | 0.0243        | -               | -                    |
-| 6.1337 | 270  | 0.0204        | 0.0254          | 0.8235               |
-| 6.3565 | 280  | 0.0147        | -               | -                    |
-| 6.5794 | 290  | 0.0196        | -               | -                    |
-| 6.8022 | 300  | 0.0176        | 0.0253          | 0.8227               |
-| 7.0446 | 310  | 0.0202        | -               | -                    |
-| 7.2674 | 320  | 0.0123        | -               | -                    |
-| 7.4903 | 330  | 0.0151        | 0.0254          | 0.8236               |
-| 7.7131 | 340  | 0.0132        | -               | -                    |
-| 7.9359 | 350  | 0.0158        | -               | -                    |
-| 8.1783 | 360  | 0.0118        | 0.0256          | 0.8240               |
-| 8.4011 | 370  | 0.0115        | -               | -                    |
-| 8.6240 | 380  | 0.0105        | -               | -                    |
-| 8.8468 | 390  | 0.0111        | 0.0256          | 0.8215               |
-| 9.0891 | 400  | 0.011         | -               | -                    |
-| 9.3120 | 410  | 0.0076        | -               | -                    |
-| 9.5348 | 420  | 0.0091        | 0.0256          | 0.8220               |
-| 9.7577 | 430  | 0.0075        | -               | -                    |
-| 9.9805 | 440  | 0.0093        | -               | -                    |
 ### Framework Versions

 - generated_from_trainer
 - dataset_size:5749
 - loss:CosineSimilarityLoss
+base_model: CocoRoF/ModernBERT-SimCSE_v04
 widget:
 - source_sentence: 우리는 움직이는 동행 우주 정지 좌표계에 비례하여 이동하고 있습니다 ... 약 371km / s에서 별자리 leo
     쪽으로. "
 - pearson_max
 - spearman_max
 model-index:
+- name: SentenceTransformer based on CocoRoF/ModernBERT-SimCSE_v04
   results:
   - task:
       type: semantic-similarity
       type: sts_dev
     metrics:
     - type: pearson_cosine
+      value: 0.7846905549925053
       name: Pearson Cosine
     - type: spearman_cosine
+      value: 0.7871247667333137
       name: Spearman Cosine
     - type: pearson_euclidean
+      value: 0.7258848709796941
       name: Pearson Euclidean
     - type: spearman_euclidean
+      value: 0.7208562515791448
       name: Spearman Euclidean
     - type: pearson_manhattan
+      value: 0.7251869665655273
       name: Pearson Manhattan
     - type: spearman_manhattan
+      value: 0.7202883259106225
       name: Spearman Manhattan
     - type: pearson_dot
+      value: 0.62098630425604
       name: Pearson Dot
     - type: spearman_dot
+      value: 0.6254562421139086
       name: Spearman Dot
     - type: pearson_max
+      value: 0.7846905549925053
       name: Pearson Max
     - type: spearman_max
+      value: 0.7871247667333137
       name: Spearman Max
 ---
+# SentenceTransformer based on CocoRoF/ModernBERT-SimCSE_v04
+This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [CocoRoF/ModernBERT-SimCSE_v04](https://huggingface.co/CocoRoF/ModernBERT-SimCSE_v04). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
 ## Model Details
 ### Model Description
 - **Model Type:** Sentence Transformer
+- **Base model:** [CocoRoF/ModernBERT-SimCSE_v04](https://huggingface.co/CocoRoF/ModernBERT-SimCSE_v04) <!-- at revision 7d23b869258e5c726c0f536bccac7e873d510d66 -->
 - **Maximum Sequence Length:** 512 tokens
 - **Output Dimensionality:** 768 dimensions
 - **Similarity Function:** Cosine Similarity
 from sentence_transformers import SentenceTransformer
 # Download from the 🤗 Hub
+model = SentenceTransformer("CocoRoF/ModernBERT-SimCSE-multitask_v04")
 # Run inference
 sentences = [
     '버스가 바쁜 길을 따라 운전한다.',
 * Dataset: `sts_dev`
 * Evaluated with [<code>EmbeddingSimilarityEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.EmbeddingSimilarityEvaluator)
+| Metric             | Value      |
+|:-------------------|:-----------|
+| pearson_cosine     | 0.7847     |
+| spearman_cosine    | 0.7871     |
+| pearson_euclidean  | 0.7259     |
+| spearman_euclidean | 0.7209     |
+| pearson_manhattan  | 0.7252     |
+| spearman_manhattan | 0.7203     |
+| pearson_dot        | 0.621      |
+| spearman_dot       | 0.6255     |
+| pearson_max        | 0.7847     |
+| **spearman_max**   | **0.7871** |
 <!--
 ## Bias, Risks and Limitations
   |         | sentence1                                                                         | sentence2                                                                         | score                                                          |
   |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------|
   | type    | string                                                                            | string                                                                            | float                                                          |
+  | details | <ul><li>min: 7 tokens</li><li>mean: 12.69 tokens</li><li>max: 31 tokens</li></ul> | <ul><li>min: 7 tokens</li><li>mean: 12.56 tokens</li><li>max: 27 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.45</li><li>max: 1.0</li></ul> |
 * Samples:
   | sentence1                           | sentence2                                 | score             |
   |:------------------------------------|:------------------------------------------|:------------------|
   |         | sentence1                                                                         | sentence2                                                                         | score                                                          |
   |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------|
   | type    | string                                                                            | string                                                                            | float                                                          |
+  | details | <ul><li>min: 6 tokens</li><li>mean: 18.89 tokens</li><li>max: 51 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 18.92 tokens</li><li>max: 50 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.42</li><li>max: 1.0</li></ul> |
 * Samples:
   | sentence1                            | sentence2                           | score             |
   |:-------------------------------------|:------------------------------------|:------------------|
 - `num_train_epochs`: 10.0
 - `warmup_ratio`: 0.1
 - `push_to_hub`: True
+- `hub_model_id`: CocoRoF/ModernBERT-SimCSE-multitask_v04
 - `hub_strategy`: checkpoint
 - `batch_sampler`: no_duplicates
 - `use_legacy_prediction_loop`: False
 - `push_to_hub`: True
 - `resume_from_checkpoint`: None
+- `hub_model_id`: CocoRoF/ModernBERT-SimCSE-multitask_v04
 - `hub_strategy`: checkpoint
 - `hub_private_repo`: None
 - `hub_always_push`: False
 ### Training Logs
 | Epoch  | Step | Training Loss | Validation Loss | sts_dev_spearman_max |
 |:------:|:----:|:-------------:|:---------------:|:--------------------:|
+| 0.2228 | 10   | 0.0285        | -               | -                    |
+| 0.4457 | 20   | 0.0396        | -               | -                    |
+| 0.6685 | 30   | 0.0396        | 0.0376          | 0.7647               |
+| 0.8914 | 40   | 0.0594        | -               | -                    |
+| 1.1337 | 50   | 0.0438        | -               | -                    |
+| 1.3565 | 60   | 0.0302        | 0.0358          | 0.7723               |
+| 1.5794 | 70   | 0.0398        | -               | -                    |
+| 1.8022 | 80   | 0.0457        | -               | -                    |
+| 2.0446 | 90   | 0.0464        | 0.0347          | 0.7805               |
+| 2.2674 | 100  | 0.026         | -               | -                    |
+| 2.4903 | 110  | 0.0331        | -               | -                    |
+| 2.7131 | 120  | 0.0318        | 0.0329          | 0.7837               |
+| 2.9359 | 130  | 0.0399        | -               | -                    |
+| 3.1783 | 140  | 0.0264        | -               | -                    |
+| 3.4011 | 150  | 0.0268        | 0.0332          | 0.7884               |
+| 3.6240 | 160  | 0.0241        | -               | -                    |
+| 3.8468 | 170  | 0.0309        | -               | -                    |
+| 4.0891 | 180  | 0.0263        | 0.0326          | 0.7918               |
+| 4.3120 | 190  | 0.0164        | -               | -                    |
+| 4.5348 | 200  | 0.0226        | -               | -                    |
+| 4.7577 | 210  | 0.0196        | 0.0314          | 0.7896               |
+| 4.9805 | 220  | 0.0217        | -               | -                    |
+| 5.2228 | 230  | 0.0134        | -               | -                    |
+| 5.4457 | 240  | 0.0157        | 0.0320          | 0.7911               |
+| 5.6685 | 250  | 0.0136        | -               | -                    |
+| 5.8914 | 260  | 0.0143        | -               | -                    |
+| 6.1337 | 270  | 0.0114        | 0.0322          | 0.7907               |
+| 6.3565 | 280  | 0.0077        | -               | -                    |
+| 6.5794 | 290  | 0.0116        | -               | -                    |
+| 6.8022 | 300  | 0.0087        | 0.0313          | 0.7868               |
+| 7.0446 | 310  | 0.0088        | -               | -                    |
+| 7.2674 | 320  | 0.0048        | -               | -                    |
+| 7.4903 | 330  | 0.0068        | 0.0317          | 0.7895               |
+| 7.7131 | 340  | 0.006         | -               | -                    |
+| 7.9359 | 350  | 0.0051        | -               | -                    |
+| 8.1783 | 360  | 0.0039        | 0.0323          | 0.7882               |
+| 8.4011 | 370  | 0.0036        | -               | -                    |
+| 8.6240 | 380  | 0.0045        | -               | -                    |
+| 8.8468 | 390  | 0.0032        | 0.0317          | 0.7841               |
+| 9.0891 | 400  | 0.0031        | -               | -                    |
+| 9.3120 | 410  | 0.0021        | -               | -                    |
+| 9.5348 | 420  | 0.0029        | 0.0323          | 0.7871               |
+| 9.7577 | 430  | 0.0023        | -               | -                    |
+| 9.9805 | 440  | 0.0027        | -               | -                    |
 ### Framework Versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4d9b65e72c69ee7ad20852d629dd9265d4a591df173662edc0ed58bcefc3cbeb
 size 610640632

 version https://git-lfs.github.com/spec/v1
+oid sha256:0869c16bd8ae16b638ef0de4e504f3e8f3a1c215f6ed1b812d8aa22835f41aff
 size 610640632