KMB_SimCSE_test / README.md

CocoRoF

test Done

e22a6d1 verified over 1 year ago

6.19 kB

library_name: transformers
license: apache-2.0
base_model: x2bee/KoModernBERT-base-mlm-v03-ckp00
tags:
  - generated_from_trainer
model-index:
  - name: KMB_SimCSE_test
    results: []

KMB_SimCSE_test

This model is a fine-tuned version of x2bee/KoModernBERT-base-mlm-v03-ckp00 on an unknown dataset. It achieves the following results on the evaluation set:

Loss: 0.0466
Pearson Cosine: 0.8138
Spearman Cosine: 0.8155
Pearson Manhattan: 0.8156
Spearman Manhattan: 0.8244
Pearson Euclidean: 0.8159
Spearman Euclidean: 0.8248
Pearson Dot: 0.7831
Spearman Dot: 0.7815

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 0.0001
train_batch_size: 16
eval_batch_size: 16
seed: 42
distributed_type: multi-GPU
gradient_accumulation_steps: 8
total_train_batch_size: 128
optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_ratio: 0.1
num_epochs: 1.0

Training results

Training Loss	Epoch	Step	Validation Loss	Pearson Cosine	Spearman Cosine	Pearson Manhattan	Spearman Manhattan	Pearson Euclidean	Spearman Euclidean	Pearson Dot	Spearman Dot
0.6155	0.0469	100	0.0872	0.7869	0.7901	0.7884	0.7903	0.7895	0.7914	0.7575	0.7545
0.4415	0.0937	200	0.0599	0.8037	0.8063	0.8064	0.8128	0.8065	0.8128	0.7711	0.7711
0.4497	0.1406	300	0.0626	0.8158	0.8165	0.8107	0.8179	0.8109	0.8181	0.7944	0.7933
0.4479	0.1874	400	0.0646	0.8130	0.8152	0.8134	0.8209	0.8133	0.8209	0.7848	0.7852
0.4284	0.2343	500	0.0651	0.8051	0.8089	0.8066	0.8134	0.8067	0.8138	0.7820	0.7820
0.3853	0.2812	600	0.0571	0.8038	0.8084	0.8036	0.8107	0.8034	0.8104	0.7922	0.7949
0.3798	0.3280	700	0.0627	0.7998	0.8041	0.7968	0.8043	0.7963	0.8040	0.7824	0.7847
0.3615	0.3749	800	0.0563	0.8038	0.8047	0.8074	0.8113	0.8070	0.8107	0.7799	0.7773
0.3333	0.4217	900	0.0584	0.8054	0.8067	0.8051	0.8101	0.8051	0.8098	0.7824	0.7821
0.3693	0.4686	1000	0.0583	0.7878	0.7864	0.7686	0.7809	0.7679	0.7804	0.7796	0.7773
0.3623	0.5155	1100	0.0574	0.8054	0.8090	0.8054	0.8116	0.8054	0.8116	0.7882	0.7907
0.3795	0.5623	1200	0.0592	0.8028	0.8065	0.8075	0.8133	0.8069	0.8122	0.7769	0.7771
0.3053	0.6092	1300	0.0460	0.8208	0.8220	0.8163	0.8238	0.8166	0.8240	0.8059	0.8056
0.3254	0.6560	1400	0.0519	0.8104	0.8132	0.8115	0.8189	0.8119	0.8194	0.7882	0.7892
0.3399	0.7029	1500	0.0482	0.8192	0.8200	0.8162	0.8242	0.8162	0.8238	0.8010	0.8003
0.3418	0.7498	1600	0.0507	0.8195	0.8227	0.8185	0.8248	0.8191	0.8252	0.7973	0.7978
0.329	0.7966	1700	0.0490	0.8062	0.8075	0.8066	0.8125	0.8070	0.8131	0.7809	0.7785
0.2774	0.8435	1800	0.0436	0.8131	0.8136	0.8124	0.8212	0.8124	0.8213	0.7812	0.7793
0.3113	0.8903	1900	0.0503	0.8192	0.8219	0.8187	0.8256	0.8191	0.8260	0.7944	0.7945
0.3495	0.9372	2000	0.0519	0.8079	0.8110	0.8123	0.8184	0.8121	0.8183	0.7810	0.7810
0.3015	0.9841	2100	0.0466	0.8138	0.8155	0.8156	0.8244	0.8159	0.8248	0.7831	0.7815

Framework versions

Transformers 4.48.0.dev0
Pytorch 2.5.1+cu121
Datasets 3.1.0
Tokenizers 0.21.0