Buckets:

MichaHenh
/

cil-sentiment-analysis

200 MB

10 files

Updated about 1 month ago

Ctrl+K

Name	Size	Uploaded	Xet hash
.gitattributes	1.58 kB xet	about 1 month ago	1913ee3d
README.md	2.56 kB xet	about 1 month ago	dc64b882
adapter_config.json	1.12 kB xet	about 1 month ago	adf63f07
adapter_model.safetensors	116 MB xet	about 1 month ago	da92e95d
decoder_config.json	205 Bytes xet	about 1 month ago	ad77e68d
metrics.json	1.28 kB xet	about 1 month ago	21a115a0
noise_weighting_summary.json	313 Bytes xet	about 1 month ago	8654f98f
oof_diagnostics.csv	74.9 MB xet	about 1 month ago	9a46a1e5
training_args.bin	5.52 kB xet	about 1 month ago	f47233cb
val_error_dataframe.csv	9.37 MB xet	about 1 month ago	232d8541

README.md

cil-noise-weight-q5-xlmr-large-seed1

This model is a fine-tuned version of xlm-roberta-large on the None dataset. It achieves the following results on the evaluation set:

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 0.00015
train_batch_size: 64
eval_batch_size: 1024
seed: 1
optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 100
num_epochs: 1
mixed_precision_training: Native AMP

Training Loss	Epoch	Step	Validation Loss	Accuracy	Map Mae	Bayes Mae	Expected Score Mae
1.1054	0.1411	500	0.8717	0.6246	0.4273	0.4203	0.4875
0.8149	0.2822	1000	0.8269	0.6487	0.3873	0.3873	0.4555
0.7660	0.4233	1500	0.8029	0.6539	0.3825	0.3839	0.4438
0.7476	0.5643	2000	0.7872	0.6571	0.3860	0.3809	0.4351
0.7361	0.7054	2500	0.7922	0.6606	0.3796	0.3771	0.4285
0.7281	0.8465	3000	0.7943	0.6615	0.3771	0.3757	0.4254
0.7269	0.9876	3500	0.7867	0.6622	0.3763	0.3747	0.4254
0.7269	1.0	3544	0.7867	0.6623	0.3763	0.3746	0.4254