license: apache-2.0
library_name: peft
tags:
  - trl
  - sft
  - generated_from_trainer
base_model: petals-team/falcon-rw-1b
model-index:
  - name: falcon-rw-1b-code-generation-llm-task2-modelC
    results: []

falcon-rw-1b-code-generation-llm-task2-modelC

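This model is a PEFT adapter fine-tuned from petals-team/falcon-rw-1b using TRL supervised fine-tuning (SFT); the training dataset was not recorded. At the last logged evaluation (step 180), it achieves the following result on the evaluation set: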

Loss: 1.6594

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 2
eval_batch_size: 8
seed: 42
gradient_accumulation_steps: 2
total_train_batch_size: 4
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: cosine
lr_scheduler_warmup_ratio: 0.03
training_steps: 600
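
The relationship between these values can be checked with a little arithmetic. The sketch below recomputes the total batch size and the warmup length, then evaluates a cosine schedule with linear warmup. It makes several assumptions the card does not confirm: a single training device, warmup steps rounded up from `warmup_ratio * training_steps`, and decay to zero at the final step. The names `num_devices` and `lr_at` are illustrative only.

```python
import math

# Values copied from the hyperparameter list above.
learning_rate = 1e-05
train_batch_size = 2
gradient_accumulation_steps = 2
warmup_ratio = 0.03
training_steps = 600

# Assumption: one training device (the card does not state the device count).
num_devices = 1
total_train_batch_size = train_batch_size * gradient_accumulation_steps * num_devices

# Assumption: warmup length is the ratio times the total steps, rounded up.
warmup_steps = math.ceil(warmup_ratio * training_steps)


def lr_at(step: int) -> float:
    """Learning rate at a given optimizer step: linear warmup, then cosine decay to zero."""
    if step < warmup_steps:
        return learning_rate * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, training_steps - warmup_steps)
    return learning_rate * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    print("total_train_batch_size:", total_train_batch_size)
    print("warmup_steps:", warmup_steps)
    for step in (0, warmup_steps, 180, training_steps):
        print(step, lr_at(step))
```

Under these assumptions the learning rate peaks at 1e-05 once warmup ends and is still close to its peak at step 180, the last step shown in the results table.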

Training results

Training Loss	Epoch	Step	Validation Loss
1.626	0.0356	20	1.7087
1.9368	0.0712	40	1.6675
1.4542	0.1068	60	1.6467
1.2704	0.1423	80	1.6474
1.1888	0.1779	100	1.6618
0.9006	0.2135	120	1.6415
1.1376	0.2491	140	1.6583
0.9937	0.2847	160	1.6454
0.8624	0.3203	180	1.6594
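
The table stops at step 180, although `training_steps` is listed as 600, and the validation loss does not fall steadily while the training loss does. A small sketch for reading the table follows: it finds the row with the lowest validation loss and estimates the steps per epoch from the logged epoch fraction. The estimate is rough because the epoch column is rounded, and the variable names are illustrative.

```python
# Rows copied from the training results table:
# (training loss, epoch, step, validation loss)
rows = [
    (1.626, 0.0356, 20, 1.7087),
    (1.9368, 0.0712, 40, 1.6675),
    (1.4542, 0.1068, 60, 1.6467),
    (1.2704, 0.1423, 80, 1.6474),
    (1.1888, 0.1779, 100, 1.6618),
    (0.9006, 0.2135, 120, 1.6415),
    (1.1376, 0.2491, 140, 1.6583),
    (0.9937, 0.2847, 160, 1.6454),
    (0.8624, 0.3203, 180, 1.6594),
]

# Row with the lowest validation loss.
best = min(rows, key=lambda r: r[3])

# Last logged row; its validation loss is the figure reported at the top of the card.
last = rows[-1]

# Rough steps-per-epoch estimate from the last row (step / epoch fraction).
steps_per_epoch = last[2] / last[1]

if __name__ == "__main__":
    print("best step:", best[2], "validation loss:", best[3])
    print("last step:", last[2], "validation loss:", last[3])
    print("approx. steps per epoch:", round(steps_per_epoch))
```

The reported loss of 1.6594 is the last logged value, not the lowest one in the table, which occurs at step 120.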

Framework versions

PEFT 0.10.0
Transformers 4.40.0
Pytorch 2.2.1+cu121
Datasets 2.19.0
Tokenizers 0.19.1
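
With the PEFT and Transformers versions listed above, loading the adapter on top of the base model might look like the sketch below. The card does not give the adapter's repository id or local path, so `adapter_path` is a hypothetical argument the caller must supply, and `load_adapter` is an illustrative helper, not part of any library. The imports sit inside the function so the snippet can be read and imported without downloading anything.

```python
def load_adapter(adapter_path: str, base_model_id: str = "petals-team/falcon-rw-1b"):
    """Load the base model and attach a PEFT adapter.

    adapter_path is hypothetical: a local directory or Hub repo id that
    contains the adapter weights. The card does not specify it.
    """
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_model_id)

    # Wrap the base model with the fine-tuned adapter weights.
    model = PeftModel.from_pretrained(base_model, adapter_path)
    model.eval()
    return model, tokenizer


# Example usage (downloads the base model; the path is a placeholder):
# model, tokenizer = load_adapter("path/to/adapter")
# inputs = tokenizer("def fibonacci(n):", return_tensors="pt")
# output_ids = model.generate(**inputs, max_new_tokens=64)
# print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```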