---
tags:
- generated_from_trainer
model-index:
- name: pali-captioning-lm-sweep
  results: []
---

[Visualize in Weights & Biases](https://wandb.ai/colemanhaley/pali-captioning-lm-sweep/runs/93x0gbhc)

# pali-captioning-lm-sweep

This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 2.9665

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 0.0001
- train_batch_size: 8
- eval_batch_size: 4
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 2
- training_steps: 70804

### Training results

| Training Loss | Epoch  | Step  | Validation Loss |
|:-------------:|:------:|:-----:|:---------------:|
| No log        | 0      | 0     | 10.3383         |
| 1.9493        | 0.0040 | 10000 | 3.4608          |
| 1.8267        | 0.0081 | 20000 | 3.3065          |
| 1.6539        | 0.0121 | 30000 | 3.2140          |
| 1.6238        | 0.0162 | 40000 | 3.1555          |
| 1.5725        | 0.0202 | 50000 | 3.0853          |
| 1.4285        | 0.0242 | 60000 | 3.0110          |
| 1.4442        | 0.0283 | 70000 | 2.9665          |

### Framework versions

- Transformers 4.42.4
- Pytorch 2.3.1+cu121
- Tokenizers 0.19.1
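
The `lr_scheduler_type: linear` and `lr_scheduler_warmup_steps: 2` settings listed under the training hyperparameters imply a learning rate that ramps up over 2 steps and then decays linearly to zero at step 70804. The following is a minimal sketch in plain Python of what that schedule would look like, assuming it follows the multiplier used by Transformers' `get_linear_schedule_with_warmup`. The function name `linear_lr` and the constant names are illustrative and do not come from the training script.

```python
# Values copied from the hyperparameter list in this card.
BASE_LR = 1e-4
WARMUP_STEPS = 2
TRAINING_STEPS = 70804


def linear_lr(step: int,
              base_lr: float = BASE_LR,
              warmup_steps: int = WARMUP_STEPS,
              training_steps: int = TRAINING_STEPS) -> float:
    """Learning rate at `step` under linear warmup followed by linear decay.

    Assumption: mirrors the multiplier of a linear-with-warmup schedule;
    this is a sketch, not the code that produced the run.
    """
    if step < warmup_steps:
        # Warmup phase: scale from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay phase: scale from base_lr down to 0 at training_steps.
    remaining = training_steps - step
    return base_lr * max(0.0, remaining / max(1, training_steps - warmup_steps))


if __name__ == "__main__":
    # Print the learning rate at the evaluation steps reported in the table.
    for step in range(0, 70001, 10000):
        print(step, linear_lr(step))
```

With only 2 warmup steps, the schedule is effectively a straight linear decay from 0.0001, so the later evaluation rows in the table were recorded at progressively smaller learning rates.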