Phase 2 SFT Report
- Run directory:
/kaggle/working/python-llm-grpo-new-techniques/training/output/phase2-codellama-7b-lora-kaggle-1h - Trainer state:
/kaggle/working/python-llm-grpo-new-techniques/training/output/phase2-codellama-7b-lora-kaggle-1h/checkpoint-43/trainer_state.json
Metrics
- global_step:
43 - best_metric:
None - last_train_loss:
0.7746126174926757 - best_eval_loss:
None - last_eval_loss:
None - last_learning_rate:
0.00015600000000000002 - last_step_logged:
40 - num_train_loss_logs:
4 - num_eval_loss_logs:
0