Hak5
/

python-llm-phase0_5

Model card Files Files and versions

python-llm-phase0_5 / phase2_report.md

Hak5's picture

Upload SFT artifact (adapter_only) from phase2-codellama-7b-lora-kaggle-1h

868831d verified 2 months ago

|

history blame contribute delete

558 Bytes

Phase 2 SFT Report

Run directory: /kaggle/working/python-llm-grpo-new-techniques/training/output/phase2-codellama-7b-lora-kaggle-1h
Trainer state: /kaggle/working/python-llm-grpo-new-techniques/training/output/phase2-codellama-7b-lora-kaggle-1h/checkpoint-43/trainer_state.json

Metrics

global_step: 43
best_metric: None
last_train_loss: 0.7746126174926757
best_eval_loss: None
last_eval_loss: None
last_learning_rate: 0.00015600000000000002
last_step_logged: 40
num_train_loss_logs: 4
num_eval_loss_logs: 0