infused-reasoning-phi2 / training_info.json
Pierizvi's picture
epoch-639
c9d92fa verified
{
"step": 639,
"best_reward": 1.008333444595337,
"timestamp": "2025-05-06 19:35:56"
}