feat: add T4-optimized SFT and RLVR training scripts, evaluation utilities, and updated documentation 58916ea Navigam commited on Apr 26
feat: add training pipeline with SFT and RLVR support for Qwen 2.5-3B-Instruct 5c8287c Navigam commited on Apr 26
feat: enhance training environment and documentation for CORP-ENV models abaaa50 Navigam commited on Apr 26
chore: increase max prompt and sequence lengths in training scripts c0d85b8 Navigam commited on Apr 26
feat: add new training jobs and scripts for DeepSeek and Nemotron models f0688bd Navigam commited on Apr 26
refactor: update training scripts and environment setup for Qwen3 model ef0aeea Navigam commited on Apr 25
refactor: update training scripts and documentation for SFT and RLVR processes 4e1a75b Navigam commited on Apr 25
feat: add environment setup and requirements for Lightning AI H100 training b737c1e Navigam commited on Apr 25
feat: update README and runbook for SFT and GRPO training enhancements 6b13adb Navigam commited on Apr 25
feat: update evaluation results and training scripts for Qwen2.5-7B-Instruct 6e2b9c3 Navigam commited on Apr 25
feat: update training scripts and add judge-rerunnable notebook for CORP-ENV 97b9312 Navigam commited on Apr 25