Added training of RL environment script with unsloth and HF GPU cloud ffeaf35 Addyk24 commited on 21 days ago
Added basic configs for env needed with dockerfile for deployment on space 675c21c Addyk24 commited on 22 days ago
feat: add eval baseline script (inference testing for env), report, prompter, and env config fdd6ae0 Addyk24 commited on 22 days ago