lllqaq's picture
Add README
a08a0ef verified
# lllqaq/R2EGym-7B-Agent-Coder-Instruct-filtered1
This model is a full SFT fine-tune of `Qwen/Qwen2.5-Coder-7B-Instruct` on traj_gpt5mini (filtered1).
Export source checkpoint: `/data/jiarong/LLaMA-Factory/saves/R2EGym-7B-Agent-Coder-Instruct1/checkpoint-442`