lllqaq/R2EGym-7B-Agent-Coder-Instruct-filtered2

This model is a full SFT fine-tune of Qwen/Qwen2.5-Coder-7B-Instruct on traj_gpt5mini (filtered2).

Export source checkpoint: /data/jiarong/LLaMA-Factory/saves/R2EGym-7B-Agent-Coder-Instruct2/checkpoint-188