Danau5tin/Orca-Agent-v0.1

Danau5tin committed on Nov 3, 2025

Commit eafa937 · verified · 1 Parent(s): 9b66289

Update README.md

Files changed (1)

README.md +2 -2

@@ -1,6 +1,6 @@
 ---
 base_model:
-- Qwen/Qwen3-14B
+- willcb/Qwen3-14B
 license: apache-2.0
 datasets:
 - Danau5tin/terminal-tasks
@@ -65,7 +65,7 @@ context_refs:
   - 16x for training
   - 8x inference for Orca-Agent
   - 8x inference for subagent (Qwen3-Coder-30B-A3B)
-- Trained with GRPO + curriculum learning (2 stages of RL, with increasing task difficulty each stage)
+- Trained with GRPO + curriculum learning
 - Batch size 256, 64 rollouts per task
 - More details [here](https://github.com/Danau5tin/Orca-Agent-RL)