Commit eafa937 by Danau5tin (verified) · 1 parent: 9b66289

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -1,6 +1,6 @@
  ---
  base_model:
- - Qwen/Qwen3-14B
+ - willcb/Qwen3-14B
  license: apache-2.0
  datasets:
  - Danau5tin/terminal-tasks
@@ -65,7 +65,7 @@ context_refs:
  - 16x for training
  - 8x inference for Orca-Agent
  - 8x inference for subagent (Qwen3-Coder-30B-A3B)
- - Trained with GRPO + curriculum learning (2 stages of RL, with increasing task difficulty each stage)
+ - Trained with GRPO + curriculum learning
  - Batch size 256, 64 rollouts per task
  - More details [here](https://github.com/Danau5tin/Orca-Agent-RL)