Update README.md
Browse files
README.md
CHANGED
|
@@ -1,6 +1,6 @@
|
|
| 1 |
---
|
| 2 |
base_model:
|
| 3 |
-
-
|
| 4 |
license: apache-2.0
|
| 5 |
datasets:
|
| 6 |
- Danau5tin/terminal-tasks
|
|
@@ -65,7 +65,7 @@ context_refs:
|
|
| 65 |
- 16x for training
|
| 66 |
- 8x inference for Orca-Agent
|
| 67 |
- 8x inference for subagent (Qwen3-Coder-30B-A3B)
|
| 68 |
-
- Trained with GRPO + curriculum learning
|
| 69 |
- Batch size 256, 64 rollouts per task
|
| 70 |
- More details [here](https://github.com/Danau5tin/Orca-Agent-RL)
|
| 71 |
|
|
|
|
| 1 |
---
|
| 2 |
base_model:
|
| 3 |
+
- willcb/Qwen3-14B
|
| 4 |
license: apache-2.0
|
| 5 |
datasets:
|
| 6 |
- Danau5tin/terminal-tasks
|
|
|
|
| 65 |
- 16x for training
|
| 66 |
- 8x inference for Orca-Agent
|
| 67 |
- 8x inference for subagent (Qwen3-Coder-30B-A3B)
|
| 68 |
+
- Trained with GRPO + curriculum learning
|
| 69 |
- Batch size 256, 64 rollouts per task
|
| 70 |
- More details [here](https://github.com/Danau5tin/Orca-Agent-RL)
|
| 71 |
|