Update README.md
README.md
CHANGED
@@ -25,7 +25,7 @@ JoyAI-LLM Flash is a state-of-the-art medium-sized instruct language model with
 
 ### Key Features
 
-- Fiber Bundle RL:
+- Fiber Bundle RL: Introduces fiber bundle theory into reinforcement learning, proposing a novel optimization framework, FiberPO. This method is specifically designed to handle the challenges of large-scale and heterogeneous agent training, improving stability and robustness under complex data distributions.
 - Training-Inference Collaboration: apply Muon optimizer with dense MTP, develop novel optimization techniques to resolve instabilities while scaling up, delivering 1.3× to 1.7× the throughput of the non-MTP version.
 - Agentic Intelligence: designed for tool use, reasoning, and autonomous problem-solving.
 