jdopensource/JoyAI-LLM-Flash

Text Generation

joyai_llm_flash


bianrongcheng0124 committed on Feb 15

Commit 6175e71 · verified · 1 Parent(s): ee80a7f

Update README.md

Files changed (1)

README.md +1 -1


@@ -25,7 +25,7 @@ JoyAI-LLM Flash is a state-of-the-art medium-sized instruct language model with
 ### Key Features
-- Fiber Bundle RL: invole geometric manifold theory into reinforcement learning, proposing an innovative technique known as FiberPO. This approach is designed to address the growing trends of increasing heterogeneous agent scales.
+- Fiber Bundle RL: Introduces fiber bundle theory into reinforcement learning, proposing a novel optimization framework, FiberPO. This method is specifically designed to handle the challenges of large-scale and heterogeneous agent training, improving stability and robustness under complex data distributions.
 - Training-Inference Collaboration: apply Muon optimizer with dense MTP, develop novel optimization techniques to resolve instabilities while scaling up, delivering 1.3× to 1.7× the throughput of the non-MTP version.
 - Agentic Intelligence: designed for tool use, reasoning, and autonomous problem-solving.