liberal commited on
Commit
eb8a54b
·
verified ·
1 Parent(s): ef1c71c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -22,7 +22,7 @@ tags:
22
  # contact: Twitter: https://x.com/BogUnusov Telegram: @Quloneco email: qulone.corpo@gmail.com
23
 
24
  🧠 Adaptive Reasoning Loop with Critic-Driven GMPo and Intuition Feedback
25
- LiberalMind v1.5 is trained using a custom reinforcement learning system that extends classical RLHF and diverges from standard GMPO (Generative Model Policy Optimization). Instead, it employs a reasoning-centered pipeline we call GMPo (Generate–Match–Plan–Optimize) augmented with a Critic Loop and a novel intuition-based meta-signal.
26
 
27
  This design targets more explainable, structurally grounded reasoning via RL updates, optimized with KL-divergence regularization and guided feedback from a Critic module.
28
 
 
22
  # contact: Twitter: https://x.com/BogUnusov Telegram: @Quloneco email: qulone.corpo@gmail.com
23
 
24
  🧠 Adaptive Reasoning Loop with Critic-Driven GMPo and Intuition Feedback
25
+ Arctic AI is trained using a custom reinforcement learning system that extends classical RLHF and diverges from standard GMPO (Generative Model Policy Optimization). Instead, it employs a reasoning-centered pipeline we call GMPo (Generate–Match–Plan–Optimize) augmented with a Critic Loop and a novel intuition-based meta-signal.
26
 
27
  This design targets more explainable, structurally grounded reasoning via RL updates, optimized with KL-divergence regularization and guided feedback from a Critic module.
28