Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -328,3 +328,4 @@ Training Scope	Only LoRA weights updated; main model remains fixed
 This approach enables self-corrective, explainable, and meta-aware learning, pushing beyond standard RLHF and toward autonomous reasoning agents.
 ![GMPo Diagram](https://huggingface.co/liberalusa/liberalmind_bin/blob/main/kl_critic_plot.png)


328	This approach enables self-corrective, explainable, and meta-aware learning, pushing beyond standard RLHF and toward autonomous reasoning agents.
329
330	![GMPo Diagram](https://huggingface.co/liberalusa/liberalmind_bin/blob/main/kl_critic_plot.png)
331	+