liberal committed on
Update README.md
README.md
CHANGED
@@ -328,3 +328,4 @@ Training Scope Only LoRA weights updated; main model remains fixed
 This approach enables self-corrective, explainable, and meta-aware learning, pushing beyond standard RLHF and toward autonomous reasoning agents.
 
 
+