SParsh003/LifeOS-Trained-Agent

Reinforcement Learning

text-generation-inference

SParsh003 committed about 1 month ago

Commit c6b4c71 · verified · 1 Parent(s): 3002480

Upload README.md with huggingface_hub

Files changed (1)

README.md +1 -1

@@ -18,7 +18,7 @@ thumbnail: https://huggingface.co/spaces/SParsh003/LifeOS-Personal-Chaos-Agen/re
 # 🧬 LifeOS Trained Agent (Mistral-7B-Instruct-v0.3)
-![LifeOS Agent Banner](https://huggingface.co/spaces/SParsh003/LifeOS-Personal-Chaos-Agen/resolve/main/docs/Starting.png)
+![LifeOS Agent Banner](https://huggingface.co/spaces/SParsh003/LifeOS-Personal-Chaos-Agen/resolve/main/docs/Architecture.png)
 This model was trained to survive the chaos of an unpredictable, stressful student week using **GRPO (Group Relative Policy Optimization)** within the [LifeOS OpenEnv](https://github.com/itzzSPcoder/LifeOS) simulation.