Addyk24 commited on
Commit
a6a41cd
·
verified ·
1 Parent(s): e3b7195

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -226,7 +226,7 @@ Turn 1: message_expert → All [PENALTY: -0.3]
226
  Turn 2: message_expert → All [PENALTY: -0.4 repeat]
227
  Turn 3: submit_final → "The app should be good" [Score: 0.0]
228
  ```
229
- * 📄 **[View the Before GRPO Training Metrics](./baseline_results_medium__llm.json)**
230
 
231
 
232
  ![Telemetry Dashboard](before_reward_distribution_per_ep.png)
@@ -245,7 +245,7 @@ Turn 7: submit_final → "Budget capped at $50k. Biometric 2FA required.
245
 
246
  ---
247
  ## 🛠 Training Logs
248
- * 📄 **[View the Raw GRPO Training Metrics](artifacts/grpo_state_based/grpo_metrics.json)**
249
 
250
  <br>
251
 
 
226
  Turn 2: message_expert → All [PENALTY: -0.4 repeat]
227
  Turn 3: submit_final → "The app should be good" [Score: 0.0]
228
  ```
229
+ * 📄 **[View the Before GRPO Training Metrics](baseline_results_medium__llm.json)**
230
 
231
 
232
  ![Telemetry Dashboard](before_reward_distribution_per_ep.png)
 
245
 
246
  ---
247
  ## 🛠 Training Logs
248
+ * 📄 **[View the Raw GRPO Training Metrics](grpo_metrics.json)**
249
 
250
  <br>
251