Developer-Amar committed on
Commit b1408c3 · 1 Parent(s): 205dc3f

docs: add GRPO training evidence plots

before_after_comparison.png ADDED

Git LFS Details

  • SHA256: ce57b3413e243bad9e3b91ea1ab55825f786f792768cca4f28c49d393604a41b
  • Pointer size: 130 Bytes
  • Size of remote file: 72 kB
loss_curve.png ADDED

Git LFS Details

  • SHA256: 25196071428f68099fd365fe0591a333042a9ea7b217cdc982fd0092a44bba35
  • Pointer size: 131 Bytes
  • Size of remote file: 144 kB
reward_curve.png ADDED

Git LFS Details

  • SHA256: 5117f3c98e318e6ea40b279afb60b32f2eda378bd4312c3d7e43b3c3467b4448
  • Pointer size: 131 Bytes
  • Size of remote file: 107 kB