Phu Nguyen commited on
Commit
bf35015
·
verified ·
1 Parent(s): 896acb1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -5
README.md CHANGED
@@ -10,13 +10,11 @@
10
  - **Algorithm**: GRPO
11
  - **Reward Modeling**
12
  - **Answer correctness reward**
13
- <img src="https://cdn-uploads.huggingface.co/production/uploads/67c563afa34e1ad5a3533ccf/X15GjihIRO9hkfL361Pfd.png" width="500">
14
-
15
  - **Format correctness reward**
16
- <img src="https://cdn-uploads.huggingface.co/production/uploads/67c563afa34e1ad5a3533ccf/ib5bJu4lMkREigExRAUn9.png" width="500">
17
-
18
  - **Final reward function**
19
- <img src="https://cdn-uploads.huggingface.co/production/uploads/67c563afa34e1ad5a3533ccf/UXsKqJIFjCpT_vUUSTigr.png" width="500">
20
 
21
  For a deeper look into the implementation details, refer to the our repository: [Intelligent-Internet/ii-thought](https://github.com/Intelligent-Internet/ii-thought/tree/main).
22
 
 
10
  - **Algorithm**: GRPO
11
  - **Reward Modeling**
12
  - **Answer correctness reward**
13
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/67c563afa34e1ad5a3533ccf/X15GjihIRO9hkfL361Pfd.png" width="300">
 
14
  - **Format correctness reward**
15
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/67c563afa34e1ad5a3533ccf/ib5bJu4lMkREigExRAUn9.png" width="300">
 
16
  - **Final reward function**
17
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/67c563afa34e1ad5a3533ccf/UXsKqJIFjCpT_vUUSTigr.png" width="300">
18
 
19
  For a deeper look into the implementation details, refer to the our repository: [Intelligent-Internet/ii-thought](https://github.com/Intelligent-Internet/ii-thought/tree/main).
20