lll2343 committed
Commit ec105fd · verified · 1 Parent(s): 71164b1

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -194,7 +194,7 @@ Trade-off between performance and speed under different confidence thresholds τ
  Our training setting is:
 
  <p align="center">
- <img src="https://huggingface.co/OpenGVLab/SDLM-32B-D4/resolve/main/assets/hyper-param.png" width="50%"></a>
+ <img src="https://github.com/OpenGVLab/SDLM/blob/main/assets/hyper-param.png" width="50%"></a>
  </p>
 
  The training loss of our 3B model. loss_pos_`i` refers to the loss at the `i`-th position of each block. The loss at `i=0` is close to the SFT loss of AR's NTP.
@@ -214,7 +214,7 @@ Trade-off between performance and speed under different confidence thresholds τ
  | :-- | :-- | :-- | :-- | :-- | :-- | :-- | :-- |
  | loss_pos_1 | loss_pos_2 | loss_pos_3 | loss_pos_4 | -- | -- | -- | -- |
 
- ![](https://huggingface.co/OpenGVLab/SDLM-32B-D4/resolve/main/assets/train_log_3b.png)
+ ![](https://github.com/OpenGVLab/SDLM/blob/main/assets/train_log_3b.png)
 
  ## Evaluation
 
@@ -223,7 +223,7 @@ Currently, we use [Opencompass](https://github.com/open-compass/opencompass) for
  ## Case
 
  <p align="center">
- <img src="https://huggingface.co/OpenGVLab/SDLM-32B-D4/resolve/main/assets/case.gif" width="70%"></a>
+ <img src="https://github.com/OpenGVLab/SDLM/blob/main/assets/case.gif" width="70%"></a>
  </p>
 
  ## Acknowledge