linden713 committed on
Commit e749432 · verified · 1 Parent(s): 02f9ef5

Update README.md

  1. README.md +8 -4
README.md CHANGED
@@ -5,7 +5,11 @@ language:
  base_model:
  - google/gemma-3-4b-it
  ---
- con_learn highrl: continue learning with rl:8e-5 and r:64, max epoch:1
- con learn r16: continue learning with rl:2e-5 and r:16, max epoch:1
- con learn r64: continue learning with rl:2e-5 and r:64, max epoch:1
- finetune_firefly: finetune on firefly with rl:1e-4 and r:16, max epoch:5
+ ### con_learn highrl
+ Continue learning with rl:8e-5 and r:64, max epoch:1
+ ### con learn r16
+ Continue learning with rl:2e-5 and r:16, max epoch:1
+ ### con learn r64
+ Continue learning with rl:2e-5 and r:64, max epoch:1
+ ### finetune_firefly
+ Finetune on firefly with rl:1e-4 and r:16, max epoch:5
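
For reference, the four variants described in the README can be written down as structured data. This is only a sketch under stated assumptions: the README does not define its abbreviations, so "rl" is assumed here to mean learning rate and "r" is assumed to be an adapter rank (for example a LoRA rank). The names `RunConfig`, `RUNS`, and `describe` are hypothetical and are not part of the repository.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunConfig:
    """One README entry. Field meanings are assumptions, not confirmed by the repo."""
    name: str
    learning_rate: float  # README writes "rl"; assumed to mean learning rate
    rank: int             # README writes "r"; assumed to be an adapter (e.g. LoRA) rank
    max_epochs: int       # README writes "max epoch"


# Values copied from the README entries in the diff above.
RUNS = [
    RunConfig("con_learn highrl", 8e-5, 64, 1),
    RunConfig("con learn r16", 2e-5, 16, 1),
    RunConfig("con learn r64", 2e-5, 64, 1),
    RunConfig("finetune_firefly", 1e-4, 16, 5),
]


def describe(run: RunConfig) -> str:
    """Format one run as a single human-readable line."""
    return (
        f"{run.name}: learning_rate={run.learning_rate:g}, "
        f"rank={run.rank}, max_epochs={run.max_epochs}"
    )


if __name__ == "__main__":
    for run in RUNS:
        print(describe(run))
```

Laid out this way, it is easier to see that the three continue-learning variants differ only in learning rate and rank, while the firefly finetune is the only one trained for more than one epoch.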