Update README.md
Browse files
README.md
CHANGED
|
@@ -5,7 +5,11 @@ language:
|
|
| 5 |
base_model:
|
| 6 |
- google/gemma-3-4b-it
|
| 7 |
---
|
| 8 |
-
con_learn highrl
|
| 9 |
-
|
| 10 |
-
con learn
|
| 11 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 5 |
base_model:
|
| 6 |
- google/gemma-3-4b-it
|
| 7 |
---
|
| 8 |
+
### con_learn highrl
|
| 9 |
+
Continue learning with rl:8e-5 and r:64, max epoch:1
|
| 10 |
+
### con learn r16
|
| 11 |
+
Continue learning with rl:2e-5 and r:16, max epoch:1
|
| 12 |
+
### con learn r64
|
| 13 |
+
Continue learning with rl:2e-5 and r:64, max epoch:1
|
| 14 |
+
### finetune_firefly
|
| 15 |
+
Finetune on firefly with rl:1e-4 and r:16, max epoch:5
|