Update README.md
README.md
CHANGED
@@ -4,5 +4,7 @@ base_model:
 - ServiceNow-AI/Apriel-Nemotron-15b-Thinker
 ---
 Quick and dirty finetune of Apriel, using an improved dataset produced by scoring all replies with a Reward model, then discarding scores <5/5.
+
 Tried to filter for impersonation as well, but LLama 8B was too stupid.
+
 Seems to like really low temp and a touch of DRY.
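
The dataset filtering mentioned in the README (score every reply with a reward model, then discard anything below 5/5) could look roughly like the sketch below. This is only an illustration under assumptions, not the author's actual script: `score_reply` is a hypothetical stand-in for whatever reward model was used, the `prompt`/`reply` record layout is assumed, and `toy_scorer` exists purely so the example runs on its own.

```python
from typing import Callable, Dict, Iterable, List

# Assumed record layout: each sample is a dict with "prompt" and "reply" keys.
Sample = Dict[str, str]


def filter_by_reward(
    samples: Iterable[Sample],
    score_reply: Callable[[str, str], int],
    min_score: int = 5,
) -> List[Sample]:
    """Keep only samples whose reply scores at least min_score.

    score_reply is a hypothetical callable wrapping a reward model;
    it is assumed to return an integer score on a 1-5 scale.
    """
    kept = []
    for sample in samples:
        score = score_reply(sample["prompt"], sample["reply"])
        if score >= min_score:
            kept.append(sample)
    return kept


def toy_scorer(prompt: str, reply: str) -> int:
    """Placeholder scorer for the demo: one point per word, capped at 5."""
    return min(5, len(reply.split()))


if __name__ == "__main__":
    data = [
        {"prompt": "hi", "reply": "a full and detailed answer here"},
        {"prompt": "hi", "reply": "short"},
    ]
    # Only the first sample reaches the 5/5 threshold with the toy scorer.
    print(filter_by_reward(data, toy_scorer))
```

With a real reward model, `score_reply` would wrap the model's inference call; the threshold of 5 mirrors the "discarding scores <5/5" rule from the README.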