Update README.md
Browse files
README.md
CHANGED
|
@@ -14,7 +14,7 @@ pipeline_tag: text-generation
|
|
| 14 |
|
| 15 |
## TL;DR
|
| 16 |
- 14B-parameter continuation of the Diogenes alignment work, distilled from the v0.1 80B run so it fits on a single high-memory GPU.
|
| 17 |
-
-
|
| 18 |
- Trained with Unsloth’s 4-bit QLoRA stack (`rank=32`) and merged back to full precision weights for drop-in `transformers` usage.
|
| 19 |
|
| 20 |
## Release Highlights
|
|
|
|
| 14 |
|
| 15 |
## TL;DR
|
| 16 |
- 14B-parameter continuation of the Diogenes alignment work, distilled from the v0.1 80B run so it fits on a single high-memory GPU.
|
| 17 |
+
- Curated Salesforce dataset (102,827 prompt/response pairs) re-tokenized with the Qwen3 chat template and optimized for pragmatic, enumerated answers.
|
| 18 |
- Trained with Unsloth’s 4-bit QLoRA stack (`rank=32`) and merged back to full precision weights for drop-in `transformers` usage.
|
| 19 |
|
| 20 |
## Release Highlights
|