tilman-d/sf-diogenes-v0.2

Text Generation

instruction-tuned

tilman-d committed on Nov 18, 2025

Commit 8c26dca · verified · Parent: 1b734a4

Update README.md

Files changed (1)

README.md +1 -1

@@ -14,7 +14,7 @@ pipeline_tag: text-generation
 ## TL;DR
 - 14B-parameter continuation of the Diogenes alignment work, distilled from the v0.1 80B run so it fits on a single high-memory GPU.
-- Same curated Salesforce assistant dataset (102,827 prompt/response pairs) re-tokenized with the Qwen3 chat template and optimized for pragmatic, enumerated answers.
+- Curated Salesforce dataset (102,827 prompt/response pairs) re-tokenized with the Qwen3 chat template and optimized for pragmatic, enumerated answers.
 - Trained with Unsloth’s 4-bit QLoRA stack (`rank=32`) and merged back to full precision weights for drop-in `transformers` usage.
 ## Release Highlights