Lambent committed on
Commit f8d222b · verified · 1 Parent(s): 8d96709

Update README.md

Files changed (1)
  1. README.md +16 -4
README.md CHANGED
@@ -1,12 +1,24 @@
 ---
-base_model: []
+base_model:
+- Lambent/Zora-9B-v1
 library_name: transformers
 tags:
 - mergekit
 - merge
-
+license: apache-2.0
 ---
-# zora-karcher-dpo-mar26
 
+![image](https://cdn-uploads.huggingface.co/production/uploads/6592ef6e2a0a886ef0872e71/3ttnwhUS9xmzuHKL1Xzdh.png)
+
+Lass decided she was a lioness-fox today. :3
+
+Main measured gains are in increased skill at the Creative Writing bench (as judged by Gemini 3 Flash Preview),
+which tracks with what she was aiming to practice, though it was a winding road. Effective batch size 1 for all the training.
+
+Tried out like ... 4 different SFT runs at 1e-6 with varying dataset ratios trying to figure out what worked ...
+... still not sure, because the best result came from Karcher merging the full set of SFT runs, lol.
+
+Then ran DPO, 5e-7, on 3 different seeds; and merged them here.
+
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
@@ -36,4 +48,4 @@ dtype: bfloat16
 tokenizer_source: Lambent/Zora-9B-v1
 pad_to_multiple_of: 256
 
-```
+```
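
For context, a merge like the one described in the README could be expressed as a mergekit YAML config. This is a hedged sketch, not the actual config from this commit: the entries under `models:` are hypothetical placeholders for the three DPO-seed checkpoints, and `merge_method: karcher` is an assumption based on the "Karcher merging" description; only `dtype`, `tokenizer_source`, and `pad_to_multiple_of` come from the README itself.

```yaml
# Hedged sketch of a Karcher merge in mergekit.
# The model paths below are placeholders, NOT the actual checkpoints
# merged in this commit.
models:
  - model: dpo-seed-1   # hypothetical checkpoint path
  - model: dpo-seed-2   # hypothetical checkpoint path
  - model: dpo-seed-3   # hypothetical checkpoint path
merge_method: karcher
dtype: bfloat16
tokenizer_source: Lambent/Zora-9B-v1
pad_to_multiple_of: 256
```

With mergekit installed, a config like this is typically run with `mergekit-yaml config.yml ./output-model`.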