jpacifico commited on
Commit
76f4a9f
·
verified ·
1 Parent(s): 0444b8d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -3
README.md CHANGED
@@ -10,9 +10,22 @@ tags:
10
  - merge
11
 
12
  ---
13
- # merge
14
 
15
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
 
 
 
 
 
 
 
 
 
 
 
 
16
 
17
  # First benchmarks
18
 
@@ -50,7 +63,7 @@ Evaluations were performed using LM Eval Harness, all results are fully reproduc
50
  | jpacifico/bitnet-dpo-merged-modelstock7 | **51,62** |
51
 
52
 
53
- ## Merge Details
54
  ### Merge Method
55
 
56
  This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [jpacifico/bitnet-dpo-merged-modelstock-retrain](https://huggingface.co/jpacifico/bitnet-dpo-merged-modelstock-retrain) as a base.
 
10
  - merge
11
 
12
  ---
13
+ # model Summary
14
 
15
+ **bitnet-dpo-merged-modelstock7** (≈2B, BitNet b1.58)
16
+ A compact, agent-oriented small language model focused on language understanding and contextual decision-making.
17
+ Built with an iterative post-training recipe: bilingual DPO (FR+EN) + model merging of FR-centric and EN-centric variants.
18
+ Runs natively as BitNet 1.58-bit (ternary) and is available in GGUF 1.58-bit, lossless to the BF16 checkpoints.
19
+
20
+ **Why BitNet (and why this model)**
21
+ • BitNet b1.58 uses ternary weights (−1,0,+1) with abs-mean scaling : very low memory & energy, great CPU/edge throughput, unlike classic FP/INT SLMs.
22
+ • ModelStock7 demonstrates that a 2B BitNet can deliver SOTA language understanding in its class without sacrificing efficiency.
23
+
24
+ # Training Recipe
25
+
26
+ -Bilingual DPO (FR+EN) to sharpen preference selection across two languages.
27
+ -Model merging (FR-centric + EN-centric) to broaden stylistic/lexical coverage.
28
+ Goal: agent-oriented behavior → better instruction following, contextual disambiguation, and pragmatic reasoning in multi-turn settings.
29
 
30
  # First benchmarks
31
 
 
63
  | jpacifico/bitnet-dpo-merged-modelstock7 | **51,62** |
64
 
65
 
66
+ ## Last checkpoint
67
  ### Merge Method
68
 
69
  This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [jpacifico/bitnet-dpo-merged-modelstock-retrain](https://huggingface.co/jpacifico/bitnet-dpo-merged-modelstock-retrain) as a base.