---
language:
- en
- fr
---

# Model Summary

**bitnet-dpo-merged-modelstock7** *(2.41B params / 4096-token maximum context length)*
A compact, agent-oriented small language model focused on language understanding and contextual decision-making.

Iterative DPO + Model merging:

- Bilingual DPO (FR+EN) to sharpen preference selection across two languages, using the following datasets:
  [jpacifico/french-orca-dpo-pairs-revised](https://huggingface.co/datasets/jpacifico/french-orca-dpo-pairs-revised)
  [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs)
- Model merging (ModelStock and TIES methods, via [Mergekit](https://github.com/cg123/mergekit)) to combine the complementary strengths of bilingual models (FR-centric + EN-centric), improving robustness across reasoning and comprehension tasks while maintaining stability.
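The bilingual DPO step optimizes the standard DPO objective: increase the policy's log-probability margin between chosen and rejected responses relative to a frozen reference model. A minimal numeric sketch in plain Python (no training framework; the log-probabilities below are invented, purely illustrative values):

```python
import math

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """DPO loss for a single preference pair.

    Inputs are log-probabilities of the chosen/rejected responses under
    the trained policy (pi_*) and the frozen reference model (ref_*).
    """
    margin = beta * ((pi_chosen - ref_chosen) - (pi_rejected - ref_rejected))
    # -log(sigmoid(margin)), computed stably as log1p(exp(-margin))
    return math.log1p(math.exp(-margin))

# Invented values: the policy prefers the chosen answer more strongly than
# the reference does, so the loss falls below log(2) (~0.693, the
# no-preference-signal value).
loss = dpo_loss(pi_chosen=-12.0, pi_rejected=-15.0,
                ref_chosen=-13.0, ref_rejected=-14.0, beta=0.1)
print(round(loss, 4))
```

Minimizing this loss pushes the margin up, i.e. it teaches the policy to rank the preferred (here FR or EN) response above the rejected one.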

# First benchmarks
dtype: bfloat16
tokenizer_source: jpacifico/bitnet-dpo-merged-modelstock-retrain
```
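The Model Stock and TIES merges above operate on full checkpoints through Mergekit; the core idea of weight-space merging can be sketched as a toy element-wise average in plain Python (tensor names and values here are invented, and the real merge methods add interpolation weights, sign resolution, and other refinements):

```python
# Toy weight-space merge: element-wise average of two checkpoints.
# Real Model Stock / TIES merges (via Mergekit) are more sophisticated,
# but the underlying idea is combining parameters of sibling models.
def merge_average(state_a, state_b):
    assert state_a.keys() == state_b.keys(), "checkpoints must share tensor names"
    return {name: [(x + y) / 2 for x, y in zip(state_a[name], state_b[name])]
            for name in state_a}

# Invented tiny "checkpoints" for an FR-centric and an EN-centric model.
fr_model = {"layer.weight": [0.25, 1.0], "layer.bias": [0.0, 0.5]}
en_model = {"layer.weight": [0.75, 0.5], "layer.bias": [0.5, 1.0]}
merged = merge_average(fr_model, en_model)
print(merged)  # {'layer.weight': [0.5, 0.75], 'layer.bias': [0.25, 0.75]}
```

Because both parents share the same architecture, merging happens tensor-by-tensor; the config above additionally pins the tokenizer to one parent via `tokenizer_source`.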

- **Developed by:** Jonathan Pacifico, 2025
- **Model type:** LLM
- **Language(s) (NLP):** French, English
- **License:** MIT

Made with ❤️ in France