dicta-il
/

dictalm2.0

Text Generation

text-generation-inference

Model card Files Files and versions

Shaltiel commited on Jul 10, 2024

Commit

f8ab320

·

verified ·

1 Parent(s): 55af0f5

Update README.md

Files changed (1) hide show

README.md +11 -3

README.md CHANGED Viewed

@@ -13,11 +13,11 @@ inference:
 [<img src="https://i.ibb.co/5Lbwyr1/dicta-logo.jpg" width="300px"/>](https://dicta.org.il)
-# Model Card for DictaLM-2.0
 The DictaLM-2.0 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters trained to specialize in Hebrew text.
-For full details of this model please read our [release blog post](https://dicta.org.il/dicta-lm).
 This is the full-precision base model.
 You can view and access the full collection of base/instruct unquantized/quantized versions of `DictaLM-2.0` [here](https://huggingface.co/collections/dicta-il/dicta-lm-20-collection-661bbda397df671e4a430c27).
@@ -98,5 +98,13 @@ DictaLM 2.0 is a pretrained base model and therefore does not have any moderatio
 If you use this model, please cite:
 ```bibtex
-[Will be added soon]
 ```

 [<img src="https://i.ibb.co/5Lbwyr1/dicta-logo.jpg" width="300px"/>](https://dicta.org.il)
+# Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities
 The DictaLM-2.0 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters trained to specialize in Hebrew text.
+For full details of this model please read our [release blog post](https://dicta.org.il/dicta-lm) or the [technical report](https://arxiv.org/abs/2407.07080).
 This is the full-precision base model.
 You can view and access the full collection of base/instruct unquantized/quantized versions of `DictaLM-2.0` [here](https://huggingface.co/collections/dicta-il/dicta-lm-20-collection-661bbda397df671e4a430c27).
 If you use this model, please cite:
 ```bibtex
+@misc{shmidman2024adaptingllmshebrewunveiling,
+      title={Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities},
+      author={Shaltiel Shmidman and Avi Shmidman and Amir DN Cohen and Moshe Koppel},
+      year={2024},
+      eprint={2407.07080},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2407.07080},
+}
 ```