lightblue/suzume-llama-3-8B-multilingual

ptrdvn committed on May 22, 2024

Commit a91a263 · verified · 1 Parent(s): c7b55e8

Update README.md

Files changed (1)

README.md +17 -0

@@ -17,6 +17,8 @@ model-index:
 # Suzume
+[[Paper](https://arxiv.org/abs/2405.12612)] [[Dataset](https://huggingface.co/datasets/lightblue/tagengo-gpt4)]
 This Suzume 8B, a multilingual finetune of Llama 3 ([meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)).
 Llama 3 has exhibited excellent performance on many English language benchmarks.
@@ -262,6 +264,21 @@ The following hyperparameters were used during training:
 - Datasets 2.18.0
 - Tokenizers 0.15.0
+# How to cite
+Please cite [this paper](https://arxiv.org/abs/2405.12612) when referencing this model.
+```tex
+@misc{devine2024tagengo,
+      title={Tagengo: A Multilingual Chat Dataset},
+      author={Peter Devine},
+      year={2024},
+      eprint={2405.12612},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
 # Developer
 Peter Devine - ([ptrdvn](https://huggingface.co/ptrdvn))