22h/open-cabrita3b

celiolarcher committed on Sep 19, 2023

Commit fc2a2de · 1 Parent(s): e2ed3c7

Update README.md

Files changed (1)

README.md +19 -0

@@ -1,3 +1,22 @@
 ---
 license: apache-2.0
+language:
+- pt
+- en
 ---
+Cabrita is a collection of models adapted to Portuguese through continued pre-training and tokenizer adaptation.
+This artifact is the 3-billion-parameter variant.
+The initial weights come from the open_llama_3b checkpoint of the OpenLLaMA project
+(https://github.com/openlm-research/open_llama).
+```
+@misc{larcher2023cabrita,
+      title={Cabrita: closing the gap for foreign languages},
+      author={Celio Larcher and Marcos Piau and Paulo Finardi and Pedro Gengo and Piero Esposito and Vinicius Caridá},
+      year={2023},
+      eprint={2308.11878},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```