- **Developed by:** Cohere For AI
- **Model type:** a Transformer-style autoregressive massively multilingual language model.
- **Paper:** [Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model](https://arxiv.org/abs/2402.07827)
- **Point of Contact:** Cohere For AI: [cohere.for.ai](https://cohere.for.ai)
- **Languages:** Refer to the list of languages in the `language` section of this model card.
- **License:** Apache-2.0

The Aya model is trained on the following datasets:

- [DataProvenance collection](https://huggingface.co/datasets/DataProvenanceInitiative/Commercially-Verified-Licenses)
- ShareGPT-Command

All datasets are subsetted to the 101 languages supported by [mT5]. See the [paper](https://arxiv.org/abs/2402.07827) for details about filtering and pruning.

## Evaluation

We refer to Section 5 of our paper for multilingual evaluation across 99 languages, including discriminative and generative tasks, human evaluation, and simulated win rates that cover both held-out tasks and in-distribution performance.

## Bias, Risks, and Limitations

For a detailed overview of our efforts at safety mitigation and benchmarking toxicity and bias across multiple languages, we refer to Sections 6 and 7 of our paper: [Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model](https://arxiv.org/abs/2402.07827).

We hope that the release of the Aya model will make community-based red-teaming efforts possible, by exposing an open-source, massively multilingual model for community research.
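As a minimal sketch of how a model like this could be loaded for inference with the Hugging Face `transformers` library: the checkpoint id `CohereForAI/aya-101`, the prompt, and the generation settings below are illustrative assumptions, not taken from this card; substitute the repository name shown in the model card header.

```python
# Hypothetical usage sketch for an mT5-based (encoder-decoder) multilingual
# instruction model. The checkpoint id below is an assumption.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

checkpoint = "CohereForAI/aya-101"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

# Instruction-style prompt in any of the supported languages.
inputs = tokenizer("Translate to English: Je t'aime.", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Note that `AutoModelForSeq2SeqLM` is used here because the card describes an mT5-derived model; a decoder-only checkpoint would instead use `AutoModelForCausalLM`.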