Update README.md
Browse files
README.md
CHANGED
|
@@ -68,4 +68,39 @@ on multiple web sources. We intend to conduct research in these areas in the fut
|
|
| 68 |
|
| 69 |
### Instruction Data
|
| 70 |
|
| 71 |
-
The training corpus is composed of 140B tokens gathered from web crawlings and public domain data.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 68 |
|
| 69 |
### Instruction Data
|
| 70 |
|
| 71 |
+
The training corpus is composed of 140B tokens gathered from web crawlings and public domain data.
|
| 72 |
+
|
| 73 |
+
## Additional information
|
| 74 |
+
|
| 75 |
+
### Author
|
| 76 |
+
The Language Technologies Unit from Barcelona Supercomputing Center.
|
| 77 |
+
|
| 78 |
+
### Contact
|
| 79 |
+
For further information, please send an email to <langtech@bsc.es>.
|
| 80 |
+
|
| 81 |
+
### Copyright
|
| 82 |
+
Copyright(c) 2023 by Language Technologies Unit, Barcelona Supercomputing Center.
|
| 83 |
+
|
| 84 |
+
### License
|
| 85 |
+
[Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0)
|
| 86 |
+
|
| 87 |
+
### Funding
|
| 88 |
+
This work was funded by [Departament de la Vicepresidència i de Polítiques Digitals i Territori de la Generalitat de Catalunya](https://politiquesdigitals.gencat.cat/ca/inici/index.html#googtrans(ca|en) within the framework of [Projecte AINA](https://politiquesdigitals.gencat.cat/ca/economia/catalonia-ai/aina).
|
| 89 |
+
|
| 90 |
+
### Disclaimer
|
| 91 |
+
|
| 92 |
+
<details>
|
| 93 |
+
<summary>Click to expand</summary>
|
| 94 |
+
|
| 95 |
+
The model published in this repository is intended for a generalist purpose and is available to third parties under a permissive Apache License, Version 2.0.
|
| 96 |
+
|
| 97 |
+
Be aware that the model may have biases and/or any other undesirable distortions.
|
| 98 |
+
|
| 99 |
+
When third parties deploy or provide systems and/or services to other parties using this model (or any system based on it)
|
| 100 |
+
or become users of the model, they should note that it is their responsibility to mitigate the risks arising from its use and,
|
| 101 |
+
in any event, to comply with applicable regulations, including regulations regarding the use of Artificial Intelligence.
|
| 102 |
+
|
| 103 |
+
In no event shall the owner and creator of the model (Barcelona Supercomputing Center)
|
| 104 |
+
be liable for any results arising from the use made by third parties.
|
| 105 |
+
|
| 106 |
+
</details>
|