Update README.md
Browse files
README.md
CHANGED
|
@@ -19,12 +19,19 @@ duplicated_from: psmathur/model_007_13b_v2
|
|
| 19 |
|
| 20 |
# model-007-2-13b
|
| 21 |
|
| 22 |
-
A modified fork
|
| 23 |
|
| 24 |
-
|
| 25 |
-
Sharded model (~8 GB peak RAM usage): polymer/model-007-2-13b-sharded
|
| 26 |
|
| 27 |
-
Original model
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 28 |
|
| 29 |
# model_007_13b_v2
|
| 30 |
|
|
@@ -206,4 +213,5 @@ Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams,
|
|
| 206 |
Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom},
|
| 207 |
year={2023}
|
| 208 |
}
|
| 209 |
-
```
|
|
|
|
|
|
| 19 |
|
| 20 |
# model-007-2-13b
|
| 21 |
|
| 22 |
+
A modified fork of [psmathur/model_007_13b_v2](https://huggingface.co/psmathur/model_007_13b_v2) prepared for training with the Hugging Face Transformers library.
|
| 23 |
|
| 24 |
+
### Links
|
|
|
|
| 25 |
|
| 26 |
+
Original model: [psmathur/model_007_13b_v2](https://huggingface.co/psmathur/model_007_13b_v2)
|
| 27 |
+
|
| 28 |
+
Sharded model (~8 GB peak RAM usage during loading): [polymer/model-007-2-13b-sharded](https://huggingface.co/polymer/model-007-2-13b-sharded)
|
| 29 |
+
|
| 30 |
+
### Original model card
|
| 31 |
+
|
| 32 |
+
The model card from the original repository:
|
| 33 |
+
|
| 34 |
+
<blockquote>
|
| 35 |
|
| 36 |
# model_007_13b_v2
|
| 37 |
|
|
|
|
| 213 |
Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom},
|
| 214 |
year={2023}
|
| 215 |
}
|
| 216 |
+
```
|
| 217 |
+
</blockquote>
|