Update README to add title and Gradient logo
#7 by tpeng726 - opened
README.md CHANGED
@@ -6,6 +6,9 @@ tags:
 - meta
 - llama-3
 ---
+<img src="https://cdn-uploads.huggingface.co/production/uploads/655bb613e8a8971e89944f3e/TSa3V8YpoVagnTYgxiLaO.png" width="200"/>
+
+# Llama-3 8B Instruct 262k
 
 
 
@@ -40,6 +43,14 @@ For training data, we generate long contexts by augmenting [SlimPajama](https://
 | # GPUs | 8 | 8 |
 | GPU Type | NVIDIA L40S| NVIDIA L40S|
 
+## The Gradient AI Team
+
+Gradient is accelerating AI transformation across industries. https://gradient.ai/
+
+## Contact Us
+
+Drop an email to [contact@gradient.ai](mailto:contact@gradient.ai)
+
 ## References
 
 [1] Peng, Bowen, et al. "Yarn: Efficient context window extension of large language models." arXiv preprint arXiv:2309.00071 (2023).
@@ -48,13 +59,6 @@ For training data, we generate long contexts by augmenting [SlimPajama](https://
 
 [3] https://github.com/jzhang38/EasyContext
 
-## The Gradient AI Team
-
-Gradient is accelerating AI transformation across industries. https://gradient.ai/
-
-## Contact Us
-
-Drop an email to [contact@gradient.ai](mailto:contact@gradient.ai)
 
 ----
 