Update README.md
README.md CHANGED
@@ -62,7 +62,7 @@ print(tokenizer.decode(outputs[0]))
 # 🟠 Amber Training Details
 
 ## Datasets and Mix
-[
+[Access the fully processed Amber pretraining data here](https://huggingface.co/datasets/LLM360/AmberDatasets)
 | Subset | Tokens (Billion) |
 | ----------- | ----------- |
 | Arxiv | 30.00 |