Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -16,6 +16,6 @@ So far, we have released:
|
|
| 16 |
- [Comma v0.1-1T](https://huggingface.co/common-pile/comma-v0.1-1t) and [Comma v0.1-2T](https://huggingface.co/common-pile/comma-v0.1-2t), 7B parameter LLMs trained on text from the Common Pile v0.1
|
| 17 |
- The [training dataset](https://huggingface.co/datasets/common-pile/comma_v0.1_training_dataset) used to train the Comma v0.1 models
|
| 18 |
- Our [code](https://github.com/r-three/common-pile/) for collecting data from each source
|
| 19 |
-
- Our paper: [The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text](https://
|
| 20 |
|
| 21 |
If you're interested in contributing, please [open an issue on GitHub](https://github.com/r-three/common-pile/issues/new)!
|
|
|
|
| 16 |
- [Comma v0.1-1T](https://huggingface.co/common-pile/comma-v0.1-1t) and [Comma v0.1-2T](https://huggingface.co/common-pile/comma-v0.1-2t), 7B parameter LLMs trained on text from the Common Pile v0.1
|
| 17 |
- The [training dataset](https://huggingface.co/datasets/common-pile/comma_v0.1_training_dataset) used to train the Comma v0.1 models
|
| 18 |
- Our [code](https://github.com/r-three/common-pile/) for collecting data from each source
|
| 19 |
+
- Our paper: [The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text](https://huggingface.co/papers/2506.05209)
|
| 20 |
|
| 21 |
If you're interested in contributing, please [open an issue on GitHub](https://github.com/r-three/common-pile/issues/new)!
|