storytracer committed on
Commit 1ff0185 · verified · 1 Parent(s): 3a075ab

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -16,6 +16,6 @@ So far, we have released:
  - [Comma v0.1-1T](https://huggingface.co/common-pile/comma-v0.1-1t) and [Comma v0.1-2T](https://huggingface.co/common-pile/comma-v0.1-2t), 7B parameter LLMs trained on text from the Common Pile v0.1
  - The [training dataset](https://huggingface.co/datasets/common-pile/comma_v0.1_training_dataset) used to train the Comma v0.1 models
  - Our [code](https://github.com/r-three/common-pile/) for collecting data from each source
- - Our paper: [The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text](https://arxiv.org/abs/2506.05209)
+ - Our paper: [The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text](https://huggingface.co/papers/2506.05209)
 
  If you're interested in contributing, please [open an issue on GitHub](https://github.com/r-three/common-pile/issues/new)!