Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
LisaMegaWatts
/
SymbioGPT-10M
like
0
PyTorch
English
symbiogenesis
multi-organelle
monarch-mixer
philosophy
License:
mit
Model card
Files
Files and versions
xet
Community
main
SymbioGPT-10M
/
data
1.38 GB
1 contributor
History:
4 commits
LisaMegaWatts
Add 2MB val text sample for Gemma/HF tokenizer notebooks
588ff7f
verified
2 days ago
train_curated.txt.tokens.pt
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.IntStorage"
What is a pickle import?
1.06 GB
xet
Add curated training tokens (266M tokens, Chinchilla-optimal)
3 days ago
train_curated_sample.txt
20 MB
xet
Add 20MB raw text sample for Gemma/HF tokenizer notebooks
2 days ago
val.txt.tokens.pt
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.IntStorage"
,
"collections.OrderedDict"
What is a pickle import?
289 MB
xet
Add validation tokens (72M tokens)
3 days ago
val_sample.txt
2 MB
Add 2MB val text sample for Gemma/HF tokenizer notebooks
2 days ago