Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
datasysdev
/
helm-d-130m-hyperbolic
like
0
Text Generation
open-thoughts/OpenThoughts-114k
HuggingFaceTB/smollm-corpus
English
hyperbolic
lorentz
geometric-deep-learning
language-model
chain-of-thought
reasoning
arxiv:
2505.24722
License:
mit
Model card
Files
Files and versions
xet
Community
main
helm-d-130m-hyperbolic
2.42 GB
Ctrl+K
Ctrl+K
1 contributor
History:
9 commits
datasysdev
feat: upload 200M CoT checkpoint (step 5600, loss ~4.7)
8371cd9
verified
4 days ago
.gitattributes
Safe
1.52 kB
initial commit
5 days ago
README.md
Safe
8.63 kB
docs: update model card for 200M CoT training run
4 days ago
cot_step5600.pt
pickle
Detected Pickle imports (5)
"torch.BoolStorage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.ComplexFloatStorage"
,
"torch.FloatStorage"
What is a pickle import?
2.42 GB
xet
feat: upload 200M CoT checkpoint (step 5600, loss ~4.7)
4 days ago