Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
TRI-ML
/
DCLM-1B
like
15
Follow
Toyota Research Institute
172
Transformers
Safetensors
openlm
arxiv:
2406.11794
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
4
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (2)
Sort: Recently created
Is this model supported for finetuning with flash attention ?
#4 opened 8 months ago by
thaodd11
MMLU Performance After Token Training
👍
2
#3 opened over 1 year ago by
adol01