Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
yulan-team
/
YuLan-Mini-Before-Annealing
like
7
Follow
RUC-GSAI-YuLan
46
Safetensors
yulanmini
optimizer_states
custom_code
arxiv:
2412.17743
License:
mit
Model card
Files
Files and versions
xet
Community
2
Copy to bucket
new
main
YuLan-Mini-Before-Annealing
Ctrl+K
Ctrl+K
1 contributor
History:
13 commits
IvanHU
Update README.md
7a063d0
verified
about 1 year ago
global_step243198_universal
Upload correct optimizer states
over 1 year ago
.gitattributes
1.58 kB
Upload deepspeed checkpoint
over 1 year ago
README.md
7.66 kB
Update README.md
about 1 year ago
config.json
2 kB
Update config.json
over 1 year ago
configuration_yulanmini.py
15.3 kB
Upload correct optimizer states
over 1 year ago
latest_universal
27 Bytes
Upload correct optimizer states
over 1 year ago
model.safetensors
4.85 GB
xet
Remove wrong checkpoint #1
over 1 year ago
modeling_yulanmini.py
75.8 kB
Upload correct optimizer states
over 1 year ago
special_tokens_map.json
Safe
552 Bytes
Upload deepspeed checkpoint
over 1 year ago
tokenizer.json
Safe
4.72 MB
Upload deepspeed checkpoint
over 1 year ago
tokenizer_config.json
Safe
2.52 kB
Upload deepspeed checkpoint
over 1 year ago
trainer_state.json
742 Bytes
Upload deepspeed checkpoint
over 1 year ago