yulan-team/YuLan-Mini-Before-Annealing
Tags: Safetensors, yulanmini, optimizer_states, custom_code
arXiv: 2412.17743
License: mit
Branch: main
Repository: YuLan-Mini-Before-Annealing (36.1 GB, 1 contributor, 13 commits)
Latest commit: "Update README.md" by IvanHU (7a063d0, verified), 11 months ago
Name                         Size        Last commit                      Updated
global_step243198_universal  -           Upload correct optimizer states  about 1 year ago
.gitattributes               1.58 kB     Upload deepspeed checkpoint      about 1 year ago
README.md                    7.66 kB     Update README.md                 11 months ago
config.json                  2 kB        Update config.json               12 months ago
configuration_yulanmini.py   15.3 kB     Upload correct optimizer states  about 1 year ago
latest_universal             27 Bytes    Upload correct optimizer states  about 1 year ago
model.safetensors            4.85 GB     Remove wrong checkpoint #1       about 1 year ago
modeling_yulanmini.py        75.8 kB     Upload correct optimizer states  about 1 year ago
special_tokens_map.json      552 Bytes   Upload deepspeed checkpoint      about 1 year ago
tokenizer.json               4.72 MB     Upload deepspeed checkpoint      about 1 year ago
tokenizer_config.json        2.52 kB     Upload deepspeed checkpoint      about 1 year ago
trainer_state.json           742 Bytes   Upload deepspeed checkpoint      about 1 year ago