LifelongAlignment/DPO_CPPO
Lifelong Alignment of Agents
Safetensors
Branch: main
Path: DPO_CPPO/dataset-0/checkpoint-600
Total size: 6.93 GB
Contributors: 1
History: 1 commit
Latest commit: 2901fae (verified) by Shahradmz, 9 months ago: "Upload folder using huggingface_hub"
Every entry below was last changed 9 months ago by the commit "Upload folder using huggingface_hub".

Name                      Size        Storage
global_step600            -           -
added_tokens.json         80 Bytes    -
config.json               731 Bytes   -
generation_config.json    242 Bytes   -
latest                    14 Bytes    -
merges.txt                1.67 MB     -
model.safetensors         988 MB      xet
rng_state_0.pth           15 kB       xet
rng_state_1.pth           15 kB       xet
rng_state_2.pth           15 kB       xet
rng_state_3.pth           15 kB       xet
scheduler.pt              1.06 kB     xet
special_tokens_map.json   367 Bytes   -
tokenizer.json            11.4 MB     xet
tokenizer_config.json     1.33 kB     -
trainer_state.json        17.6 kB     -
training_args.bin         7.99 kB     -
vocab.json                2.78 MB     -
zero_to_fp32.py           33.3 kB     -
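The listing mixes model and tokenizer files with training-state files (RNG states, scheduler, trainer state, and the global_step600 entry). As a minimal sketch, the snippet below fetches only the files that are plausibly needed to load the model, using `snapshot_download` from `huggingface_hub`. The repository ID and folder path are taken from the listing above; the choice of which files count as "inference files" is an assumption, not something the repository states, and the helper names are hypothetical.

```python
from typing import List, Optional

# Taken from the listing above.
REPO_ID = "LifelongAlignment/DPO_CPPO"
CHECKPOINT_DIR = "dataset-0/checkpoint-600"

# Assumption: these are the files needed to load the model and tokenizer.
# Training-state files (rng_state_*.pth, scheduler.pt, trainer_state.json,
# training_args.bin, latest, global_step600, zero_to_fp32.py) are left out.
INFERENCE_FILES = [
    "config.json",
    "generation_config.json",
    "model.safetensors",
    "tokenizer.json",
    "tokenizer_config.json",
    "special_tokens_map.json",
    "added_tokens.json",
    "vocab.json",
    "merges.txt",
]


def inference_patterns(checkpoint_dir: str = CHECKPOINT_DIR) -> List[str]:
    """Build repo-relative paths to pass as allow_patterns."""
    return [f"{checkpoint_dir}/{name}" for name in INFERENCE_FILES]


def download_inference_files(local_dir: Optional[str] = None) -> str:
    """Download only the selected files and return the local snapshot root."""
    # Imported lazily so the pattern helper works without the library installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        revision="main",
        allow_patterns=inference_patterns(),
        local_dir=local_dir,
    )
```

Calling `download_inference_files()` returns the root of the local snapshot; the checkpoint files then sit under `dataset-0/checkpoint-600` inside that directory. Filtering this way skips most of the 6.93 GB total, since the files selected here add up to roughly 1 GB according to the sizes listed.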