Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
GPRM
/
Mistral-7B-PairRM-DPO
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
Mistral-7B-PairRM-DPO
256 MB
1 contributor
History:
3 commits
GPRM
Create README.md
e6eb525
verified
9 months ago
.gitattributes
1.52 kB
initial commit
9 months ago
README.md
39 Bytes
Create README.md
9 months ago
adapter_config.json
778 Bytes
Upload folder using huggingface_hub
9 months ago
adapter_model.safetensors
83.9 MB
xet
Upload folder using huggingface_hub
9 months ago
optimizer.pt
168 MB
xet
Upload folder using huggingface_hub
9 months ago
rng_state_0.pth
14.5 kB
xet
Upload folder using huggingface_hub
9 months ago
rng_state_1.pth
14.5 kB
xet
Upload folder using huggingface_hub
9 months ago
scheduler.pt
1.06 kB
xet
Upload folder using huggingface_hub
9 months ago
special_tokens_map.json
437 Bytes
Upload folder using huggingface_hub
9 months ago
tokenizer.json
3.51 MB
Upload folder using huggingface_hub
9 months ago
tokenizer.model
493 kB
xet
Upload folder using huggingface_hub
9 months ago
tokenizer_config.json
2.17 kB
Upload folder using huggingface_hub
9 months ago
trainer_state.json
3.91 kB
Upload folder using huggingface_hub
9 months ago
training_args.bin
5.5 kB
xet
Upload folder using huggingface_hub
9 months ago