Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AiAF
/
Mistral-7B-Instruct-v0.2_DPO-training-test
like
0
Text Generation
Transformers
Safetensors
mistral
Generated from Trainer
trl
dpo
conversational
text-generation-inference
4-bit precision
bitsandbytes
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Mistral-7B-Instruct-v0.2_DPO-training-test
/
tokenizer.model
Commit History
Training in progress, step 1274
4ee61d5
verified
AiAF
commited on
Aug 21, 2025