Sean13/mistral-7b-instruct-v0.2-simpo-full
Text Generation
Transformers
Safetensors
princeton-nlp/mistral-instruct-ultrafeedback
mistral
alignment-handbook
trl
simpo
Generated from Trainer
conversational
text-generation-inference
License: apache-2.0
main
Commit History
b2655bd  End of training (verified), Sean13 committed on Sep 6, 2025
4a14960  Model save (verified), Sean13 committed on Sep 6, 2025
1af18cb  Training in progress, step 233 (verified), Sean13 committed on Sep 6, 2025
40b825b  End of training (verified), Sean13 committed on Sep 6, 2025
75a95f2  Model save (verified), Sean13 committed on Sep 6, 2025
2c422c8  Training in progress, step 233 (verified), Sean13 committed on Sep 6, 2025
4938428  initial commit (verified), Sean13 committed on Sep 6, 2025