Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Sean13
/
mistral-7b-instruct-v0.2-ripo-full
like
0
Text Generation
Transformers
TensorBoard
Safetensors
mistral
Generated from Trainer
trl
dpo
conversational
text-generation-inference
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
main
mistral-7b-instruct-v0.2-ripo-full
/
runs
1.56 MB
Ctrl+K
Ctrl+K
1 contributor
History:
5 commits
Sean13
Model save
a7b21e5
verified
9 months ago
Aug01_04-37-55_pm-d04f
Training in progress, step 467
9 months ago
Aug03_11-53-53_pm-d04f
Training in progress, step 467
9 months ago
Aug03_11-57-37_pm-d04f
Training in progress, step 467
9 months ago
Aug03_22-28-48_pm-d04f
Model save
9 months ago
Aug04_03-16-18_pm-d04f
Model save
9 months ago
Jul31_18-56-18_pm-d04f
Training in progress, step 467
9 months ago
Jul31_22-32-19_pm-d04f
Model save
9 months ago