Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Amartya77
/
RLHF_PPOppo_model
like
0
Reinforcement Learning
Transformers
Safetensors
mt5
text2text-generation
code
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
Amartya77
commited on
Jan 24, 2024
Commit
db7a170
·
verified
·
1 Parent(s):
71c1986
Create README.md
Browse files
Files changed (1)
hide
show
README.md
+6
-0
README.md
ADDED
Viewed
@@ -0,0 +1,6 @@
1
+
---
2
+
license: mit
3
+
pipeline_tag: reinforcement-learning
4
+
tags:
5
+
- code
6
+
---