Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
cduoduo
/
TCMConverse-4B-SFT-PPO-MultiReward-Alignment
like
0
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
TCMConverse-4B-SFT-PPO-MultiReward-Alignment
Commit History
Update README.md
ee420b2
verified
cduoduo
commited on
Nov 16, 2024
initial commit
ceffd8f
verified
cduoduo
commited on
Nov 16, 2024