cduoduo/TCMConverse-4B-SFT-PPO-MultiReward-Alignment
License: apache-2.0
Commit History: .gitattributes (branch: main)
initial commit (ceffd8f, verified)
cduoduo committed on Nov 16, 2024