Amartya77
/

RLHF_PPOppo_model

Reinforcement Learning

text2text-generation


Amartya77 committed on Jan 24, 2024

Commit db7a170 · verified · 1 Parent(s): 71c1986

Create README.md

Files changed (1)

README.md +6 -0

README.md ADDED

@@ -0,0 +1,6 @@

+---
+license: mit
+pipeline_tag: reinforcement-learning
+tags:
+- code
+---