RegularizedSelfPlay
/

sppo_forward1reverse5-0.1-Llama-3-8B-Instruct-RSPO-Iter1

Model card Files Files and versions

sppo_forward1reverse5-0.1-Llama-3-8B-Instruct-RSPO-Iter1

1.52 kB

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

Sangwoong's picture

initial commit

d7b42d6 verified over 1 year ago

.gitattributes

1.52 kB
initial commit over 1 year ago