omesbah/topo-align

Reinforcement Learning

human-in-the-loop

custom-implementation

topo-align / examples

13.7 kB

1 contributor

History: 1 commit

feat: Introduce `equilib` package with RLHF steering and Sperner dataset generation examples, and add project license.

258dd6d 3 days ago

README.md (551 Bytes)
generate_sperner_dataset.py (4.27 kB)
rlhf_steering_demo.py (8.85 kB)