Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
omesbah
/
topo-align
like
0
Reinforcement Learning
rlhf
alignment
topology
mathematics
sperner-lemma
human-in-the-loop
library
custom-implementation
License:
mit
Model card
Files
Files and versions
xet
Community
main
topo-align
/
examples
13.7 kB
1 contributor
History:
1 commit
omesbah
feat: Introduce `equilib` package with RLHF steering and Sperner dataset generation examples, and add project license.
258dd6d
3 days ago
README.md
Safe
551 Bytes
feat: Introduce `equilib` package with RLHF steering and Sperner dataset generation examples, and add project license.
3 days ago
generate_sperner_dataset.py
Safe
4.27 kB
feat: Introduce `equilib` package with RLHF steering and Sperner dataset generation examples, and add project license.
3 days ago
rlhf_steering_demo.py
Safe
8.85 kB
feat: Introduce `equilib` package with RLHF steering and Sperner dataset generation examples, and add project license.
3 days ago