Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

omesbah
/
topo-align

Reinforcement Learning
rlhf
alignment
topology
mathematics
sperner-lemma
human-in-the-loop
library
custom-implementation
Model card Files Files and versions
xet
Community
topo-align / examples
13.7 kB
  • 1 contributor
History: 1 commit
omesbah's picture
omesbah
feat: Introduce `equilib` package with RLHF steering and Sperner dataset generation examples, and add project license.
258dd6d 3 days ago
  • README.md
    551 Bytes
    feat: Introduce `equilib` package with RLHF steering and Sperner dataset generation examples, and add project license. 3 days ago
  • generate_sperner_dataset.py
    4.27 kB
    feat: Introduce `equilib` package with RLHF steering and Sperner dataset generation examples, and add project license. 3 days ago
  • rlhf_steering_demo.py
    8.85 kB
    feat: Introduce `equilib` package with RLHF steering and Sperner dataset generation examples, and add project license. 3 days ago