PAD: Personalized Alignment at Decoding-time. ICLR 2025
This repo contains the personalized reward model (PerRM) for alignment.
Our paper: https://openreview.net/pdf?id=e7AUJpP8bV
- Downloads last month
- 1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for RuizheChen/PAD
Base model
meta-llama/Meta-Llama-3-8B