Tune Jury
TuneJury Landing Page
An open reward model for music generation preference alignment.
TuneJury is a follow-up to Music Arena (NeurIPS 2025, Creative AI Track), the live A/B human-preference arena for text-to-music. Where Music Arena collects head-to-head human votes on generated music, TuneJury distills those votes, together with three other open human-preference datasets, into a single reusable reward model. A lightweight head over frozen music encoders maps an audio clip and an optional text prompt to one preference score.
The same frozen reward drives three downstream uses: inference-time best-of-N selection, DITTO-style latent optimization, and expert-iteration post-training. Anchor calibration adapts the score to generators released after training, without retraining.
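As an illustration of the first use, best-of-N selection can be sketched in a few lines: generate N candidate clips for a prompt, score each with the reward model, and keep the highest-scoring one. The sketch below is a minimal, generic version and not the released TuneJury API. `best_of_n`, `score_fn`, and `toy_score` are hypothetical names introduced here, and `toy_score` is a stand-in for a real reward model so that the example runs without any checkpoint.

```python
from typing import Callable, Optional, Sequence, Tuple, TypeVar

Clip = TypeVar("Clip")  # any audio representation: waveform, latent, file path, ...


def best_of_n(
    candidates: Sequence[Clip],
    score_fn: Callable[[Clip, Optional[str]], float],
    prompt: Optional[str] = None,
) -> Tuple[Clip, float]:
    """Return the candidate with the highest reward, together with its score.

    `score_fn` is a hypothetical callable standing in for a reward model:
    it takes a clip and an optional text prompt and returns one float.
    Ties are broken in favor of the earliest candidate.
    """
    if not candidates:
        raise ValueError("best_of_n needs at least one candidate")
    scores = [score_fn(clip, prompt) for clip in candidates]
    best_idx = max(range(len(scores)), key=scores.__getitem__)
    return candidates[best_idx], scores[best_idx]


def toy_score(clip: Sequence[float], prompt: Optional[str] = None) -> float:
    """Placeholder reward (an assumption for this demo, not a real model).

    Prefers clips whose mean sample value is close to zero.
    """
    return -abs(sum(clip) / len(clip))


if __name__ == "__main__":
    # Three fake "clips" represented as short lists of samples.
    clips = [[0.9, 0.8], [0.1, -0.1], [0.5, 0.4]]
    best, score = best_of_n(clips, toy_score, prompt="calm piano")
    print(best, score)
```

In practice `score_fn` would wrap a call to a released checkpoint, and the candidates would come from N samples of the same generator with different seeds. Because the reward model stays frozen, the cost of selection grows linearly with N.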
This organization hosts the released TuneJury checkpoints and the listening demo.
Links