matlok 's Collections

Papers - RL - Consistency Model (RLCM)

a multi-step Markov Decision Process, allowing one to fine-tune consistency models toward a downstream task using just a reward function.