Salesforce/blip2-opt-2.7b
Image-Text-to-Text • 4B • Updated • 582k • 443
Learning from Language Feedback via Variational Policy Distillation
The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation