meta-llama/CodeLlama-70b-hf
Text Generation • 69B • Updated • 52 • 27
CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production
TLDR: Token-Level Detective Reward Model for Large Vision Language Models