declare-lab/Emma-X
Image-Text-to-Text • 8B • Updated • 6 • 10
Natural Language Processing
On the Limits of LLM-as-Judge for Scientific Novelty Assessment
GRAIL: Gradient-Reweighted Advantages for Reinforcement Learning with Verifiable Rewards