feat(eval): add METEOR + optional LLM-as-judge for VQA scoring
8f6cf28 convitom Claude Opus 4.7 committed 16 days ago