- Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
  Paper • 2406.12624 • Published • 37
- A Survey on LLM-as-a-Judge
  Paper • 2411.15594 • Published
- LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
  Paper • 2412.05579 • Published • 2
- From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
  Paper • 2411.16594 • Published • 39
Lu Zhang
kaitou951
AI & ML interests
None yet
Recent Activity
upvoted a paper about 15 hours ago: Retrieving Any Relevant Moments: Benchmark and Models for Generalized Moment Retrieval
liked a dataset about 16 hours ago: diiiA22B9S/Soccer-GMR
updated a collection 9 months ago: Daily Papers
Organizations
None yet