Collection of LLM Evaluation Frameworks
- iioos/llm-evaluation-model
  Updated
- Continuous Latent Diffusion Language Model
  Paper • 2605.06548 • Published • 78
- ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration
  Paper • 2605.03042 • Published • 120
- CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence
  Paper • 2605.12882 • Published • 262