Adam Mahdi
ammaox
·
AI & ML interests
LLMs, multimodal AI
Recent Activity
updated
a Space 37 minutes ago
OxRML/README updated
a dataset 3 days ago
OxRML/MADQA upvoted a paper 4 months ago
Measuring what Matters: Construct Validity in Large Language Model
Benchmarks