Reasoning with Machines Lab, University of Oxford

university

https://oxrml.com/

AI & ML interests

Benchmarks and Evaluation, Agentic AI for Science, AI Safety and Security, Human-AI Interaction

Recent Activity

shreyanshpadarha updated a dataset 1 day ago

shreyanshpadarha authored a paper about 1 month ago

AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

ryanothk authored a paper about 1 month ago

AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

shreyanshpadarha

updated a dataset 1 day ago

OxRML/AgentSLR

Viewer • Updated 1 day ago • 20.9k • 360 • 2

shreyanshpadarha

authored a paper about 1 month ago

AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

Paper • 2603.22327 • Published Mar 20 • 10

shreyanshpadarha

submitted a paper to Daily Papers about 1 month ago

AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

Paper • 2603.22327 • Published Mar 20 • 10

shreyanshpadarha

published a dataset about 2 months ago

OxRML/AgentSLR

Viewer • Updated 1 day ago • 20.9k • 360 • 2

authored 3 papers about 2 months ago

A Few-Shot Semantic Parser for Wizard-of-Oz Dialogues with the Precise ThingTalk Representation

Paper • 2009.07968 • Published Sep 16, 2020

Measuring what Matters: Construct Validity in Large Language Model Benchmarks

Paper • 2511.04703 • Published Nov 3, 2025 • 8

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

shreyanshpadarha

updated a Space about 2 months ago

README

shreyanshpadarha

authored a paper about 2 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

shreyanshpadarha

updated a dataset about 2 months ago

OxRML/MADQA

Viewer • Updated Mar 13 • 3.05k • 544 • 15

shreyanshpadarha

published a Space about 2 months ago

README

shreyanshpadarha

published a dataset about 2 months ago

OxRML/MADQA

Viewer • Updated Mar 13 • 3.05k • 544 • 15

shreyanshpadarha

authored a paper 2 months ago

Agentic Reinforcement Learning for Search is Unsafe

Paper • 2510.17431 • Published Oct 20, 2025 • 5

authored a paper about 1 year ago

Clinical knowledge in LLMs does not translate to human interactions

Paper • 2504.18919 • Published Apr 26, 2025 • 26

authored 2 papers about 1 year ago

Clinical knowledge in LLMs does not translate to human interactions

Paper • 2504.18919 • Published Apr 26, 2025 • 26

LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation

Paper • 2503.02972 • Published Mar 4, 2025 • 25