AI & ML interests

Large Language Model evaluation, benchmarking, factual reasoning, multi-hop QA, latency analysis, and reproducible AI evaluation pipelines.

models 0

None public yet

datasets 0

None public yet