AI & ML interests
Large Language Model evaluation, benchmarking, factual reasoning, multi-hop QA, latency analysis, and reproducible AI evaluation pipelines.
No public activity
Large Language Model evaluation, benchmarking, factual reasoning, multi-hop QA, latency analysis, and reproducible AI evaluation pipelines.
No public activity