AI & ML interests
Large Language Model evaluation, benchmarking, factual reasoning, multi-hop QA, latency analysis, and reproducible AI evaluation pipelines.
models 0
None public yet
datasets 0
None public yet
Large Language Model evaluation, benchmarking, factual reasoning, multi-hop QA, latency analysis, and reproducible AI evaluation pipelines.