AI & ML interests
Large Language Model evaluation, benchmarking, factual reasoning, multi-hop QA, latency analysis, and reproducible AI evaluation pipelines.
enlatics-ai-research's datasets
None public yet