AI & ML interests

Design and evaluation of LLM agents and impact measurements

evalscience's models

None public yet