AI & ML interests

Moving LLM evaluation from static snapshots to stateful agency. I am developing a framework to measure functional intelligence through long-horizon reasoning, evolving state management, and compute-efficiency—prioritizing real-world persistence over surface-level fluency.

models 0

None public yet

datasets 0

None public yet