cortex / benchmark /tasks.py

Commit History

Enhance benchmark and Cortex modules with new training utilities and improved state management. Update README with example output for Llama-3.2-1B and add training CLI for Cortex module tuning. Refactor scoring functions to reset Cortex state between examples and ensure consistent output. Modify task handling to ensure proper formatting of input data.
0de2901

theapemachine commited on

Add benchmark harness: tasks.py - Standard NLP task definitions
e014dff
verified

theapemachine commited on