cortex / benchmark

Commit History

Add benchmark harness: tasks.py - Standard NLP task definitions
e014dff
verified

theapemachine commited on

Add benchmark harness: memory_tasks.py - Passkey retrieval and multi-hop memory
81ff944
verified

theapemachine commited on

Add benchmark harness: __init__.py
7dd817a
verified

theapemachine commited on

Add benchmark harness: scoring.py
4c1ba64
verified

theapemachine commited on