Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
new activity about 7 hours ago
oi-uae/oi-OCR:Request for reproducible evaluation details for claimed ParseBench result liked a model about 8 hours ago
deepseek-ai/DeepSeek-V4-Flash updated a collection about 8 hours ago
benchmarksOrganizations
benchmarks
RULER Datasets Falcon-H1-3B-Base
RULER Datasets
RULER Datasets Lamma3-Instruct
RULER Datasets
RULER Datasets Qwen2.5-Instruct
RULER Datasets
RULER Datasets Qwen-3-Instruct
RULER Datasets
RULER Datasets Qwen-3
RULER Datasets
agents
Agents ressources
All the ressources I found / used when getting up to speed with agents.