agentbench / tests / test_langchain_baseline

Commit History

fix: stream stage events live, thread source_chunks, fix LangChain wrapper
77c4ed4

Nomearod Claude Opus 4.6 (1M context) committed on

fix: mypy errors in agent factory (model_name, type annotation)
6d1ce7f

Nomearod Claude Opus 4.6 (1M context) committed on

fix: deferred imports, match iteration budget, token cost tracking
2c64504

Nomearod Claude Opus 4.6 (1M context) committed on

feat: langchain evaluation CLI script and Makefile target
9f98da1

Nomearod Claude Opus 4.6 (1M context) committed on

feat: langchain evaluation runner producing EvalResult objects
2ee54ac

Nomearod Claude Opus 4.6 (1M context) committed on

feat: langchain tool-calling agent factory
352bc78

Nomearod Claude Opus 4.6 (1M context) committed on

fix: ruff violations and sync fallback safety for nested event loops
41f1194

Nomearod Claude Opus 4.6 (1M context) committed on

feat: langchain search tool with metadata capture + calculator
98f8930

Nomearod Claude Opus 4.6 (1M context) committed on

feat: langchain retriever wrapper over existing async hybrid retriever
f5d9df4

Nomearod Claude Opus 4.6 (1M context) committed on