More Benchmarks

#5
by thomasmaindron - opened

Do you plan to evaluate this model on more benchmarks (SWE-bench Verified, AIME26, ...)? It looks promising!

Orion LLM Labs org

We don't have plans to add more benchmarks for now, sorry.
