More Benchmarks
#5
by thomasmaindron - opened
Do you plan to evaluate this specific model on more benchmarks? (SWE-bench Verified, AIME26...) Looks promising though!
We don't have plans to add more benchmarks for now, sorry.
Do you plan to evaluate this specific model on more benchmarks? (SWE-bench Verified, AIME26...) Looks promising though!
We don't have plans to add more benchmarks for now, sorry.