Add Terminal-Bench evaluation result (27.8%)

#64
by burtenshaw HF Staff - opened

Sign up or log in to comment