Request for reproducible evaluation details for claimed ParseBench result

#1
by boyang-runllama - opened

Hi, thanks for sharing your ParseBench result.

The model card appears to report only the claimed score on our benchmark, without enough detail for us to verify or reproduce it.

Could you please provide the evaluation details, including:

  • The exact model used (checkpoint or revision)
  • Evaluation config and prompts

Without these details, we cannot validate the claimed result or compare it fairly with other submissions on the leaderboard.

Thanks!
