How to reproduce the GLM-5.2 benchmarks across various datasets?

#11
by tuo02 - opened

Hi, team! I would like to reproduce the GLM-5.2 benchmarks across various datasets. How did you measure these scores? Is there a detailed documentation available?

Sign up or log in to comment