Hello, thank you for the detailed GSMA Open-Telco LLM Benchmarks 2.0 write-up.
I have a question regarding the TeleYAML evaluation.
In the blog, TeleYAML is evaluated using an LLM-as-a-Judge approach. Could you please clarify whether the judge prompt or evaluation instructions are publicly available?
Any pointer to a repo, paper, or supplementary material would be greatly appreciated.
Thank you.