BpJian Liu

SuperBJ711

AI & ML interests

None yet

Recent Activity

commented on an article 23 days ago

GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms

Organizations

None yet

commented on GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms 23 days ago

Hello, thank you for the detailed GSMA Open-Telco LLM Benchmarks 2.0 write-up.

I have a question regarding the TeleYAML evaluation.

In the blog post, TeleYAML is evaluated using an LLM-as-a-Judge approach. Could you please clarify whether the judge prompt or evaluation instructions are publicly available?

Any pointer to a repo, paper, or supplementary material would be greatly appreciated.

Thank you.
