ChatEval DSTC12

community
https://chateval.org/dstc12
Activity Feed

Members: John Mendonça, Sermo Lab

Johndfm authored 5 papers 5 months ago

ECoh: Turn-level Coherence Evaluation for Multilingual Dialogues

Paper • 2407.11660 • Published Jul 16, 2024

Soda-Eval: Open-Domain Dialogue Evaluation in the age of LLMs

Paper • 2408.10902 • Published Aug 20, 2024

On the Benchmarking of LLMs for Open-Domain Dialogue Evaluation

Paper • 2407.03841 • Published Jul 4, 2024

MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Chatbots and Dialogue Evaluators

Paper • 2505.22777 • Published May 28, 2025

CAMÕES: A Comprehensive Automatic Speech Recognition Benchmark for European Portuguese

Paper • 2508.19721 • Published Aug 27, 2025 • 5

Johndfm updated 3 datasets 10 months ago

chatevaldstc12/ProsocialDialog

Viewer • Updated May 2, 2025 • 892k • 3

chatevaldstc12/bot_adversarial_dialogue

Viewer • Updated May 2, 2025 • 594k • 6

chatevaldstc12/dialogue_safety

Viewer • Updated May 2, 2025 • 240k • 4

Johndfm updated a collection 12 months ago

Task1

Collection
Task 1 Data • 1 item • Updated Mar 10, 2025

Johndfm updated a collection about 1 year ago

Task2

Collection
Datasets for Task 2 • 3 items • Updated Jan 3, 2025