Collections

Discover the best community collections!

Collections trending this week
Customer Service QA Fine-tuned SLMs
Fine-tuned SLMs for context-summarized multi-turn customer service response generation.
SLM Cost Benchmarking Datasets
Datasets used for benchmarking computational cost and inference efficiency of SLMs in customer service QA experiments.
Customer Service Human Evaluation Data (Evaluator 2)
Per-model human evaluation datasets (evaluator_2) for customer service client-agent conversations.
Pairwise Comparison Datasets (Virtuoso-Large vs SLMs)
Pairwise comparison datasets used to evaluate SLM responses against Virtuoso-Large on customer service client-agent conversations.
Customer Service Context Summarization Evaluation Data
Per-model evaluation datasets (~10k rows each) for context summarization experiments in customer service conversations.
Customer Service Human Evaluation Data (Evaluator 3)
Per-model human evaluation datasets (evaluator_3) for customer service client-agent conversations.
Customer Service Human Evaluation Data (Evaluator 1)
Per-model human evaluation datasets (evaluator_1) for customer service client-agent conversations.
Pairwise Comparison (Gemini-2.5-Flash vs SLMs)
Pairwise comparison datasets used to evaluate SLM responses against Gemini-2.5-Flash on customer service client-agent conversations.
Customer Service QA Fine-tuned SLMs
Fine-tuned SLMs for context-summarized multi-turn customer service response generation.
Customer Service Context Summarization Evaluation Data
Per-model evaluation datasets (~10k rows each) for context summarization experiments in customer service conversations.
SLM Cost Benchmarking Datasets
Datasets used for benchmarking computational cost and inference efficiency of SLMs in customer service QA experiments.
Customer Service Human Evaluation Data (Evaluator 3)
Per-model human evaluation datasets (evaluator_3) for customer service client-agent conversations.
Customer Service Human Evaluation Data (Evaluator 2)
Per-model human evaluation datasets (evaluator_2) for customer service client-agent conversations.
Customer Service Human Evaluation Data (Evaluator 1)
Per-model human evaluation datasets (evaluator_1) for customer service client-agent conversations.
Pairwise Comparison Datasets (Virtuoso-Large vs SLMs)
Pairwise comparison datasets used to evaluate SLM responses against Virtuoso-Large on customer service client-agent conversations.
Pairwise Comparison (Gemini-2.5-Flash vs SLMs)
Pairwise comparison datasets used to evaluate SLM responses against Gemini-2.5-Flash on customer service client-agent conversations.