Datasets used for benchmarking computational cost and inference efficiency of SLMs in customer service QA experiments.