--- license: bigscience-openrail-m language: - en base_model: - Qwen/Qwen2.5-Coder-3B-Instruct pipeline_tag: translation --- ### Performance on the BIRD Development Set We further evaluate **DatA-SQL-3B** on the **BIRD** development set using different self-consistency voting sizes. Under **Vote@8**, our model attains an **execution accuracy (EX) of 61.05 %**. When the voting size increases to **Vote@32**, the EX further improves to **62.58 %**. These results confirm that larger voting ensembles enhance semantic robustness and execution stability while maintaining nearly the same inference cost due to our lightweight multi-agent design. Overall, **DatA-SQL** achieves competitive or superior accuracy compared with GPT-based pipelines at only a fraction of their computational expense.