---
license: mit
language:
- en
base_model:
- Qwen/Qwen3-4B-Instruct-2507
tags:
- text-to-sql
- ambiguity
- reinforcement-learning
- grpo
---

# IntentRL-Ambig-Text2SQL-4B

This model is trained to handle **ambiguous text-to-SQL requests** by explicitly reasoning about user intent and producing multiple interpretation–answer pairs rather than silently committing to a single interpretation.

It is based on [Qwen/Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507), fine-tuned with **RL (DAPO/GRPO)** using a custom reward that encourages recall (covering more valid interpretations) for ambiguous questions and precision for unambiguous ones.

## Example

Given a schema and an ambiguous question:

> **Schema:** `CREATE TABLE Jobs (JobID INTEGER PRIMARY KEY, Min_Years INTEGER, Pref_Years INTEGER, Position TEXT, Salary REAL);`
>
> **Question:** Show the required experience for the best-paid role.

The model produces multiple interpretation–answer pairs:

1. **Minimum years of experience required** → `SELECT Min_Years ...`
2. **Preferred years of experience** → `SELECT Pref_Years ...`
3. **Both minimum and preferred years** → `SELECT Min_Years, Pref_Years ...`

## Paper

[Reasoning about Intent for Ambiguous Requests](https://arxiv.org/abs/2511.10453)

**Authors:** Irina Saparina, Mirella Lapata

## Training Details

- **Base model:** Qwen3-4B-Instruct-2507
- **Method:** RL with DAPO/GRPO and a custom recall/precision reward
- **Training data:** [Ambrosia](https://ambrosia-benchmark.github.io/) text-to-SQL benchmark
- **Ambiguous examples** are upsampled to balance training

## Code

Training and evaluation code: [https://github.com/saparina/intentRL](https://github.com/saparina/intentRL)

## Citation
```bibtex
@misc{saparina2025reasoningintentambiguousrequests,
      title={Reasoning about Intent for Ambiguous Requests},
      author={Irina Saparina and Mirella Lapata},
      year={2025},
      eprint={2511.10453},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2511.10453},
}
```