Commit 3a36c7b · verified · 1 parent: 2ada5b5
Committed by thanhdathoang

Create README.md

Files changed (1)
  1. README.md +17 -0
README.md ADDED
@@ -0,0 +1,17 @@
+ ---
+ base_model:
+ - griffith-bigdata/Qwen-2.5-Coder-3B-SQL-Writer
+ ---
+
+ # FINER-SQL-3B-BIRD
+
+ Trained from [`griffith-bigdata/Qwen-2.5-Coder-3B-SQL-Writer`](https://huggingface.co/griffith-bigdata/Qwen-2.5-Coder-3B-SQL-Writer) using GRPO with two dense rewards from the FINER-SQL paper:
+
+ - 🧠 Memory Reward: aligns reasoning with verified traces
+ - ⚙️ Atomic Reward: measures operation-level SQL overlap
+
+ ✅ 67.5% execution accuracy (EX) on BIRD with a single 24 GB GPU
+
+ 📄 See other models: https://huggingface.co/collections/griffith-bigdata/finer-sql
+
+ 📄 GitHub: https://github.com/thanhdath/finer-sql/tree/main
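
The README describes the Atomic Reward only as measuring "operation-level SQL overlap". As a rough illustration of what such an overlap score could look like, here is a minimal sketch. It is not the FINER-SQL definition: the clause list, the `atomic_ops` and `atomic_overlap` helpers, and the choice of F1 as the overlap measure are all assumptions made for this example, and the regex-based splitting only handles flat queries without subqueries or keywords inside string literals.

```python
import re

# Hypothetical clause keywords used to segment a query (an assumption for
# this sketch, not taken from the FINER-SQL paper or repository).
CLAUSES = ["SELECT", "FROM", "WHERE", "GROUP BY", "HAVING", "ORDER BY", "LIMIT"]
_CLAUSE_RE = re.compile(
    r"\b(" + "|".join(k.replace(" ", r"\s+") for k in CLAUSES) + r")\b",
    re.IGNORECASE,
)


def atomic_ops(sql: str) -> set[tuple[str, str]]:
    """Split a flat SQL query into a set of (clause, item) pairs."""
    # Normalize whitespace and case so equivalent spellings compare equal.
    sql = re.sub(r"\s+", " ", sql.strip().rstrip(";")).lower()
    # With a capturing group, re.split returns
    # [prefix, keyword1, body1, keyword2, body2, ...].
    parts = _CLAUSE_RE.split(sql)
    ops = set()
    for clause, body in zip(parts[1::2], parts[2::2]):
        # Conditions are separated by AND/OR; other clauses by commas.
        sep = r"\s+and\s+|\s+or\s+" if clause in ("where", "having") else r","
        for item in re.split(sep, body):
            item = item.strip()
            if item:
                ops.add((clause, item))
    return ops


def atomic_overlap(pred_sql: str, gold_sql: str) -> float:
    """F1 overlap between the atomic operations of two queries (0.0 to 1.0)."""
    pred, gold = atomic_ops(pred_sql), atomic_ops(gold_sql)
    common = len(pred & gold)
    if common == 0:
        return 0.0
    precision = common / len(pred)
    recall = common / len(gold)
    return 2 * precision * recall / (precision + recall)


gold = "SELECT name, age FROM users WHERE age > 30 AND city = 'NY'"
pred = "SELECT name FROM users WHERE age > 30"
print(atomic_overlap(pred, gold))  # 3 of 5 gold operations matched -> 0.75
```

A score like this gives partial credit to a query that gets some clauses right, which is the sense in which a reward can be called dense, as opposed to a pass/fail signal from execution accuracy alone.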