Table-R1-SFT-8B / README.md
nielsr's picture
nielsr HF Staff
Add model card
8fc3aa6 verified
|
raw
history blame
802 Bytes
metadata
license: cc-by-4.0
library_name: transformers
pipeline_tag: table-question-answering

The Table-R1 model, as presented in the paper Table-R1: Inference-Time Scaling for Table Reasoning, explores inference-time scaling on table reasoning tasks. It uses distillation from frontier model reasoning traces and reinforcement learning with verifiable rewards (RLVR). The model matches or exceeds the performance of GPT-4.1 and DeepSeek-R1, while using only a 7B-parameter LLM.