--- license: mit base_model: microsoft/Phi-3-small-8k-instruct datasets: - dnaihao/Table-Instructs language: - en library_name: transformers pipeline_tag: text-generation tags: - table-understanding - instruction-tuning - replication - tabular-data --- # phi-3-tablebench Replication of [**TableBenchLLM**](https://arxiv.org/abs/2408.09174), trained from [**Phi-3-small-8k-instruct**](https://huggingface.co/microsoft/Phi-3-small-8k-instruct) on the corresponding instruction-tuning corpus. Released as part of the EACL 2026 Findings paper *"What Really Matters for Table LLMs? A Meta-Evaluation of Model and Data Effects"* (Deng et al., 2026). The paper instruction-tunes three 7B foundation models (Mistral-v0.3, OLMo, Phi-3) on four existing training corpora (TableLlama, TableLLM, TableBench, TableGPT) to disentangle the contributions of base model versus training data, finding that **base model choice plays a more dominant role than the training data itself**. - 📄 Paper: [aclanthology.org/2026.findings-eacl.195](https://aclanthology.org/2026.findings-eacl.195/) - 💻 Code & eval scripts: [github.com/dnaihao/table-sft-eacl-2026](https://github.com/dnaihao/table-sft-eacl-2026) - 🤗 All replicated models: [collection](https://huggingface.co/collections/dnaihao/table-llms) ## Training | | | |---|---| | Base model | [`microsoft/Phi-3-small-8k-instruct`](https://huggingface.co/microsoft/Phi-3-small-8k-instruct) | | Training corpus | `tablebench_train.json` from [`dnaihao/Table-Instructs`](https://huggingface.co/datasets/dnaihao/Table-Instructs) | | Method | Full SFT via [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory) | | Learning rate | 5e-7 | Full hyperparameter sweep, ablations, and per-benchmark numbers are reported in the paper. ## Evaluation Per-`{model, benchmark}` eval scripts and parsed metrics are available at [github.com/dnaihao/table-sft-eacl-2026/tree/main/eval/phi-3-tablebench](https://github.com/dnaihao/table-sft-eacl-2026/tree/main/eval/phi-3-tablebench). Raw model outputs (`generated_predictions.jsonl`) are released as the dataset [`dnaihao/table-sft-eval-predictions`](https://huggingface.co/datasets/dnaihao/table-sft-eval-predictions). ## Usage ```python from transformers import AutoModelForCausalLM, AutoTokenizer tokenizer = AutoTokenizer.from_pretrained("dnaihao/phi-3-tablebench") model = AutoModelForCausalLM.from_pretrained( "dnaihao/phi-3-tablebench", torch_dtype="auto", device_map="auto", ) ``` ## License This model inherits the license of its base model ([`microsoft/Phi-3-small-8k-instruct`](https://huggingface.co/microsoft/Phi-3-small-8k-instruct): mit). ## Citation ```bibtex @inproceedings{deng-etal-2026-really, title = "What Really Matters for Table {LLM}s? A Meta-Evaluation of Model and Data Effects", author = "Deng, Naihao and Zhang, Sheng and Zhu, Henghui and Chang, Shuaichen and Zhang, Jiani and Li, Alexander Hanbo and Hang, Chung-Wei and Kobayashi, Hideo and Hu, Yiqun and Ng, Patrick", booktitle = "Findings of the Association for Computational Linguistics: EACL 2026", year = "2026", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2026.findings-eacl.195/", doi = "10.18653/v1/2026.findings-eacl.195" } ```