Instructions to use QuantFactory/TableLLM-13b-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use QuantFactory/TableLLM-13b-GGUF with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("table-question-answering", model="QuantFactory/TableLLM-13b-GGUF")

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("QuantFactory/TableLLM-13b-GGUF", dtype="auto")

llama-cpp-python

How to use QuantFactory/TableLLM-13b-GGUF with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="QuantFactory/TableLLM-13b-GGUF",
	filename="TableLLM-13b.Q2_K.gguf",
)

output = llm(
	"Once upon a time,",
	max_tokens=512,
	echo=True
)
print(output)

Notebooks
Google Colab
Kaggle
Local Apps Settings

llama.cpp

How to use QuantFactory/TableLLM-13b-GGUF with llama.cpp:

Install (macOS, Linux)

curl -LsSf https://llama.app/install.sh | sh
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf QuantFactory/TableLLM-13b-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama cli -hf QuantFactory/TableLLM-13b-GGUF:Q4_K_M

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf QuantFactory/TableLLM-13b-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama cli -hf QuantFactory/TableLLM-13b-GGUF:Q4_K_M

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf QuantFactory/TableLLM-13b-GGUF:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf QuantFactory/TableLLM-13b-GGUF:Q4_K_M

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf QuantFactory/TableLLM-13b-GGUF:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf QuantFactory/TableLLM-13b-GGUF:Q4_K_M

Use Docker

docker model run hf.co/QuantFactory/TableLLM-13b-GGUF:Q4_K_M

LM Studio
Jan
Ollama
How to use QuantFactory/TableLLM-13b-GGUF with Ollama:
```
ollama run hf.co/QuantFactory/TableLLM-13b-GGUF:Q4_K_M
```

Unsloth Studio

How to use QuantFactory/TableLLM-13b-GGUF with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for QuantFactory/TableLLM-13b-GGUF to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for QuantFactory/TableLLM-13b-GGUF to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for QuantFactory/TableLLM-13b-GGUF to start chatting

Atomic Chat new
Docker Model Runner
How to use QuantFactory/TableLLM-13b-GGUF with Docker Model Runner:
```
docker model run hf.co/QuantFactory/TableLLM-13b-GGUF:Q4_K_M
```

Lemonade

How to use QuantFactory/TableLLM-13b-GGUF with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull QuantFactory/TableLLM-13b-GGUF:Q4_K_M

Run and chat with the model

lemonade run user.TableLLM-13b-GGUF-Q4_K_M

List all available models

lemonade list

aashish1904

nielsr HF Staff commited on Mar 6, 2025

Commit

462bbbe

verified ·

1 Parent(s): 6444303

Update model card metadata: pipeline tag, license, and add Github link (#1)

Browse files

- Update model card metadata: pipeline tag, license, and add Github link (4d9ccaded7954c2ffa58f71318d8d85370a8dfd1)

Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1) hide show

README.md +8 -12

README.md CHANGED Viewed

@@ -1,10 +1,8 @@
 ---
-pipeline_tag: text-generation
 base_model: TableLLM-13b
 library_name: transformers
 ---
 [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
@@ -17,7 +15,6 @@ This is quantized version of [RUCKBReasoning/TableLLM-13b](https://huggingface.c
 ---
-license: llama2
 datasets:
 - RUCKBReasoning/TableLLM-SFT
 language:
@@ -29,7 +26,7 @@ tags:
 ---
-[![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
 # QuantFactory/TableLLM-13b-GGUF
@@ -52,9 +49,9 @@ We evaluate the code solution generation ability of TableLLM on three benchmarks
 | Model                | WikiTQ | TAT-QA | FeTaQA |  OTTQA  | WikiSQL | Spider | Self-created | Average |
 | :------------------- | :----: | :----: | :----: | :-----: | :-----: | :----: | :----------: | :-----: |
 | TaPEX                |  38.5  |    –   |    –   |    –    |   83.9  |  15.0  |       /      |   45.8  |
-| TaPas                |  31.5  |    –   |    –   |    –    |   74.2  |  23.1  |       /      |   42.92 |
 | TableLlama           |  24.0  |  22.2  |  20.5  |   6.4   |   43.7  |   9.0  |       /      |   20.7  |
-| GPT3.5               |  58.5  |<ins>72.1</ins>|  71.2  |  60.8   |   81.7   |  67.4  | 77.1 |   69.8  |
 | GPT4                 |**74.1**|**77.1**|**78.4**|**69.5** |   84.0  |  69.5  |     77.8     | **75.8**|
 | Llama2-Chat (13B)    |  48.8  |  49.6  |  67.7  |  61.5   |    –    |    –   |       –      |   56.9  |
 | CodeLlama (13B)      |  43.4  |  47.2  |  57.2  |  49.7   |   38.3  |  21.9  |     47.6     |   43.6  |
@@ -62,8 +59,8 @@ We evaluate the code solution generation ability of TableLLM on three benchmarks
 | StructGPT (GPT3.5)   |  52.5  |  27.5  |  11.8  |  14.0   |   67.8  |**84.8**|       /      |   48.9  |
 | Binder (GPT3.5)      |  61.6  |  12.8  |   6.8  |   5.1   |   78.6  |  52.6  |       /      |   42.5  |
 | DATER (GPT3.5)       |  53.4  |  28.4  |  18.3  |  13.0   |   58.2  |  26.5  |       /      |   37.0  |
-| TableLLM-7B (Ours)   |  58.8  |  66.9  |  72.6  |<ins>63.1</ins>|<ins>86.6</ins>|  82.6  |<ins>78.8</ins>|   72.8  |
-| TableLLM-13B (Ours)  |<ins>62.4</ins>|  68.2  |<ins>74.5</ins>|  62.5   | **90.7**|<ins>83.4</ins>|   **80.8**   |<ins>74.7</ins>|
 ## Prompt Template
 The prompts we used for generating code solutions and text answers are introduced below.
@@ -120,5 +117,4 @@ The prompt template for direct text answer generation on short tables.
 ### [Solution][INST/]
 ````
-For more details about how to use TableLLM, please refer to our GitHub page: <https://github.com/TableLLM/TableLLM>

 ---
 base_model: TableLLM-13b
 library_name: transformers
+pipeline_tag: table-question-answering
+license: llama2
 ---
 [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
 ---
 datasets:
 - RUCKBReasoning/TableLLM-SFT
 language:
 ---
+[![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44UcQKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
 # QuantFactory/TableLLM-13b-GGUF
 | Model                | WikiTQ | TAT-QA | FeTaQA |  OTTQA  | WikiSQL | Spider | Self-created | Average |
 | :------------------- | :----: | :----: | :----: | :-----: | :-----: | :----: | :----------: | :-----: |
 | TaPEX                |  38.5  |    –   |    –   |    –    |   83.9  |  15.0  |       /      |   45.8  |
+| TaPas                |  31.5  |    –   |    –    |   74.2  |  23.1  |       /      |   42.92 |
 | TableLlama           |  24.0  |  22.2  |  20.5  |   6.4   |   43.7  |   9.0  |       /      |   20.7  |
+| GPT3.5               |  58.5  | 72.1  |  71.2  |  60.8   |   81.7   |  67.4  | 77.1 |   69.8  |
 | GPT4                 |**74.1**|**77.1**|**78.4**|**69.5** |   84.0  |  69.5  |     77.8     | **75.8**|
 | Llama2-Chat (13B)    |  48.8  |  49.6  |  67.7  |  61.5   |    –    |    –   |       –      |   56.9  |
 | CodeLlama (13B)      |  43.4  |  47.2  |  57.2  |  49.7   |   38.3  |  21.9  |     47.6     |   43.6  |
 | StructGPT (GPT3.5)   |  52.5  |  27.5  |  11.8  |  14.0   |   67.8  |**84.8**|       /      |   48.9  |
 | Binder (GPT3.5)      |  61.6  |  12.8  |   6.8  |   5.1   |   78.6  |  52.6  |       /      |   42.5  |
 | DATER (GPT3.5)       |  53.4  |  28.4  |  18.3  |  13.0   |   58.2  |  26.5  |       /      |   37.0  |
+| TableLLM-7B (Ours)   |  58.8  |  66.9  |  72.6  | 63.1 | 86.6|  82.6  | 78.8|   72.8  |
+| TableLLM-13B (Ours)  | 62.4|  68.2  | 74.5|  62.5   | **90.7**| 83.4|   **80.8**   | 74.7|
 ## Prompt Template
 The prompts we used for generating code solutions and text answers are introduced below.
 ### [Solution][INST/]
 ````
+For more details about how to use TableLLM, please refer to our GitHub page: <https://github.com/TableLLM/TableLLM>