Instructions to use TIGER-Lab/StructLM-7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use TIGER-Lab/StructLM-7B with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="TIGER-Lab/StructLM-7B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("TIGER-Lab/StructLM-7B")
model = AutoModelForCausalLM.from_pretrained("TIGER-Lab/StructLM-7B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use TIGER-Lab/StructLM-7B with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "TIGER-Lab/StructLM-7B"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "TIGER-Lab/StructLM-7B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/TIGER-Lab/StructLM-7B

SGLang

How to use TIGER-Lab/StructLM-7B with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "TIGER-Lab/StructLM-7B" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "TIGER-Lab/StructLM-7B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "TIGER-Lab/StructLM-7B" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "TIGER-Lab/StructLM-7B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use TIGER-Lab/StructLM-7B with Docker Model Runner:
```
docker model run hf.co/TIGER-Lab/StructLM-7B
```

azhx commited on Feb 26, 2024

Commit

a92109d

verified ·

1 Parent(s): d6b7884

Update README.md

Browse files

Files changed (1) hide show

README.md +30 -31

README.md CHANGED Viewed

@@ -5,7 +5,9 @@ datasets:
 language:
 - en
 ---
-# StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
 Project Page: [https://tiger-ai-lab.github.io/StructLM/](https://tiger-ai-lab.github.io/StructLM/)
@@ -14,15 +16,17 @@ Paper: Arxiv link not yet announced
 Code: [https://github.com/TIGER-AI-Lab/StructLM](https://github.com/TIGER-AI-Lab/StructLM)
-## Introduction
-StructLM, is a series of open-source large language models (LLMs) finetuned for structured knowledge grounding (SKG) tasks.
-We release 3 models:
-|-----|---------------------------------------------------------------|
-| 7B  | [StructLM-7B](https://huggingface.co/TIGER-Lab/StructLM-7B)   |
-| 13B | [StructLM-13B](https://huggingface.co/TIGER-Lab/StructLM-13B) |
-| 34B | [StructLM-34B](https://huggingface.co/TIGER-Lab/StructLM-34B) |
 ## Training Data
@@ -33,29 +37,24 @@ These models are trained on 🤗 [SKGInstruct Dataset](https://huggingface.co/da
 The models are fine-tuned with CodeLlama-Instruct-hf models as base models. Each model is trained for 3 epochs, and the best checkpoint is selected.
 ## Evaluation
-The models are evaluated using open-ended and multiple-choice math problems from several datasets. Here are the results:
-| **Model**             	| **Decoding** 	| **GSM**  	| **MATH** 	| **AQuA** 	| **NumG** 	| **SVA**  	| **Mat**  	| **Sim**  	| **SAT**  	| **MMLU** 	| **AVG**  	|
-|-----------------------|--------------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|
-| **MAmmoTH-7B**        	| CoT          	| 50.5     	| 10.4     	| 43.7     	| 44.0     	| 47.3     	| 9.2      	| 18.9     	| 32.7     	| 39.9     	| 33.0     	|
-|                       	| PoT          	| 51.6     	| 28.7     	| 43.3     	| 52.3     	| 65.1     	| 41.9     	| 48.2     	| 39.1     	| 44.6     	| 46.1     	|
-|                       	| **Hybrid**   	| **53.6** 	| **31.5** 	| **44.5** 	| **61.2** 	| **67.7** 	| **46.3** 	| **41.2** 	| **42.7** 	| **42.6** 	| **47.9** 	|
-| **MAmmoTH-Coder-7B**  	| CoT          	| 22.4     	| 7.9      	| 36.2     	| 36.0     	| 37.0     	| 8.2      	| 7.2      	| 32.7     	| 34.6     	| 24.7     	|
-|                       	| PoT          	| 58.8     	| 32.1     	| 47.2     	| 57.1     	| 71.1     	| 53.9     	| 44.6     	| 40.0     	| 47.8     	| 50.3     	|
-|                       	| **Hybrid**   	| **59.4** 	| **33.4** 	| **47.2** 	| **66.4** 	| **71.4** 	| **55.4** 	| **45.9** 	| **40.5** 	| **48.3** 	| **52.0** 	|
-| **MAmmoTH-13B**       	| CoT          	| 56.3     	| 12.9     	| 45.3     	| 45.6     	| 53.8     	| 11.7     	| 22.4     	| 43.6     	| 42.3     	| 37.1     	|
-|                       	| PoT          	| 61.3     	| 32.6     	| 48.8     	| 59.6     	| 72.2     	| 48.5     	| 40.3     	| 46.8     	| 45.4     	| 50.6     	|
-|                       	| **Hybrid**   	| **62.0** 	| **34.2** 	| **51.6** 	| **68.7** 	| **72.4** 	| **49.2** 	| **43.2** 	| **46.8** 	| **47.6** 	| **52.9** 	|
-| **MAmmoTH-Coder-13B** 	| CoT          	| 32.1     	| 10.2     	| 40.6     	| 36.2     	| 43.0     	| 9.6      	| 10.1     	| 40.9     	| 36.6     	| 28.8     	|
-|                       	| PoT          	| 64.3     	| 35.2     	| 46.8     	| 54.2     	| 73.2     	| 60.0     	| 44.2     	| 48.2     	| 48.2     	| 52.7     	|
-|                       	| **Hybrid**   	| **64.7** 	| **36.3** 	| **46.9** 	| **66.8** 	| **73.7** 	| **61.5** 	| **47.1** 	| **48.6** 	| **48.3** 	| **54.9** 	|
-| **MAmmoTH-Coder-33B** 	| CoT          	| 34.3     	| 11.6     	| 39.0     	| 36.2     	| 44.6     	| 10.8     	| 10.9     	| 46.4     	| 42.9     	| 30.7     	|
-|                       	| PoT          	| 72.3     	| 42.8     	| 53.8     	| 59.6     	| 84.0     	| 64.7     	| 50.6     	| 58.6     	| 52.7     	| 59.9     	|
-|                       	| **Hybrid**   	| **72.7** 	| **43.6** 	| **54.7** 	| **71.6** 	| **84.3** 	| **65.4** 	| **51.8** 	| **60.9** 	| **53.8** 	| **62.1** 	|
-| **MAmmoTH-70B**       	| CoT          	| 72.4     	| 21.1     	| 57.9     	| 58.9     	| 71.6     	| 20.0     	| 31.9     	| 57.3     	| 52.1     	| 49.2     	|
-|                       	| PoT          	| 76.7     	| 40.1     	| 60.2     	| 64.3     	| 81.7     	| 55.3     	| 45.3     	| 64.1     	| 53.5     	| 60.1     	|
-|                       	| **Hybrid**   	| **76.9** 	| **41.8** 	| **65.0** 	| **74.4** 	| **82.4** 	| **55.6** 	| **51.4** 	| **66.4** 	| **56.7** 	| **63.4** 	|
 ## Usage
 You can use the models through Huggingface's Transformers library.

 language:
 - en
 ---
+# 🏗️ StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
 Project Page: [https://tiger-ai-lab.github.io/StructLM/](https://tiger-ai-lab.github.io/StructLM/)
 Code: [https://github.com/TIGER-AI-Lab/StructLM](https://github.com/TIGER-AI-Lab/StructLM)
+![Alt text](https://raw.githubusercontent.com/TIGER-AI-Lab/StructLM/gh-pages/static/images/thumbnail.drawio%20(1).png)
+## Introduction
+StructLM, is a series of open-source large language models (LLMs) finetuned for structured knowledge grounding (SKG) tasks. We release 3 models:
+ 7B  | [StructLM-7B](https://huggingface.co/TIGER-Lab/StructLM-7B)
+ 13B | [StructLM-13B](https://huggingface.co/TIGER-Lab/StructLM-13B)
+ 34B | [StructLM-34B](https://huggingface.co/TIGER-Lab/StructLM-34B)
 ## Training Data
 The models are fine-tuned with CodeLlama-Instruct-hf models as base models. Each model is trained for 3 epochs, and the best checkpoint is selected.
 ## Evaluation
+Here are a subset of model evaluation results:
+### Held in
+| **Model**             	| **ToTTo** 	| **GrailQA**  	| **CompWebQ** 	| **MMQA** 	| **Feverous** 	| **Spider**  	| **TabFact**  	| **Dart**  	|
+|-----------------------|--------------|----------|----------|----------|----------|----------|----------|----------|
+| **StructLM-7B**        	| 49.4          	| 80.4     	| 78.3  	| 85.2     	| 84.4     	| 72.4     	| 80.8      	| 62.2     	|
+| **StructLM-13B**  	| 49.3          	| 79.2    	| 80.4  	| 86.0     	| 85.0     	| 74.1     	| 84.7     	| 61.4      	|
+| **StructLM-34B**       	| 50.2          	| 82.2     	| 81.9 	| 88.1     	| 85.7     	| 74.6     	| 86.6     	| 61.8     	|
+### Held out
+| **Model**             	| **BIRD** 	| **InfoTabs**  	| **FinQA** 	| **SQA** 	|
+|-----------------------|--------------|----------|----------|----------|
+| **StructLM-7B**        	| 22.3          	| 55.3     	| 27.3     	| 49.7     	|
+| **StructLM-13B**  	| 22.8          	| 58.1     	| 25.6      	| 36.1     	|
+| **StructLM-34B**       	| 24.7          	| 61.8     	| 36.2     	| 44.2     	|
 ## Usage
 You can use the models through Huggingface's Transformers library.