Instructions to use ncbi/Gene-R1-8B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use ncbi/Gene-R1-8B with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="ncbi/Gene-R1-8B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("ncbi/Gene-R1-8B")
model = AutoModelForCausalLM.from_pretrained("ncbi/Gene-R1-8B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use ncbi/Gene-R1-8B with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "ncbi/Gene-R1-8B"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ncbi/Gene-R1-8B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/ncbi/Gene-R1-8B

SGLang

How to use ncbi/Gene-R1-8B with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "ncbi/Gene-R1-8B" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ncbi/Gene-R1-8B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "ncbi/Gene-R1-8B" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ncbi/Gene-R1-8B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use ncbi/Gene-R1-8B with Docker Model Runner:
```
docker model run hf.co/ncbi/Gene-R1-8B
```

Gene-R1-8B

File size: 7,961 Bytes

---
license: other
library_name: transformers
pipeline_tag: text-generation
language:
- en
tags:
- gene-set-analysis
- biomedical
- reasoning
- llama
base_model:
- meta-llama/Llama-3.1-8B-Instruct
- meta-llama/Llama-3.2-1B-Instruct
- meta-llama/Llama-3.2-3B-Instruct
---

# Overview of Gene-R1

**Introduction**

- Gene-R1 is a data-augmented learning framework that equips lightweight and open-source LLMs with step-by-step reasoning capabilities tailored to the gene set analysis task. 
- It has been fine-tuned by ~270K gene sets collected from 16 genomic databases.
- Experimental results demonstrate that Gene-R1 achieves substantial performance gains, matching commercial LLMs.
- For more details, please check out our [paper](https://www.worldscientific.com/doi/abs/10.1142/9789819824755_0035) (PSB, 2026).

**Gene-R1 helps for gene set analysis through fine-tuned small language models (SLMs) that can be locally deployed.** 
The model contains three versions:
- [Gene-R1-8B](https://huggingface.co/ncbi/Gene-R1-8B): A version fine-tuned based on the Llama-3.1-8B-Instruct.
- [Gene-R1-1B](https://huggingface.co/ncbi/Gene-R1-1B): A version fine-tuned based on the Llama-3.2-1B-Instruct.
- [Gene-R1-3B](https://huggingface.co/ncbi/Gene-R1-3B): A version fine-tuned based on the Llama-3.2-3B-Instruct.


# Model Deployment for Private Gene Set Analysis

```python
  import transformers
  from transformers import AutoTokenizer, AutoModelForCausalLM

  model_id = "ncbi/Gene-R1-8B"
  tokenizer_test = AutoTokenizer.from_pretrained(
      model_id,
      token = "xxxxxxxxx" # Your access key of hugging face
  ) 
  model_test = AutoModelForCausalLM.from_pretrained(
      model_id,
      device_map = "auto",
      token = "xxxxxxxxx" # Your access key of hugging face
  )
  
  def complete_chat(system, prompt, model, tokenizer):
      model.generation_config.do_sample=False
      tokenized_chat = tokenizer('#SYSTEM: \n'+ system + '#USER: \n'+ prompt+' #Assistant: \n', return_tensors="pt").input_ids.to(model.device)
      outputs = model.generate(tokenized_chat, max_new_tokens=4000, temperature = 0) 
      return tokenizer.decode(outputs[0])
  
  system = "You are an efficient and insightful assistant to a molecular biologist."
  users = lambda genes: f"""
  Write a critical analysis of the biological processes performed by this system of interacting proteins.
  Base your analysis on prior knowledge available in your training data.
  After the analysis, propose a brief name for the most prominent biological process performed by the system.
  Place the name at the top of the analysis in the format: "Process: <name>".
  Be concise. Avoid unnecessary words.
  Use plain text only. Do not include format symbols such as asterisks, dashes, or bullets.
  Be specific. Avoid overly general statements such as "the proteins are involved in various cellular processes."
  Be factual. Do not include editorial opinions or unsupported claims.
  For each important point, clearly explain your reasoning and provide supporting information.
  For each identified biological function, specify the corresponding gene names.
  Here is the gene set: {genes}
  """

  def llama(genes):    
      genes = genes.replace("/",",").replace(" ",",")
      prompt = users(genes)
      summary =complete_chat(system, prompt, model_test, tokenizer_test)
      return summary

  if __name__ == "__main__":
    genes = "xxxxxxxxx" # Your private gene set that is separated by comma (,)!
    result = llama(genes)
    print(result)
```

The expected output looks like:
```
  Process: Pancreatic development and glucose homeostasis
  
  1. PDX1 is a homeodomain transcription factor involved in the specification of the early pancreatic epithelium and its subsequent differentiation. 
  It activates the transcription of several genes including insulin, somatostatin, glucokinase and glucose transporter type 2. 
  It is essential for maintenance of the normal hormone-producing phenotype in the pancreatic beta-cell. 
  In pancreatic acinar cells, forms a complex with PBX1b and MEIS2b and mediates the activation of the ELA1 enhancer.
  
  2. NKX6-1 is also a transcription factor involved in the development of pancreatic beta-cells during the secondary transition. 
  Together with NKX2-2 and IRX3, controls the generation of motor neurons in the neural tube and belongs to the neural progenitor 
  factors induced by Sonic Hedgehog (SHH) signals.
  
  3.GCG and GLP1, respectively glucagon and glucagon-like peptide 1, are involved in glucose metabolism and homeostasis. 
  GCG raises blood glucose levels by promoting gluconeogenesis and is the counter regulatory hormone of Insulin. 
  GLP1 is a potent stimulator of Glucose-Induced Insulin Secretion (GSIS). Plays roles in gastric motility and suppresses blood glucagon levels. 
  Promotes growth of the intestinal epithelium and pancreatic islet mass both by islet neogenesis and islet cell proliferation.
  
  4. SLC2A2, also known as GLUT2, is a facilitative hexose transporter. In hepatocytes, it mediates bi-directional transport of glucose accross the plasma membranes, 
  while in the pancreatic beta-cell, it is the main transporter responsible for glucose uptake and part of the cell's glucose-sensing mechanism. 
  It is involved in glucose transport in the small intestine and kidney too.
  
  To summarize, the genes in this set are involved in the specification, differentiation, growth and functionality of the pancreas, 
  with a particular emphasis on the pancreatic beta-cell. Particularly, the architecture of the pancreatic islet ensures proper glucose sensing 
  and homeostasis via a number of different hormones and receptors that can elicit both synergistic and antagonistic effects in the pancreas itself and other peripheral tissues.
```

⚠️ **Notice: The outputs sometimes are not following the instruction, you can try again if this case occurs.**

More details of model usage can be referred at our GitHub: [GitHub](https://github.com/ncbi-nlp/Gene-R1)

# Download statistics

Hugging Face tracks downloads automatically based on requests to model query files such as `config.json`. 
To ensure downloads are counted, please load the full models directly from the Hub using `transformers`:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "ncbi/Gene-R1-8B"
tokenizer = AutoTokenizer.from_pretrained(model_id, hf_token)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", hf_token)
```

# Acknowledgments

This research was supported in part by the Intramural Research Program of the National Institutes of Health (NIH). 
The contributions of the NIH authors are considered Works of the United States Government. 
The findings and conclusions presented in this paper are those of the authors and do not necessarily reflect the views of the NIH or the U.S. Department of Health and Human Services.

# Disclaimer

These models show the results of research conducted in the Computational Biology Branch, NCBI/NLM. 
The information produced on this website is not intended for direct diagnostic use or medical decision-making without review and oversight by a clinical professional. 
Individuals should not change their health behavior solely on the basis of information produced on this website. 
NIH does not independently verify the validity or utility of the information produced by this tool. 
If you have questions about the information produced on this website, please see a health care professional.
More information about NCBI's disclaimer policy is available.

# Citation

```bibtext
@inproceedings{wang2025gene,
  title={Gene-R1: Reasoning with Data-Augmented Lightweight LLMs for Gene Set Analysis},
  author={Wang, Zhizheng and Yang, Yifan and Jin, Qiao and Lu, Zhiyong},
  booktitle={Biocomputing 2026: Proceedings of the Pacific Symposium},
  pages={494--507},
  year={2025},
  organization={World Scientific}
}
```