How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="UCL-CSSB/PlasmidGPT")
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("UCL-CSSB/PlasmidGPT")
model = AutoModelForCausalLM.from_pretrained("UCL-CSSB/PlasmidGPT")
Quick Links

PlasmidGPT

A HuggingFace-compatible repackaging of PlasmidGPT (Shao, 2024) — a GPT-2-style decoder pretrained on 153k engineered plasmid sequences from Addgene. Loadable with standard AutoModelForCausalLM and AutoTokenizer. Used as the base for PlasmidGPT-SFT and PlasmidGPT-GRPO.

Quick start

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("UCL-CSSB/PlasmidGPT")
tokenizer = AutoTokenizer.from_pretrained("UCL-CSSB/PlasmidGPT")

input_ids = tokenizer("ATG", return_tensors="pt").input_ids
outputs = model.generate(input_ids, max_new_tokens=512, do_sample=True, temperature=1.0)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Citation

@article{shao2024plasmidgpt,
  title   = {{PlasmidGPT}: a generative framework for plasmid design and annotation},
  author  = {Shao, Bin},
  journal = {bioRxiv},
  year    = {2024},
  doi     = {10.1101/2024.09.30.615762}
}
Downloads last month
125
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for UCL-CSSB/PlasmidGPT

Finetunes
2 models

Collection including UCL-CSSB/PlasmidGPT