Model Embedding and vocab size mismatch.
#7
by pg20sanger - opened
The model's config.json (https://huggingface.co/InstaDeepAI/nucleotide-transformer-2.5b-multi-species/blob/main/config.json#L27) sets the vocab size to 4105, but vocab.txt contains 4107 tokens. Is this mismatch expected?
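For anyone who wants to reproduce the comparison, here is a minimal sketch. It assumes config.json and vocab.txt have already been downloaded into the working directory, and that vocab.txt stores one token per line (blank lines are ignored, which is an assumption about the file layout). The helper names are hypothetical and not part of any library.

```python
import json
from pathlib import Path


def count_vocab_tokens(vocab_path):
    """Count non-empty lines, assuming one token per line in the vocab file."""
    with open(vocab_path, encoding="utf-8") as handle:
        return sum(1 for line in handle if line.strip())


def config_vocab_size(config_path):
    """Read the `vocab_size` field from a model config.json."""
    with open(config_path, encoding="utf-8") as handle:
        return json.load(handle)["vocab_size"]


def vocab_size_difference(config_path, vocab_path):
    """Return (tokens in vocab file) minus (vocab_size in config)."""
    return count_vocab_tokens(vocab_path) - config_vocab_size(config_path)


if __name__ == "__main__":
    # Hypothetical local paths; adjust to wherever the files were saved.
    config_file = Path("config.json")
    vocab_file = Path("vocab.txt")
    if config_file.exists() and vocab_file.exists():
        print(vocab_size_difference(config_file, vocab_file))
```

A non-zero result means the two files disagree. With the numbers reported above, the difference would be 2.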