Instructions to use jubba/nano_nextgpt with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use jubba/nano_nextgpt with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="jubba/nano_nextgpt")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("jubba/nano_nextgpt") model = AutoModelForCausalLM.from_pretrained("jubba/nano_nextgpt") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use jubba/nano_nextgpt with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "jubba/nano_nextgpt" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "jubba/nano_nextgpt", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/jubba/nano_nextgpt
- SGLang
How to use jubba/nano_nextgpt with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "jubba/nano_nextgpt" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "jubba/nano_nextgpt", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "jubba/nano_nextgpt" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "jubba/nano_nextgpt", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use jubba/nano_nextgpt with Docker Model Runner:
docker model run hf.co/jubba/nano_nextgpt
nano nextgpt Weights
This repository contains the weights for nano nextgpt, a minimalist re-implementation of NextGPT in the style of Andrej Karpathy's nanoGPT. The project is based on the NextGPT architecture, which is detailed here: NextGPT.
Repository for nano nextgpt
You can find the main repository and source code for nano nextgpt here: nano nextgpt GitHub Repository.
About nano nextgpt
nano nextgpt is a stripped-down version of NextGPT, focusing solely on image and text processing, omitting the video and audio processing capabilities. The model underwent two primary stages of training:
Linear Layer Training: This involved mapping ImageBind embeddings onto the LLM (Large Language Model) embedding space. The training dataset comprised 20,000 image-text pairs sourced from COCO 3m.
Instruction Tuning: This stage involved training the entire model, including the linear layer and LLM, end-to-end. This was done using qlora and peft techniques, with a dataset containing 80,000 image-text pairs in a conversational format, taken from the Llava project.
Usage
For detailed usage instructions, including how to integrate these weights into your applications, please refer to the nano next gpt GitHub repository.
Please note that this README is for the weights of the nano nextgpt model. For more information on the model architecture, training procedures, or any other inquiries, refer to the main nano nextgpt repository linked above.
- Downloads last month
- 12