# Trustworthy and Fair SkinGPT-R1 for Democratizing Dermatological Reasoning across Diverse Ethnicities
SkinGPT-R1-test is a dermatological reasoning vision-language model. 🩺✨
The Chinese University of Hong Kong, Shenzhen
## Updates

- We will soon release the SkinGPT-R1-7B weights.
## Overview

```
SkinGPT-R1-test/
├── checkpoints/
├── environment.yml
├── inference/
│   ├── full_precision/
│   └── int4_quantized/
├── requirements.txt
└── README.md
```
This repo provides full-precision inference, INT4 quantized inference, multi-turn chat, and FastAPI serving.
## Install

`environment.yml` defines the recommended Conda runtime environment for reproducible setup.
From scratch:

```bash
git clone https://huggingface.co/yuhos16/SkinGPT-R1-test
cd SkinGPT-R1-test
conda env create -f environment.yml
conda activate skingpt-r1
```
Manual setup:

```bash
git clone https://huggingface.co/yuhos16/SkinGPT-R1-test
cd SkinGPT-R1-test
conda create -n skingpt-r1 python=3.10.20 -y
conda activate skingpt-r1
pip install -r requirements.txt
```
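After either install path, a quick sanity check can confirm that the core packages resolved. This is a minimal sketch; the package names `torch` and `transformers` are assumptions based on a typical VLM inference stack — check `requirements.txt` for the actual dependency list.

```python
# Minimal post-install sanity check. The package names below are assumptions
# (typical VLM stack); consult requirements.txt for the real dependency list.
import importlib.util


def is_installed(pkg: str) -> bool:
    """Return True if `pkg` can be found on the current Python path."""
    return importlib.util.find_spec(pkg) is not None


if __name__ == "__main__":
    for pkg in ("torch", "transformers"):
        print(f"{pkg}: {'ok' if is_installed(pkg) else 'MISSING'}")
```

Run it inside the activated `skingpt-r1` environment; any `MISSING` line means the corresponding install step did not complete.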
## Quick Start
1. Use the repository `./checkpoints` directory as the model weights directory.
2. Prepare a test image, for example `./test_images/lesion.jpg`.
3. Run a first test.

Full precision:

```bash
bash inference/full_precision/run_infer.sh --image ./test_images/lesion.jpg
```

INT4:

```bash
bash inference/int4_quantized/run_infer.sh --image_path ./test_images/lesion.jpg
```
## Usage
| Mode | Full Precision | INT4 Quantized |
|---|---|---|
| Single image | `bash inference/full_precision/run_infer.sh --image ./test_images/lesion.jpg` | `bash inference/int4_quantized/run_infer.sh --image_path ./test_images/lesion.jpg` |
| Multi-turn chat | `bash inference/full_precision/run_chat.sh --image ./test_images/lesion.jpg` | `bash inference/int4_quantized/run_chat.sh --image ./test_images/lesion.jpg` |
| API service | `bash inference/full_precision/run_api.sh` | `bash inference/int4_quantized/run_api.sh` |
Default API ports:

- Full precision: `5900`
- INT4 quantized: `5901`
## Notes

- On multi-GPU servers, prepend commands with `CUDA_VISIBLE_DEVICES=0` if you want to pin one GPU.
- RTX 50 series GPUs should use the default `sdpa` path.
- A100 / RTX 3090 / RTX 4090 / H100 can also try `flash_attention_2` if their CUDA stack supports it.
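The notes above can be sketched in Python. The backend strings follow the Hugging Face `attn_implementation` convention; the GPU-name heuristic is purely illustrative and not code from this repository.

```python
# Sketch: pin one GPU and choose an attention backend per the notes above.
# The name-matching heuristic is illustrative; the actual scripts may differ.
import os

os.environ.setdefault("CUDA_VISIBLE_DEVICES", "0")  # pin the first GPU


def pick_attn_backend(gpu_name: str) -> str:
    """Map a GPU name to an attention implementation, per the README notes."""
    # A100 / RTX 3090 / RTX 4090 / H100 can try flash_attention_2 when the
    # CUDA stack supports it; everything else (incl. RTX 50 series) uses sdpa.
    if any(tag in gpu_name for tag in ("A100", "3090", "4090", "H100")):
        return "flash_attention_2"
    return "sdpa"
```

The returned string would typically be passed as `attn_implementation` when loading the model with Transformers.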
## API Endpoints

Both API services expose the same endpoints:

- `POST /v1/upload/{state_id}`
- `POST /v1/predict/{state_id}`
- `POST /v1/reset/{state_id}`
- `POST /diagnose/stream`
- `GET /health`
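As an illustration of the endpoint shapes, here is a hedged client sketch using only the standard library. The paths and default port come from this README; the JSON field names (`image`, `question`) are assumptions — the service's actual request schema may differ.

```python
# Hypothetical client sketch for the endpoints above (stdlib only).
# Endpoint paths come from the README; payload field names are assumptions.
import json
import urllib.request

BASE = "http://localhost:5900"  # full-precision service default port


def _post(path: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for the given endpoint."""
    return urllib.request.Request(
        f"{BASE}{path}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def upload_request(state_id: str, image_b64: str) -> urllib.request.Request:
    """POST /v1/upload/{state_id} — field name `image` is an assumption."""
    return _post(f"/v1/upload/{state_id}", {"image": image_b64})


def predict_request(state_id: str, question: str) -> urllib.request.Request:
    """POST /v1/predict/{state_id} — field name `question` is an assumption."""
    return _post(f"/v1/predict/{state_id}", {"question": question})
```

Against a running service, a request is then sent with `urllib.request.urlopen(upload_request("s1", b64))`.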
## Disclaimer

This project is for research and educational use only. It is not a substitute for professional medical advice, diagnosis, or treatment. ⚠️
## License
This repository is released under CC BY-NC-SA 4.0. See LICENSE for details.

