Instructions to use lgy0404/MemGUI-8B-SFT with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use lgy0404/MemGUI-8B-SFT with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="lgy0404/MemGUI-8B-SFT")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM

processor = AutoProcessor.from_pretrained("lgy0404/MemGUI-8B-SFT")
model = AutoModelForMultimodalLM.from_pretrained("lgy0404/MemGUI-8B-SFT")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use lgy0404/MemGUI-8B-SFT with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "lgy0404/MemGUI-8B-SFT"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "lgy0404/MemGUI-8B-SFT",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker

docker model run hf.co/lgy0404/MemGUI-8B-SFT

SGLang

How to use lgy0404/MemGUI-8B-SFT with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "lgy0404/MemGUI-8B-SFT" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "lgy0404/MemGUI-8B-SFT",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "lgy0404/MemGUI-8B-SFT" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "lgy0404/MemGUI-8B-SFT",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Docker Model Runner
How to use lgy0404/MemGUI-8B-SFT with Docker Model Runner:
```
docker model run hf.co/lgy0404/MemGUI-8B-SFT
```

MemGUI-8B-SFT / Modelfile

lgy0404

Upload folder using huggingface_hub

54735ea verified 24 days ago

Raw

History Blame Contribute Delete

3.87 kB

	# Generated at 2026-05-08T16:38:42+08:00

	# Model name
	NAME checkpoint-453-merged

	# Model architecture (Generated from transformers_version in config.json)
	ARCH transformer

	# Model family (Generated from model_type in config.json)
	FAMILY qwen3_vl

	# Config files (Generated from the files in the workspace directory)
	# Supported file types: .json, .jsonl, .json5, .jsonc, .yaml, .yml, .toml, .ini, .config, .cfg, .conf, .properties, .props, .prop, .xml, .xsd, .rng, .modelcard, .meta, tokenizer.model, config.json., .hparams, .params, .hyperparams, .wandb, .mlflow, .tensorboard
	CONFIG added_tokens.json
	CONFIG args.json
	CONFIG config.json
	CONFIG generation_config.json
	CONFIG model.safetensors.index.json
	CONFIG preprocessor_config.json
	CONFIG special_tokens_map.json
	CONFIG tokenizer.json
	CONFIG tokenizer_config.json
	CONFIG video_preprocessor_config.json
	CONFIG vocab.json

	# Code files (Generated from the files in the workspace directory)
	# Supported file types: .py, .ipynb, .sh, .patch, .c, .h, .hxx, .cpp, .cc, .cxx, .c++, .hpp, .hh, .h++, .java, .js, .mjs, .cjs, .jsx, .ts, .tsx, .go, .rs, .swift, .rb, .php, .scala, .kt, .kts, .r, .R, .m, .mm, .f, .f90, .f95, .f03, .f08, .jl, .lua, .pl, .pm, .cs, .vb, .dart, .groovy, .elm, .erl, .hrl, .ex, .exs, .hs, .lhs, .clj, .cljs, .cljc, .cl, .lisp, .lsp, .scm, .ss, .rkt, .sql, .psql, .mysql, .sqlite, .zig, .cu, .cuh, .bash, .zsh, .fish, .csh, .tcsh, .ksh, .ps1, .psm1, .psd1, .bat, .cmd, .vbs, .wsf, .applescript, .scpt, .awk, .sed, .expect, .env, .env., .env, Makefile, .dockerfile, Dockerfile, .mk, .cmake, CMakeLists.txt, .gradle, .gradle.kts, build.gradle, settings.gradle, .sbt, .mill, .bazel, .bzl, BUILD, WORKSPACE, .buck, BUCK, .ninja, .gyp, .gypi, .waf, wscript, package.json, package-lock.json, yarn.lock, pnpm-lock.yaml, requirements.txt, Pipfile, pyproject.toml, setup.cfg, tox.ini, poetry.lock, Cargo.toml, Cargo.lock, go.mod, go.sum, composer.json, composer.lock, Gemfile, .gemspec, mix.exs, mix.lock, rebar.config, rebar.lock, .so, .dll, .dylib, .lib, .a
	CODE chat_template.jinja

	# Model files (Generated from the files in the workspace directory)
	# Supported file types: .safetensors, .bin, .pt, .pth, .mar, .pte, .pt2, .ptl, .tflite, .h5, .hdf, .hdf5, .pb, .meta, .data-, .index, .gguf, .ggml, .ggmf, .ggjt, .q4_0, .q4_1, .q5_0, .q5_1, .q8_0, .f16, .f32, .ckpt, .checkpoint, .dist_ckpt, .tensor, .weights, .state, .embedding, .vocab, .ot, .engine, .trt, .onnx, .msgpack, .model, .pkl, .pickle, .keras, .joblib, .npy, .npz, .nc, .mlmodel, .coreml, .mleap, .surml, .llamafile, .caffemodel, .prototxt, .dlc, .circle, *.nb
	MODEL model-00001-of-00004.safetensors
	MODEL model-00002-of-00004.safetensors
	MODEL model-00003-of-00004.safetensors
	MODEL model-00004-of-00004.safetensors

	# Documentation files (Generated from the files in the workspace directory)
	# Supported file types: .txt, .md, .pdf, LICENSE, README, SETUP, requirements, .log, .doc, .docx, .docm, .dot, .dotx, .dotm, .rtf, .odt, .ott, .fodt, .pages, .wpd, .xls, .xlsx, .xlsm, .xlsb, .xlt, .xltx, .xltm, .ods, .ots, .fods, .numbers, .csv, .ppt, .pptx, .pptm, .pps, .ppsx, .ppsm, .pot, .potx, .potm, .odp, .otp, .fodp, .key, .epub, .mobi, .azw, .azw3, .fb2, .fb3, .lit, .pdb, .djvu, .djv, .html, .htm, .xhtml, .mhtml, .mht, .xml, .xsl, .xslt, .tex, .latex, .ltx, .bib, .rst, .asciidoc, .adoc, .textile, .wiki, .mediawiki, .org, .texi, .texinfo, .info, .man, .chm, .hlp, .xps, .jpg, .jpeg, .png, .gif, .bmp, .tiff, .ico, .webp, .heic, .heif, .hevc, .svg, .mp4, .mov, .avi, .mkv, .webm, .m4v, .flv, .wmv, .mpg, .mpeg
	DOC merges.txt