Image-Text-to-Text
Transformers
Safetensors
English
qwen3_vl
gui-agent
mobile-gui
android
memory
context-management
conact
memgui-agent
long-horizon
conversational
Eval Results (legacy)
Instructions to use lgy0404/MemGUI-8B-SFT with libraries, inference providers, notebooks, and local apps.
- Libraries
- Transformers
How to use lgy0404/MemGUI-8B-SFT with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="lgy0404/MemGUI-8B-SFT")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)
```

```python
# Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM

processor = AutoProcessor.from_pretrained("lgy0404/MemGUI-8B-SFT")
model = AutoModelForMultimodalLM.from_pretrained("lgy0404/MemGUI-8B-SFT")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
```
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use lgy0404/MemGUI-8B-SFT with vLLM:
Install from pip and serve model
```bash
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "lgy0404/MemGUI-8B-SFT"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "lgy0404/MemGUI-8B-SFT",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "Describe this image in one sentence."
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
            }
          }
        ]
      }
    ]
  }'
```
Use Docker
```bash
docker model run hf.co/lgy0404/MemGUI-8B-SFT
```
- SGLang
How to use lgy0404/MemGUI-8B-SFT with SGLang:
Install from pip and serve model
```bash
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "lgy0404/MemGUI-8B-SFT" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "lgy0404/MemGUI-8B-SFT",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "Describe this image in one sentence."
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
            }
          }
        ]
      }
    ]
  }'
```
Use Docker images
```bash
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "lgy0404/MemGUI-8B-SFT" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "lgy0404/MemGUI-8B-SFT",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "Describe this image in one sentence."
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
            }
          }
        ]
      }
    ]
  }'
```
- Docker Model Runner
How to use lgy0404/MemGUI-8B-SFT with Docker Model Runner:
```bash
docker model run hf.co/lgy0404/MemGUI-8B-SFT
```
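The curl calls shown for vLLM and SGLang can also be issued from Python. The following is a minimal sketch using only the standard library. It assumes the server accepts a base64 `data:` URL in the `image_url` field, which is worth verifying against the server version in use. The helper names `build_chat_payload`, `post_chat`, and `describe_screenshot`, as well as the default prompt, are hypothetical and not part of the model card.

```python
import base64
import json
import urllib.request


def build_chat_payload(model: str, prompt: str, image_bytes: bytes,
                       mime: str = "image/png") -> dict:
    """Build an OpenAI-style chat payload carrying one inline image."""
    # Assumption: the server accepts base64 data URLs for image_url.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    data_url = f"data:{mime};base64,{encoded}"
    return {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": data_url}},
                ],
            }
        ],
    }


def post_chat(base_url: str, payload: dict, timeout: float = 60.0) -> dict:
    """POST the payload to <base_url>/v1/chat/completions and parse the JSON reply."""
    request = urllib.request.Request(
        base_url.rstrip("/") + "/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


def describe_screenshot(path: str, base_url: str = "http://localhost:8000") -> str:
    """Send a local screenshot to a running server and return the reply text."""
    with open(path, "rb") as handle:
        payload = build_chat_payload(
            "lgy0404/MemGUI-8B-SFT",
            "Describe this image in one sentence.",
            handle.read(),
        )
    reply = post_chat(base_url, payload)
    return reply["choices"][0]["message"]["content"]
```

With a vLLM server running on its default port, `describe_screenshot("screenshot.png")` would send a hypothetical local file; for the SGLang commands above, pass `base_url="http://localhost:30000"` instead.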
```
# Generated at 2026-05-08T16:38:42+08:00
# Model name
NAME checkpoint-453-merged
# Model architecture (Generated from transformers_version in config.json)
ARCH transformer
# Model family (Generated from model_type in config.json)
FAMILY qwen3_vl
# Config files (Generated from the files in the workspace directory)
# Supported file types: *.json, *.jsonl, *.json5, *.jsonc, *.yaml, *.yml, *.toml, *.ini, *.config, *.cfg, *.conf, *.properties, *.props, *.prop, *.xml, *.xsd, *.rng, *.modelcard, *.meta, *tokenizer.model*, config.json.*, *.hparams, *.params, *.hyperparams, *.wandb, *.mlflow, *.tensorboard
CONFIG added_tokens.json
CONFIG args.json
CONFIG config.json
CONFIG generation_config.json
CONFIG model.safetensors.index.json
CONFIG preprocessor_config.json
CONFIG special_tokens_map.json
CONFIG tokenizer.json
CONFIG tokenizer_config.json
CONFIG video_preprocessor_config.json
CONFIG vocab.json
# Code files (Generated from the files in the workspace directory)
# Supported file types: *.py, *.ipynb, *.sh, *.patch, *.c, *.h, *.hxx, *.cpp, *.cc, *.cxx, *.c++, *.hpp, *.hh, *.h++, *.java, *.js, *.mjs, *.cjs, *.jsx, *.ts, *.tsx, *.go, *.rs, *.swift, *.rb, *.php, *.scala, *.kt, *.kts, *.r, *.R, *.m, *.mm, *.f, *.f90, *.f95, *.f03, *.f08, *.jl, *.lua, *.pl, *.pm, *.cs, *.vb, *.dart, *.groovy, *.elm, *.erl, *.hrl, *.ex, *.exs, *.hs, *.lhs, *.clj, *.cljs, *.cljc, *.cl, *.lisp, *.lsp, *.scm, *.ss, *.rkt, *.sql, *.psql, *.mysql, *.sqlite, *.zig, *.cu, *.cuh, *.bash, *.zsh, *.fish, *.csh, *.tcsh, *.ksh, *.ps1, *.psm1, *.psd1, *.bat, *.cmd, *.vbs, *.wsf, *.applescript, *.scpt, *.awk, *.sed, *.expect, *.env, *.env.*, .env*, Makefile*, *.dockerfile, Dockerfile*, *.mk, *.cmake, CMakeLists.txt, *.gradle, *.gradle.kts, build.gradle*, settings.gradle*, *.sbt, *.mill, *.bazel, *.bzl, BUILD*, WORKSPACE*, *.buck, BUCK*, *.ninja, *.gyp, *.gypi, *.waf, wscript*, package.json, package-lock.json, yarn.lock, pnpm-lock.yaml, requirements*.txt, Pipfile*, pyproject.toml, setup.cfg, tox.ini, poetry.lock, Cargo.toml, Cargo.lock, go.mod, go.sum, composer.json, composer.lock, Gemfile*, *.gemspec, mix.exs, mix.lock, rebar.config, rebar.lock, *.so, *.dll, *.dylib, *.lib, *.a
CODE chat_template.jinja
# Model files (Generated from the files in the workspace directory)
# Supported file types: *.safetensors, *.bin, *.pt, *.pth, *.mar, *.pte, *.pt2, *.ptl, *.tflite, *.h5, *.hdf, *.hdf5, *.pb, *.meta, *.data-*, *.index, *.gguf, *.ggml, *.ggmf, *.ggjt, *.q4_0, *.q4_1, *.q5_0, *.q5_1, *.q8_0, *.f16, *.f32, *.ckpt, *.checkpoint, *.dist_ckpt, *.tensor, *.weights, *.state, *.embedding, *.vocab, *.ot, *.engine, *.trt, *.onnx, *.msgpack, *.model, *.pkl, *.pickle, *.keras, *.joblib, *.npy, *.npz, *.nc, *.mlmodel, *.coreml, *.mleap, *.surml, *.llamafile, *.caffemodel, *.prototxt, *.dlc, *.circle, *.nb
MODEL model-00001-of-00004.safetensors
MODEL model-00002-of-00004.safetensors
MODEL model-00003-of-00004.safetensors
MODEL model-00004-of-00004.safetensors
# Documentation files (Generated from the files in the workspace directory)
# Supported file types: *.txt, *.md, *.pdf, LICENSE*, README*, SETUP*, *requirements*, *.log, *.doc, *.docx, *.docm, *.dot, *.dotx, *.dotm, *.rtf, *.odt, *.ott, *.fodt, *.pages, *.wpd, *.xls, *.xlsx, *.xlsm, *.xlsb, *.xlt, *.xltx, *.xltm, *.ods, *.ots, *.fods, *.numbers, *.csv, *.ppt, *.pptx, *.pptm, *.pps, *.ppsx, *.ppsm, *.pot, *.potx, *.potm, *.odp, *.otp, *.fodp, *.key, *.epub, *.mobi, *.azw, *.azw3, *.fb2, *.fb3, *.lit, *.pdb, *.djvu, *.djv, *.html, *.htm, *.xhtml, *.mhtml, *.mht, *.xml, *.xsl, *.xslt, *.tex, *.latex, *.ltx, *.bib, *.rst, *.asciidoc, *.adoc, *.textile, *.wiki, *.mediawiki, *.org, *.texi, *.texinfo, *.info, *.man, *.chm, *.hlp, *.xps, *.jpg, *.jpeg, *.png, *.gif, *.bmp, *.tiff, *.ico, *.webp, *.heic, *.heif, *.hevc, *.svg, *.mp4, *.mov, *.avi, *.mkv, *.webm, *.m4v, *.flv, *.wmv, *.mpg, *.mpeg
DOC merges.txt
```
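The listing above follows a simple pattern: one `DIRECTIVE value` pair per line, with `#` marking comments. As an illustration, the sketch below groups the values by directive. The grammar is inferred from the listing alone rather than from a published specification, and the function name `parse_manifest` is hypothetical. Surrounding `|` characters are stripped so that the table-rendered form of each line is also accepted.

```python
from collections import defaultdict


def parse_manifest(text: str) -> dict[str, list[str]]:
    """Group manifest values by directive, skipping comments and blank lines."""
    entries: dict[str, list[str]] = defaultdict(list)
    for raw in text.splitlines():
        # Drop whitespace and any table pipes wrapped around the line.
        line = raw.strip("| \t")
        if not line or line.startswith("#"):
            continue
        # Assumption: the first space separates the directive from its value.
        directive, _, value = line.partition(" ")
        entries[directive].append(value.strip())
    return dict(entries)


sample = """
# Model name
NAME checkpoint-453-merged
FAMILY qwen3_vl
CONFIG config.json
CONFIG generation_config.json
MODEL model-00001-of-00004.safetensors
"""

manifest = parse_manifest(sample)
print(manifest["CONFIG"])
```

Keeping every directive as a list, even single-valued ones such as `NAME`, avoids special cases when a directive repeats, as `CONFIG` and `MODEL` do.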