Instructions to use MeiGen-AI/GenEvolve with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use MeiGen-AI/GenEvolve with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="MeiGen-AI/GenEvolve")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)
```

```python
# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("MeiGen-AI/GenEvolve")
model = AutoModelForImageTextToText.from_pretrained("MeiGen-AI/GenEvolve")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
```
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use MeiGen-AI/GenEvolve with vLLM:
Install from pip and serve model
```bash
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "MeiGen-AI/GenEvolve"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "MeiGen-AI/GenEvolve",
    "messages": [
      {
        "role": "user",
        "content": [
          { "type": "text", "text": "Describe this image in one sentence." },
          { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } }
        ]
      }
    ]
  }'
```
Use Docker
```bash
docker model run hf.co/MeiGen-AI/GenEvolve
```
- SGLang
How to use MeiGen-AI/GenEvolve with SGLang:
Install from pip and serve model
```bash
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "MeiGen-AI/GenEvolve" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "MeiGen-AI/GenEvolve",
    "messages": [
      {
        "role": "user",
        "content": [
          { "type": "text", "text": "Describe this image in one sentence." },
          { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } }
        ]
      }
    ]
  }'
```
Use Docker images
```bash
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "MeiGen-AI/GenEvolve" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "MeiGen-AI/GenEvolve",
    "messages": [
      {
        "role": "user",
        "content": [
          { "type": "text", "text": "Describe this image in one sentence." },
          { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } }
        ]
      }
    ]
  }'
```
- Docker Model Runner
How to use MeiGen-AI/GenEvolve with Docker Model Runner:
```bash
docker model run hf.co/MeiGen-AI/GenEvolve
```
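The curl calls in the vLLM and SGLang sections both target an OpenAI-compatible `/v1/chat/completions` endpoint, so the same request can be issued from Python. The following is a minimal sketch using only the standard library; the helper names `build_chat_payload` and `chat` are hypothetical, and the response shape (`choices[0].message.content`) is assumed to follow the OpenAI chat completions convention rather than anything this page documents.

```python
import json
import urllib.request

MODEL_ID = "MeiGen-AI/GenEvolve"


def build_chat_payload(prompt: str, image_url: str, model: str = MODEL_ID) -> dict:
    """Build the same request body that the curl examples send."""
    return {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
    }


def chat(base_url: str, payload: dict, timeout: float = 120.0) -> str:
    """POST the payload to <base_url>/v1/chat/completions and return the reply text.

    Assumes an OpenAI-style response body (choices[0].message.content).
    """
    request = urllib.request.Request(
        f"{base_url.rstrip('/')}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]


# Build the request body without touching the network.
payload = build_chat_payload(
    "Describe this image in one sentence.",
    "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg",
)
print(json.dumps(payload, indent=2))
```

With a server running, `chat("http://localhost:8000", payload)` would target the vLLM example and `chat("http://localhost:30000", payload)` the SGLang one, matching the ports used in the curl commands.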
Update public links in model card
README.md
CHANGED
````diff
@@ -16,7 +16,7 @@ tags:
 - multimodal
 - qwen3-vl
 datasets:
-- MeiGen-AI/GenEvolve-Data
+- MeiGen-AI/GenEvolve-Data-Bench
 ---

 <div align="center">
@@ -30,11 +30,9 @@ datasets:
 <p>
 <a href="https://ephemeral182.github.io/GenEvolve/">
 <img alt="Project Page" src="https://img.shields.io/badge/🌐_Project-Page-1f6feb"></a>
-<a href="https://
-<img alt="arXiv" src="https://img.shields.io/badge/📄_arXiv-XXXX.XXXXX-b31b1b"></a>
-<a href="https://github.com/Ephemeral182/GenEvolve">
+<a href="https://github.com/MeiGen-AI/GenEvolve">
 <img alt="Code" src="https://img.shields.io/badge/💾_GitHub-Code-181717"></a>
-<a href="https://huggingface.co/datasets/MeiGen-AI/GenEvolve-Data">
+<a href="https://huggingface.co/datasets/MeiGen-AI/GenEvolve-Data-Bench">
 <img alt="Dataset" src="https://img.shields.io/badge/🤗_Dataset-GenEvolve--Data-FFD21E"></a>
 </p>

@@ -111,12 +109,12 @@ For a user request, the agent samples a multi-turn trajectory of tool calls befo

 ## 🚀 Quick Start

-The deployed checkpoint is the **student policy** - it consumes a user prompt and returns a JSON `gen_prompt + reference_images` program through a `<think>/<tool_call>/<answer>` loop. The end-to-end runtime (vLLM serving + agent loop + tools + Qwen/Nano renderers) lives in the [GitHub repo](https://github.com/
+The deployed checkpoint is the **student policy** - it consumes a user prompt and returns a JSON `gen_prompt + reference_images` program through a `<think>/<tool_call>/<answer>` loop. The end-to-end runtime (vLLM serving + agent loop + tools + Qwen/Nano renderers) lives in the [GitHub repo](https://github.com/MeiGen-AI/GenEvolve); the snippet below mirrors its installation and usage.

 ### 1. Install the main GenEvolve runtime

 ```bash
-git clone https://github.com/
+git clone https://github.com/MeiGen-AI/GenEvolve.git
 cd GenEvolve

 conda create -n genevolve python=3.11 -y && conda activate genevolve
@@ -144,7 +142,7 @@ MODEL_PATH=MeiGen-AI/GenEvolve PORT=8000 TP=1 DP=8 bash scripts/serve_vllm.sh

 ```bash
 export SERPER_API_KEY=<your_key> # required for search / image_search
-export GOOGLE_API_KEY=<your_key> # only for --backend nano-banana-pro
+export GOOGLE_API_KEY=<your_key> # or GEMINI_API_KEY; only for --backend nano-banana-pro

 # Nano Banana Pro renderer
 python examples/quickstart.py \
@@ -183,9 +181,9 @@ The agent's final `<answer>` is a JSON object:
 | Artifact | Link |
 |---|---|
 | Project page | https://ephemeral182.github.io/GenEvolve/ |
-| Paper
-| Code | https://github.com/
-| Training data + benchmark | [MeiGen-AI/GenEvolve-Data](https://huggingface.co/datasets/MeiGen-AI/GenEvolve-Data) |
+| Paper | Coming soon |
+| Code | https://github.com/MeiGen-AI/GenEvolve |
+| Training data + benchmark | [MeiGen-AI/GenEvolve-Data-Bench](https://huggingface.co/datasets/MeiGen-AI/GenEvolve-Data-Bench) |
 | Base model | [Qwen/Qwen3-VL-8B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-8B-Instruct) |

 ---
````
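The Quick Start text in the diff says the checkpoint returns a JSON `gen_prompt + reference_images` program through a `<think>/<tool_call>/<answer>` loop, and that the final `<answer>` is a JSON object. A downstream script might pull that object out of the raw model text roughly as follows. This is only a sketch: the closing `</answer>` tag, the exact key names, and the sample response are assumptions made for illustration, and `extract_answer` is a hypothetical helper, not part of the GenEvolve runtime.

```python
import json
import re

# Assumes the answer is wrapped as <answer>...</answer>; the diff only names the tag.
ANSWER_PATTERN = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def extract_answer(response_text: str) -> dict:
    """Return the JSON object inside the last <answer>...</answer> block."""
    matches = ANSWER_PATTERN.findall(response_text)
    if not matches:
        raise ValueError("no <answer> block found in the response")
    answer = json.loads(matches[-1].strip())
    if not isinstance(answer, dict):
        raise ValueError("<answer> payload is not a JSON object")
    return answer


# Hypothetical model output, invented for illustration only.
sample = (
    "<think>The user wants a poster; one reference image is enough.</think>\n"
    '<answer>{"gen_prompt": "a retro travel poster of a lighthouse", '
    '"reference_images": ["ref_0.png"]}</answer>'
)

program = extract_answer(sample)
print(program["gen_prompt"])
print(program.get("reference_images", []))
```

Taking the last `<answer>` block is a design choice for this sketch: in a multi-turn trajectory, earlier turns could contain tool calls or drafts, and only the final answer is meant to be rendered.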