metadata
license: apache-2.0
language:
- en
base_model:
- Qwen/Qwen-Image-Edit-2509
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
Inference & Evaluation
The test datasets used in our experiments are provided in each task's directory. We recommend using the same data to ensure that our results are reproducible and to facilitate comparison with other models. To generate your own test data, refer to the gen.txt file in each task directory.
cd Maze
# 1. Inference and Parsing
bash eval/gen_and_parse.sh
# 2. Evaluation
bash eval/eval_path.sh
# 3. Individual Inference
python ../DiffSynth-Studio/add/infer/infer.py
python ../DiffSynth-Studio/add/infer/infer_with_middle.py
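For scripted runs, steps 1 and 2 above can be chained so that evaluation only starts once inference and parsing have succeeded. The following is a minimal sketch, not part of the repository: the helper name `run_pipeline` and the `runner` parameter are hypothetical, and it assumes both scripts are invoked with `bash` from inside the task directory, exactly as in the commands above.

```python
import subprocess
from pathlib import Path

# Script paths copied from the commands above; they are relative to the task directory.
PIPELINE = ("eval/gen_and_parse.sh", "eval/eval_path.sh")


def run_pipeline(task_dir, scripts=PIPELINE, runner=subprocess.run):
    """Run each script with bash from inside task_dir, stopping at the first failure.

    `runner` is a hypothetical hook (defaults to subprocess.run) that makes
    the helper easy to dry-run without touching the GPU.
    """
    task_dir = Path(task_dir)
    completed = []
    for script in scripts:
        # check=True raises CalledProcessError on a non-zero exit code,
        # so the evaluation script is never run on incomplete output.
        runner(["bash", script], cwd=task_dir, check=True)
        completed.append(script)
    return completed


# Example (assumes the current directory is the repository root):
# run_pipeline("Maze")
```

Passing a different directory name runs the same two scripts for another task, provided that task follows the same `eval/` layout as Maze; that layout is an assumption, so check the task directory first.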