DiffThinker / README.md
yhx12's picture
Update README.md
bb0cd5e verified
|
raw
history blame
898 Bytes
metadata
license: apache-2.0
language:
  - en
base_model:
  - Qwen/Qwen-Image-Edit-2509

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Project Page

Inference & Evaluation

The test datasets used in our experiments is provided within each task's directory. We recommend using the same data to ensure the reproducibility of our results and to facilitate comparison with other models. If you wish to generate your own test data, please refer to the gen.txt file in each task directory.

cd Maze

# 1. Inference and Parsing
bash eval/gen_and_parse.sh

# 2. Evaluation
bash eval/eval_path.sh

# 3. Individual Inference
python ../DiffSynth-Studio/add/infer/infer.py
python ../DiffSynth-Studio/add/infer/infer_with_middle.py