--- license: other language: - en - zh tags: - medical - radiology - chest-xray - report-generation - discrete-diffusion - vision-language pipeline_tag: image-text-to-text ---
# ECHO_Base_block8 **ECHO_Base_block8** is the **RAD (Response-Asymmetric Diffusion) stage** teacher model with a block length of 8. It performs multi-step block diffusion decoding and serves as the teacher for Direct Conditional Distillation (DCD) to produce the distilled `ECHO_block8` student. ECHO (Efficient Chest X-ray Report Generation with One-step Block Diffusion) is a discrete diffusion vision–language model for automated chest X-ray report generation. It converts a pretrained autoregressive model into a one-step-per-block decoder via RAD adaptation and DCD. ## Model Details | Property | Value | |----------|-------| | Stage | RAD (teacher) | | Block Length | 8 | | Decoding | Multi-step block diffusion | | Architecture | `EchoForConditionalGeneration` (based on Qwen2.5-VL) | | Hidden Size | 3584 | | Languages | English, Chinese | | License | [Midea NC](LICENSE) | ## Usage ```bash git clone https://github.com/midea-ai/ECHO.git cd ECHO pip install transformers==4.55.4 ``` ```python # Multi-step inference with ECHO_Base_block8 python inference/generate_vl_block.py \ --model_dir Midea-AIRC/ECHO_Base_block8 \ --image_path /path/to/chest_xray.jpg \ --prompt_text "Review this chest X-ray and write a report. Use this format: Findings: {}, Impression: {}." \ --remasking_strategy "low_confidence_dynamic" \ --block_length 8 \ --denoising_steps 8 ``` For Chinese prompts: ```python python inference/generate_vl_block.py \ --model_dir Midea-AIRC/ECHO_Base_block8 \ --image_path /path/to/chest_xray.jpg \ --prompt_text "这是一组胸部X光图像,请生成一份医学报告,包括所见和结论。以以下格式返回报告:所见:{} 结论:{}。" \ --remasking_strategy "low_confidence_dynamic" \ --block_length 8 \ --denoising_steps 8 ``` ## Model Zoo | Model | Stage | Description | Link | |-------|-------|-------------|------| | `ECHO_Base_block4` | RAD | Multi-step block diffusion (block length 4), teacher for distillation | [ECHO_Base_block4](https://huggingface.co/Midea-AIRC/ECHO_Base_block4) | | `ECHO_Base_block8` | RAD | Multi-step block diffusion (block length 8), teacher for distillation | [ECHO_Base_block8](https://huggingface.co/Midea-AIRC/ECHO_Base_block8) | | `ECHO_block4` | DCD | Single-step distilled student (block length 4) | [ECHO_block4](https://huggingface.co/Midea-AIRC/ECHO_block4) | | `ECHO_block8` | DCD | Single-step distilled student (block length 8) | [ECHO_block8](https://huggingface.co/Midea-AIRC/ECHO_block8) | ## License This model is released under the [Midea Model License Agreement - Non-Commercial Use Version](LICENSE). Use for research, study, and personal non-commercial purposes only. Commercial use is strictly prohibited. ## Citation ```bibtex @misc{chen2026echoefficientchestxray, title={ECHO: Efficient Chest X-ray Report Generation with One-step Block Diffusion}, author={Lifeng Chen and Tianqi You and Hao Liu and Zhimin Bao and Jile Jiao and Xiao Han and Zhicai Ou and Tao Sun and Xiaofeng Mou and Xiaojie Jin and Yi Xu}, year={2026}, eprint={2604.09450}, archivePrefix={arXiv}, primaryClass={cs.LG}, url={https://arxiv.org/abs/2604.09450}, } ``` ## Contact - **Lifeng Chen**, Beijing Jiaotong University (lfchen@bjtu.edu.cn) - **Hao Liu** (Corresponding Author), AI Research Center, Midea Group (liuhao249@midea.com)