| | --- |
| | license: apache-2.0 |
| | pipeline_tag: image-to-image |
| | --- |
| | |
| | # MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation |
| |
|
| | This model was presented in the paper [MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation](https://huggingface.co/papers/2509.21265). |
| | The official code repository can be found at: [https://github.com/CUHK-AIM-Group/MedVSR](https://github.com/CUHK-AIM-Group/MedVSR). |
| |
|
| | ## Overview |
| | **MedVSR** is a tailored model for medical VSR. |
| | It first employs Cross State-Space Propagation (CSSP) to address the imprecise alignment by projecting distant frames as control matrices within state-space models, enabling the selective propagation of consistent and informative features to neighboring frames for effective alignment. |
| | It also features an Inner State-Space Reconstruction (ISSR) module that enhances tissue structures and reduces artifacts with joint long-range spatial feature learning and large-kernel short-range information aggregation. |
| |
|
| | ## Installation |
| |
|
| | Clone this repository: |
| | ```bash |
| | git clone https://github.com/CUHK-AIM-Group/MedVSR |
| | cd MedVSR |
| | |
| | conda create -n MedVSR python==3.9 |
| | conda activate MedVSR |
| | |
| | pip install torch==2.1.1+cu121 torchvision==0.16.1+cu121 --extra-index-url https://download.pytorch.org/whl/cu121 |
| | pip install -r requirements.txt |
| | |
| | pip install -e causal_conv1d>=1.1.0 |
| | pip install -e mamba-1p1p1 |
| | ``` |
| |
|
| | ## Dataset preparation |
| |
|
| | For the preprocessed HyperKvasir, LDPolyp, and EndoVis18, please download from [huggingface link](https://huggingface.co/datasets/jeffrey423/MedVSR_dataset). Modify L14-16 and L39-40 to the extracted HyperKvasir training and validation folders. |
| |
|
| | ## Test the model |
| |
|
| | Download our pretrained model at [here](https://huggingface.co/jeffrey423/MedVSR). |
| |
|
| | ```python |
| | python test_model.py -opt ./options/medvsr_train.yml --weight <PATH_TO_PRETRAINED_MEDVSR> |
| | ``` |
| |
|
| | ## Training |
| | ```bash |
| | bash dist_train.sh 2 options/medvsr_train.yml 25623 |
| | ``` |
| |
|
| | ## Citation |
| | ```bibtex |
| | @inproceedings{liu2025medvsr, |
| | title = {MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation}, |
| | author = {Liu, Xinyu and Sun, Guolei and Wang, Cheng and Yuan, Yixuan and Konukoglu, Ender}, |
| | booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, |
| | year = {2025} |
| | } |
| | ``` |
| |
|
| | ## Acknowledgement |
| |
|
| | We sincerely thank the authors and contributors of the following projects for their awesome codebases, which have greatly benefited our work: |
| |
|
| | - [BasicSR](https://github.com/XPixelGroup/BasicSR) |
| | - [IART](https://github.com/kai422/IART) |
| | - [RVRT](https://github.com/JingyunLiang/RVRT) |
| | - [Mamba](https://github.com/state-spaces/mamba) |
| | - [MambaVision](https://github.com/NVlabs/MambaVision) |
| | - [Vim](https://github.com/hustvl/Vim) |
| |
|
| | ## Contact |
| |
|
| | Please contact [xinyuliu@link.cuhk.edu.hk](mailto:xinyuliu@link.cuhk.edu.hk) or open an issue. |