Improve model card for MedVSR: Add pipeline tag, paper, code, and usage details

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +73 -3
README.md CHANGED
@@ -1,3 +1,73 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ pipeline_tag: image-to-image
+ ---
+
+ # MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation
+
+ This model was presented in the paper [MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation](https://huggingface.co/papers/2509.21265).
+ The official code repository can be found at: [https://github.com/CUHK-AIM-Group/MedVSR](https://github.com/CUHK-AIM-Group/MedVSR).
+
+ ## Overview
+ **MedVSR** is a video super-resolution (VSR) model tailored to medical videos.
+ It employs Cross State-Space Propagation (CSSP) to address imprecise alignment: distant frames are projected as control matrices within state-space models, which enables the selective propagation of consistent and informative features to neighboring frames.
+ It also features an Inner State-Space Reconstruction (ISSR) module that enhances tissue structures and reduces artifacts through joint long-range spatial feature learning and large-kernel short-range information aggregation.
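
To make the idea of "distant frames as control matrices" more concrete, here is a minimal toy sketch, not the MedVSR implementation. It assumes a scalar state-space recurrence `h_t = decay * h_{t-1} + B_t * x_t`, `y_t = C_t * h_t`, in which `B_t` and `C_t` are computed from a second (distant) sequence instead of from the input itself. The function name, the sigmoid projection, and the weights `w_b` and `w_c` are all hypothetical choices made for illustration.

```python
import math
from typing import List


def _sigmoid(value: float) -> float:
    """Squash a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-value))


def cross_state_space_scan(
    current: List[float],
    distant: List[float],
    decay: float = 0.9,
    w_b: float = 1.0,  # hypothetical projection weight for the input matrix B
    w_c: float = 1.0,  # hypothetical projection weight for the output matrix C
) -> List[float]:
    """Toy 1-D scan where the distant sequence controls B_t and C_t.

    This is an illustrative assumption about how cross-frame control could
    look, not the CSSP module described in the paper.
    """
    if len(current) != len(distant):
        raise ValueError("current and distant must have the same length")

    hidden = 0.0
    outputs = []
    for x_t, d_t in zip(current, distant):
        b_t = _sigmoid(w_b * d_t)  # how much of x_t enters the state
        c_t = _sigmoid(w_c * d_t)  # how much of the state is read out
        hidden = decay * hidden + b_t * x_t
        outputs.append(c_t * hidden)
    return outputs
```

In this sketch, positions where the distant sequence has large positive values pass more of the current features into the state, which loosely mirrors the "selective propagation" described above.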
+
+ ## Installation
+
+ Clone the repository and set up the environment:
+ ```bash
+ git clone https://github.com/CUHK-AIM-Group/MedVSR
+ cd MedVSR
+
+ conda create -n MedVSR python=3.9
+ conda activate MedVSR
+
+ pip install torch==2.1.1+cu121 torchvision==0.16.1+cu121 --extra-index-url https://download.pytorch.org/whl/cu121
+ pip install -r requirements.txt
+
+ # Quote the version specifier: an unquoted ">" is parsed by the shell as an output redirect.
+ # "-e" expects a local path, so it is dropped for this versioned requirement.
+ pip install "causal_conv1d>=1.1.0"
+ pip install -e mamba-1p1p1
+ ```
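
After the installation steps above, a quick sanity check can confirm that the expected packages are importable. The sketch below uses only the standard library. The module names `causal_conv1d` and `mamba_ssm` are assumptions about what the two final `pip install` commands provide, and `check_environment` is a hypothetical helper, not part of the MedVSR repository.

```python
import importlib.util
import sys
from typing import Dict, Iterable

# Assumed top-level import names for the packages installed above.
REQUIRED_MODULES = ("torch", "torchvision", "causal_conv1d", "mamba_ssm")


def check_environment(modules: Iterable[str] = REQUIRED_MODULES) -> Dict[str, bool]:
    """Return a mapping from module name to whether it can be found.

    find_spec only locates the module; it does not import it, so this
    check is cheap and has no side effects from the packages themselves.
    """
    return {name: importlib.util.find_spec(name) is not None for name in modules}


def python_matches(major: int = 3, minor: int = 9) -> bool:
    """Check that the interpreter matches the version used in the conda env."""
    return sys.version_info[:2] == (major, minor)
```

Calling `check_environment()` inside the activated `MedVSR` environment should report `True` for every entry if the installation succeeded under these assumptions.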
+
+ ## Dataset preparation
+
+ Download the preprocessed HyperKvasir, LDPolyp, and EndoVis18 datasets from [Hugging Face](https://huggingface.co/datasets/jeffrey423/MedVSR_dataset). Modify L14-16 and L39-40 to point to the extracted HyperKvasir training and validation folders.
+
+ ## Test the model
+
+ Download our pretrained model [here](https://huggingface.co/jeffrey423/MedVSR).
+
+ ```bash
+ python test_model.py -opt ./options/medvsr_train.yml --weight <PATH_TO_PRETRAINED_MEDVSR>
+ ```
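
Both the preprocessed datasets and the pretrained weights referenced above are hosted on the Hugging Face Hub, so they can also be fetched programmatically. The following is a minimal sketch that assumes the `huggingface_hub` package is installed; `fetch_from_hub` is a hypothetical helper, and the layout of the downloaded files is not verified here.

```python
from pathlib import Path

# Repository IDs taken from the links in this model card.
DATASET_REPO = "jeffrey423/MedVSR_dataset"
WEIGHTS_REPO = "jeffrey423/MedVSR"


def fetch_from_hub(repo_id: str, repo_type: str, local_dir: str) -> Path:
    """Download a full Hub repository snapshot into local_dir.

    repo_type is "dataset" for the data and "model" for the weights.
    The import is done lazily so this module loads even when
    huggingface_hub is not installed.
    """
    from huggingface_hub import snapshot_download

    downloaded = snapshot_download(
        repo_id=repo_id,
        repo_type=repo_type,
        local_dir=local_dir,
    )
    return Path(downloaded)
```

For example, `fetch_from_hub(DATASET_REPO, "dataset", "./data")` followed by `fetch_from_hub(WEIGHTS_REPO, "model", "./weights")` would place both under the current directory; the target folder names are placeholders.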
+
+ ## Training
+ ```bash
+ bash dist_train.sh 2 options/medvsr_train.yml 25623
+ ```
+
+ ## Citation
+ ```bibtex
+ @inproceedings{liu2025medvsr,
+   title     = {MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation},
+   author    = {Liu, Xinyu and Sun, Guolei and Wang, Cheng and Yuan, Yixuan and Konukoglu, Ender},
+   booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
+   year      = {2025}
+ }
+ ```
+
+ ## Acknowledgement
+
+ We sincerely thank the authors and contributors of the following projects for their awesome codebases, which have greatly benefited our work:
+
+ - [BasicSR](https://github.com/XPixelGroup/BasicSR)
+ - [IART](https://github.com/kai422/IART)
+ - [RVRT](https://github.com/JingyunLiang/RVRT)
+ - [Mamba](https://github.com/state-spaces/mamba)
+ - [MambaVision](https://github.com/NVlabs/MambaVision)
+ - [Vim](https://github.com/hustvl/Vim)
+
+ ## Contact
+
+ Please contact [xinyuliu@link.cuhk.edu.hk](mailto:xinyuliu@link.cuhk.edu.hk) or open an issue.