root commited on Nov 25, 2025

Commit

4a2b9ca

1 Parent(s): a44108d

Initial model upload

Files changed (27) hide show

.gitattributes +2 -0
README.md +81 -0
agent_tools/checkpoints/DiffLL/model.pth.tar +3 -0
agent_tools/checkpoints/ESRGAN/RealESRGAN_x4plus.pth +3 -0
agent_tools/checkpoints/HVICIDNet/generalization.pth +3 -0
agent_tools/checkpoints/IDT/epoch100.pth.tar +3 -0
agent_tools/checkpoints/Img2img_turbo/rainy2day.pkl +3 -0
agent_tools/checkpoints/Img2img_turbo/snow2day.pkl +3 -0
agent_tools/checkpoints/KANet/trained_model_epoch1.pk +3 -0
agent_tools/checkpoints/LightenDiffusion/stage2_weight.pth.tar +3 -0
agent_tools/checkpoints/RIDCP/pretrained_HQPs.pth +3 -0
agent_tools/checkpoints/RIDCP/pretrained_RIDCP.pth +3 -0
agent_tools/checkpoints/RIDCP/weight_for_matching_dehazing_Flickr.pth +3 -0
agent_tools/checkpoints/Retinexformer/FiveK.pth +3 -0
agent_tools/checkpoints/S2Former/udrs2former_demo.pth +3 -0
agent_tools/checkpoints/S2Former/udrs2former_raindrop_real.pth +3 -0
agent_tools/checkpoints/S2Former/udrs2former_raindrop_syn.pth +3 -0
agent_tools/checkpoints/SCUNet/scunet_color_real_gan.pth +3 -0
agent_tools/checkpoints/SnowMaster/checkpoint_0318.pth +3 -0
degradation_synthesis/rainy/GuidedDisent/weights/pretrained.pth +3 -0
degradation_synthesis/snow/checkpoints/day2snow.pkl +3 -0
config.json → pretrained/mrrhf/config.json +0 -0
preprocessor_config.json → pretrained/mrrhf/preprocessor_config.json +0 -0
pytorch_model.bin → pretrained/mrrhf/pytorch_model.bin +0 -0
special_tokens_map.json → pretrained/mrrhf/special_tokens_map.json +0 -0
tokenizer.json → pretrained/mrrhf/tokenizer.json +0 -0
tokenizer_config.json → pretrained/mrrhf/tokenizer_config.json +0 -0

.gitattributes CHANGED Viewed

@@ -34,3 +34,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 *.json filter=lfs diff=lfs merge=lfs -text

 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 *.json filter=lfs diff=lfs merge=lfs -text
+*.pth.tar filter=lfs diff=lfs merge=lfs -text
+*.pk filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,81 @@

+---
+license: apache-2.0
+tags:
+- cvpr25
+- JarvisIR
+- weights
+description: |
+  This repository contains the official weights for the CVPR 2025 paper "JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration".
+---
+# JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration
+## Model Description
+JarvisIR is a novel system that leverages a Vision-Language Model (VLM) to intelligently restore images for autonomous driving perception in adverse weather. It acts as a central controller, dynamically coordinating multiple expert restoration models to tackle complex degradations such as rain, fog, low-light, and snow.
+## Key Features
+- **VLM Controller**: The first framework to employ a Vision-Language Model for orchestrating image restoration workflows.
+- **Multi-Expert Coordination**: Dynamically schedules specialized restoration models for tasks like denoising, super-resolution, and deraining.
+- **Adaptive Restoration**: Effectively handles a wide range of adverse weather conditions, including night/low-light, rain, fog, and snow.
+- **Advanced Training Strategy**: Utilizes a two-stage process of Supervised Fine-Tuning (SFT) followed by alignment with Mixed-Rank Reward-based Human Feedback (MRRHF).
+## Model Architecture
+The system comprises three core components:
+1.  **VLM Controller**: A LLaVA-v1.5-7B model serves as the core for task planning and expert model selection.
+2.  **Expert Models**: A suite of specialized networks, each tailored for a specific restoration task (e.g., deraining, defogging).
+3.  **Reward Models**: A set of Image Quality Assessment (IQA) models that provide feedback for quality assessment and alignment during training.
+## Training Data
+JarvisIR was trained on a large-scale, comprehensive dataset:
+- **CleanBench-Synthetic**: A dataset of 150,000 synthetically degraded images with corresponding annotations.
+- **CleanBench-Real**: A collection of 80,000 real-world images captured in adverse weather, used for alignment training.
+- **Comprehensive Coverage**: The data covers four primary weather scenarios (night, rain, fog, snow) with various combinations of degradations.
+## Performance
+- Achieves a **50% average improvement** in perception metrics on the CleanBench-Real dataset compared to state-of-the-art all-in-one methods.
+- Demonstrates superior performance across all tested weather conditions.
+- Exhibits enhanced robustness and generalization capabilities in real-world driving scenarios.
+## Intended Use
+**Primary Use Cases:**
+- Enhancing perception systems in autonomous vehicles.
+- Building robust, multi-weather image restoration pipelines.
+- Advancing research into the applications of Vision-Language Models in image processing.
+## Model Checkpoints
+This repository provides the following model weights:
+- `pertained`: The complete model after both Supervised Fine-Tuning and MRRHF alignment stages.
+- `agent-tools/`: The weights for each individual expert restoration model.
+## Citation
+If you find JarvisIR useful in your research, please cite our paper:
+```bibtex
+@inproceedings{lin2025jarvisir,
+  title={Jarvisir: Elevating autonomous driving perception with intelligent image restoration},
+  author={Lin, Yunlong and Lin, Zixu and Chen, Haoyu and Pan, Panwang and Li, Chenxin and Chen, Sixiang and Wen, Kairun and Jin, Yeying and Li, Wenbo and Ding, Xinghao},
+  booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
+  pages={22369--22380},
+  year={2025}
+}
+```
+## Related Resources
+- **Project Page**: https://cvpr2025-jarvisir.github.io/
+- **Code Repository**: https://github.com/LYL1015/JarvisIR
+- **Paper**: https://arxiv.org/pdf/2504.04158
+## Acknowledgments
+This work contributes to the advancement of intelligent image restoration by integrating Vision-Language Models with expert system coordination.

agent_tools/checkpoints/DiffLL/model.pth.tar ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:acc3b4a41b7a0a12dbd24bbd904d44ddc070cffdcc638dbb038acc3c50715c9d
+size 353865641

agent_tools/checkpoints/ESRGAN/RealESRGAN_x4plus.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4fa0d38905f75ac06eb49a7951b426670021be3018265fd191d2125df9d682f1
+size 67040989

agent_tools/checkpoints/HVICIDNet/generalization.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:143e88e8a92d1bc21f05550f415f43ceae02d1c83360ec062428ebd6f8d06914
+size 7971269

agent_tools/checkpoints/IDT/epoch100.pth.tar ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6a7291b98969ce4b0f5ce867ca2fc63369340d347258b078539da46eea056189
+size 197713581

agent_tools/checkpoints/Img2img_turbo/rainy2day.pkl ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:09811f30d7cacfe33e618a7dcfaab3913a66b127972641ccedfffbe9a560d796
+size 1229932962

agent_tools/checkpoints/Img2img_turbo/snow2day.pkl ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4c72af2be175e89b33aeac852f0e66c3d7054ce789dc95f1407533cda69d1c3d
+size 1200345822

agent_tools/checkpoints/KANet/trained_model_epoch1.pk ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:adca80cf604f41b6bca3da236c2473b728e75698dfa8b626540d365f0f7bbf81
+size 223509311

agent_tools/checkpoints/LightenDiffusion/stage2_weight.pth.tar ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4bde72decbd99c4bd23bb1748c521e52c3a2b41299b355d5ba0d12fda1c4e014
+size 111512513

agent_tools/checkpoints/RIDCP/pretrained_HQPs.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2b63ca2a3cb0e65a7614f5f2377ef4c070a36ecf41749e7dddbec37e1f1288d6
+size 25118706

agent_tools/checkpoints/RIDCP/pretrained_RIDCP.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1ea9f4344d2e46eb07d95a5a67b2c80cecb78d26ad0e3250c624031178c68271
+size 122065395

agent_tools/checkpoints/RIDCP/weight_for_matching_dehazing_Flickr.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:050ce30a772299c4b6d30754235b910a0087761b8e8144c67a10ff02b230b4fc
+size 8939

agent_tools/checkpoints/Retinexformer/FiveK.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:800f6a9281fe8d95daca3108f2b826d5a2adead09031e0e998d30a615286d9c1
+size 6478393

agent_tools/checkpoints/S2Former/udrs2former_demo.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:89223310c5102e9fd7253fa3224947696bf2ca395b94ae81b02561abfaa98165
+size 35369131

agent_tools/checkpoints/S2Former/udrs2former_raindrop_real.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4517e7f724cc0e3e2f094070b424221cf0b25851d857505b18e438659b4e7907
+size 35379671

agent_tools/checkpoints/S2Former/udrs2former_raindrop_syn.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3fe58f8b7687cfac612bc13e04dd4bd13a8993adf95aff6e966baef65e0358ab
+size 35378507

agent_tools/checkpoints/SCUNet/scunet_color_real_gan.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:892c83f812c59173273b74f4f34a14ecaf57a2fdb68df056664589beb55c966e
+size 71982835

agent_tools/checkpoints/SnowMaster/checkpoint_0318.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:18342bc16e8fe1bfbcddf9f419103dccf4cd83598668b34e702e17ad9abb3899
+size 274874370

degradation_synthesis/rainy/GuidedDisent/weights/pretrained.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b083eb487709d3e6193cef7ce99ffa13f4b1aa57340e006c442da607fa73d594
+size 120316019

degradation_synthesis/snow/checkpoints/day2snow.pkl ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bd694b6d2d200f62ef9a8dbd1257c055f4a7fdb16e3e660766d48356fc6b95bb
+size 1200347142

config.json → pretrained/mrrhf/config.json RENAMED Viewed

File without changes

preprocessor_config.json → pretrained/mrrhf/preprocessor_config.json RENAMED Viewed

File without changes

pytorch_model.bin → pretrained/mrrhf/pytorch_model.bin RENAMED Viewed

File without changes

special_tokens_map.json → pretrained/mrrhf/special_tokens_map.json RENAMED Viewed

File without changes

tokenizer.json → pretrained/mrrhf/tokenizer.json RENAMED Viewed

File without changes

tokenizer_config.json → pretrained/mrrhf/tokenizer_config.json RENAMED Viewed

File without changes