# Vision transformer - RVSA for object detection ## Table of Contents - [Features](#Features) - [Getting Started](#getting-started) - [Prerequisites](#prerequisites) - [Installation](#installation) ## Features Load object detection model - remote sensing image. * Visualize ## Getting Started Below is the list of dependencies and steps to use the repository ### Prerequisites List any software or libraries that are required to run your project. * Python 3 * mmcv * mmrotate * torch * mmdetection ### Installation Step-by-step instructions on how to install your project. 1. Install environment: - Install torch * CPU: ``` pip install -q torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cpu pip install -q mmcv==2.0.1 -f https://download.openmmlab.com/mmcv/dist/cpu/torch2.0/index.html ``` * GPU: ``` pip install -q torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pip install -q mmcv==2.0.1 -f https://download.openmmlab.com/mmcv/dist/cu117/torch2.0/index.html ``` - Install other dependencies: ``` pip install -q "mmdet>=3.0.0rc6,<3.1.0" git clone https://github.com/open-mmlab/mmrotate -b 1.x cd mmrotate pip install -q -e . cd .. git clone https://github.com/ViTAE-Transformer/MTP cp -r ./MTP/RS_Tasks_Finetune/Rotated_Detection/mmrotate1.x/mmrotate/* ./mmrotate/mmrotate git clone https://huggingface.co/datasets/KhangTruong/NWPU-Caption tar -xf NWPU-Caption/02_NWPU_RESISC45.tar tar -xf NWPU-Caption/02_NWPU_caption.tar mv 02_NWPU_RESISC45 data mv 02_NWPU_caption data python install.py --root . ``` 2. Change the directory to *mmrotate*, which is inside the current working directory ``` cd mmrotate ``` 3. Run the inference ``` python ../extract_feature.py -n ../data/airplane/airplane_002.jpg ``` 4. It will produce an image in *vis_data*, which is the labelled image.