--- license: mit tags: - image-classification - remote-sensing - resnet - pytorch - transformers --- # RSP-ResNet-50 ResNet-50 based model for remote sensing scene classification (51 classes). ## Usage ```python from transformers import AutoModelForImageClassification import torch # Load model model = AutoModelForImageClassification.from_pretrained( "BiliSakura/RSP-ResNet-50", trust_remote_code=True ) # Inference model.eval() input_image = torch.randn(1, 3, 224, 224) # (batch, channels, height, width) with torch.no_grad(): outputs = model(pixel_values=input_image) logits = outputs.logits # Shape: (1, 51) predicted_class = logits.argmax(dim=-1).item() ``` ## Model Details - **Architecture:** ResNet-50 - **Input size:** 224×224×3 - **Number of classes:** 51 - **Parameters:** ~23.6M ## Citation If you use this model, please cite the original RSP paper: ```bibtex @ARTICLE{rsp, author={Wang, Di and Zhang, Jing and Du, Bo and Xia, Gui-Song and Tao, Dacheng}, journal={IEEE Transactions on Geoscience and Remote Sensing}, title={An Empirical Study of Remote Sensing Pretraining}, year={2023}, volume={61}, number={}, pages={1-20}, doi={10.1109/TGRS.2022.3176603} } ``` **Original Repository:** [ViTAE-Transformer/RSP](https://github.com/ViTAE-Transformer/RSP)