File size: 4,877 Bytes
115c800
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
# LAJ CNN Image-to-GPS Model Iteration 1

This project features a convolutional neural network (CNN) for predicting GPS coordinates (latitude and longitude) from image inputs. Below, you'll find details on loading the model, performing inference, and the architecture of the network.

---

## 1. Loading the Model

To load the model, look at the sampleRun_v4.ipynb and run the same commands.



## 2. Running the Model



To perform inference on our model, just normalize the latitudes and longitudes to our means and standard deviations below.

Then run code similar to the code provided to test code provided below:



```

# Evaluate on Test Set

model.eval()

all_preds, all_actuals = [], []

with torch.no_grad():
    for images, gps_coords in val_loader:

        images, gps_coords = images.to(device), gps_coords.to(device)

        outputs = model(images)

        all_preds.append(outputs.cpu())

        all_actuals.append(gps_coords.cpu())

all_preds = torch.cat(all_preds).numpy()

all_actuals = torch.cat(all_actuals).numpy()


# Denormalize Predictions
all_preds_denorm = all_preds * np.array([lat_std, lon_std]) + np.array([lat_mean, lon_mean])

all_actuals_denorm = all_actuals * np.array([lat_std, lon_std]) + np.array([lat_mean, lon_mean])

# Compute Error Metrics
mae = mean_absolute_error(all_actuals_denorm, all_preds_denorm)
rmse = mean_squared_error(all_actuals_denorm, all_preds_denorm, squared=False)
print(f"Test Set Mean Absolute Error: {mae:}")
print(f"Test Set Root Mean Squared Error: {rmse:}")
```



## 3. Latitude and Longitude Means and Standard Deviations



The following values represent the **means** and **standard deviations** of the latitude and longitude used in this model:



- **Latitude Mean**: `39.95173729922173`

- **Latitude Standard Deviation**: `0.0006877829213952256`

- **Longitude Mean**: `-75.19138804851796`

- **Longitude Standard Deviation**: `0.0006182574854250925`

  These values are used to normalize and denormalize the latitude and longitude predictions during inference.



## 4. CNN Architecture



Finally here is the architecture of the CNN we used:



```

# Model Definition
class CustomGPSModel(nn.Module):
    def __init__(self):

        super(CustomGPSModel, self).__init__()


        # Load EfficientNetV2-S with pretrained weights

        self.efficientnet = efficientnet_v2_s(pretrained=True)


        # Modify the final layer for regression (predicting latitude and longitude)

        num_features = self.efficientnet.classifier[1].in_features

        self.efficientnet.classifier[1] = nn.Linear(num_features, 2)  # Output layer has 2 outputs for latitude & longitude


        # Don't freeze earlier layers

        for param in self.efficientnet.features.parameters():

            param.requires_grad = True


    def forward(self, x):

        return self.efficientnet(x)  # Forward pass through EfficientNet


```



## 5. Sample Run Code (how to install and run everything)



```
!pip install datasets
!pip install huggingface_hub

!pip install requests

import torch

import torch.nn as nn

import torch.optim as optim

from torchvision.models import efficientnet_v2_s

from torch.optim.lr_scheduler import CosineAnnealingLR
from torchvision import transforms
from torch.utils.data import DataLoader, Dataset
from torchvision.transforms import functional as F
from PIL import Image
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error
from huggingface_hub import PyTorchModelHubMixin

import os





# Model Definition

class CustomGPSModel(nn.Module):

    def __init__(self):

        super(CustomGPSModel, self).__init__()



        # Load EfficientNetV2-S with pretrained weights

        self.efficientnet = efficientnet_v2_s(pretrained=True)



        # Modify the final layer for regression (predicting latitude and longitude)

        num_features = self.efficientnet.classifier[1].in_features

        self.efficientnet.classifier[1] = nn.Linear(num_features, 2)  # Output layer has 2 outputs for latitude & longitude

        # Don't freeze earlier layers

        for param in self.efficientnet.features.parameters():

            param.requires_grad = True


    def forward(self, x):

        return self.efficientnet(x)  # Forward pass through EfficientNet


from huggingface_hub import hf_hub_download

import torch

path_name = "efficientnet_gps_regressor_complete_changed_betas_v2.pth"
repo_name = "CustomGPSModel_EfficientNetV2_Run3"

organization_name = "LAJ-519-Image-Project"

# Specify the repository and the filename of the model you want to load
repo_id = f"{organization_name}/{repo_name}"

filename = f"{path_name}"

model_path = hf_hub_download(repo_id=repo_id, filename=filename)



# Load the model using torch

model_test = torch.load(model_path)

model_test.eval()
```