AventIQ-AI
/

resnet18-cataract-detection-system

Model card Files Files and versions

xet

Community

ayushsinha commited on Mar 5, 2025

Commit

e8dd40e

verified ·

1 Parent(s): c6b5d9d

Create README.md

Browse files

Files changed (1) hide show

README.md +117 -0

README.md ADDED Viewed

	@@ -0,0 +1,117 @@

+### **🩺 ResNet-18 Cataract Detection System**
+This repository hosts a quantized version of **ResNet-18-based** model optimized for **cataract detection** having two labels either normal or cataract. The model detects images into these 2 labels.
+---
+## **📌 Model Details**
+- **Model Architecture**: ResNet-18
+- **Task**: Cataract Detection System
+- **Dataset**: Cataract Dataset ([Kaggle](https://www.kaggle.com/datasets/nandanp6/cataract-image-dataset))
+- **Framework**: PyTorch
+- **Input Image Size**: 224x224
+- **Number of Classes**: 2
+---
+## **🚀 Usage**
+### **Installation**
+```bash
+pip install torch torchvision pillow
+```
+### **Loading the Model**
+```python
+import torch
+import torchvision.models as models
+from huggingface_hub import hf_hub_download
+import json
+from PIL import Image
+import torchvision.transforms as transforms
+device = "cuda" if torch.cuda.is_available() else "cpu"
+weights_path = hf_hub_download(repo_id="AventIQ-AI/resnet18-cataract-detection-system", filename="cataract_detection_resnet18_quantized.pth")
+labels_path = hf_hub_download(repo_id="AventIQ-AI/resnet18-cataract-detection-system", filename="class_names.json")
+with open(labels_path, "r") as f:
+    class_labels = json.load(f)
+model = models.resnet18(pretrained=False)
+num_classes = len(class_labels)
+model.fc = torch.nn.Linear(in_features=512, out_features=num_classes)
+model.load_state_dict(torch.load(weights_path, map_location=torch.device('cpu')))
+model.eval()
+print("Model loaded successfully!")
+```
+---
+### **🔍 Perform Classification**
+```python
+transform = transforms.Compose([
+    transforms.Resize((224, 224)),
+    transforms.ToTensor(),
+    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225])
+])
+def predict_image(image_path):
+    image = Image.open(image_path).convert("RGB")
+    image = transform(image).unsqueeze(0).to(device)  # Add batch dimension
+    with torch.no_grad():
+        outputs = model(image)
+        _, predicted_class = torch.max(outputs, 1)
+    predicted_label = class_labels[predicted_class.item()]
+    print(f"Predicted Class: {predicted_label}")
+# Example usage:
+image_path = "your_image_path"
+predict_image(image_path)
+```
+## **📊 Evaluation Results**
+After fine-tuning, the model was evaluated on the **Chest X-ray Pneumonia Dataset**, achieving the following performance:
+| **Metric**        | **Score** |
+|------------------|----------|
+| **Accuracy**      | 97.52%    |
+| **Precision**     | 98.31%    |
+| **Recall**        | 96.67%    |
+| **F1-Score**      | 97.48%    |
+---
+## **🔧 Fine-Tuning Details**
+### **Dataset**
+The model was trained on **Cataract Dataset** having two labels.
+### **Training Configuration**
+- **Number of epochs**: 10
+- **Batch size**: 32
+- **Optimizer**: Adam
+- **Learning rate**: 1e-4
+- **Loss Function**: Cross-Entropy
+- **Evaluation Strategy**: Validation at each epoch
+---
+## **⚠️ Limitations**
+- **Misclassification risk**: The model may produce **false positives or false negatives**. Always verify results with a radiologist.
+- **Dataset bias**: Performance may be affected by **dataset distribution**. It may not generalize well to **different populations**.
+- **Black-box nature**: Like all deep learning models, it does not explain why a prediction was made.
+---