depth_of_field / README.md
aslakey's picture
Update README.md
8a30ae0 verified
---
license: apache-2.0
---
# Deoth of Field
This model predicts an image's cinematic depth of field [deep, shallow]. The model is a DinoV2 with registers backbone (initiated with `facebook/dinov2-with-registers-large` weights) and trained on a diverse set of five thousand human-annotated images.
## How to use:
```python
import torch
from PIL import Image
from transformers import AutoImageProcessor
from transformers import AutoModelForImageClassification
image_processor = AutoImageProcessor.from_pretrained("facebook/dinov2-with-registers-large")
model = AutoModelForImageClassification.from_pretrained('aslakey/depth_of_field')
model.eval()
# Model labels: [deep, shallow]
image = Image.open('cinematic_shot.jpg')
inputs = image_processor(image, return_tensors="pt")
with torch.no_grad():
outputs = model(**inputs)
predicted_label = outputs.logits.argmax(-1).item()
print(model.config.id2label[predicted_label])
```
## Performance:
| Category | Precision | Recall |
|----------|-----------|--------|
| deep | 85% | 77% |
| shallow | 75% | 84% |