aslakey
/

depth_of_field

dinov2_with_registers

Model card Files Files and versions

depth_of_field / README.md

aslakey's picture

Update README.md

8a30ae0 verified 8 months ago

|

history blame contribute delete

1.1 kB

	---
	license: apache-2.0
	---

	# Deoth of Field

	This model predicts an image's cinematic depth of field [deep, shallow]. The model is a DinoV2 with registers backbone (initiated with `facebook/dinov2-with-registers-large` weights) and trained on a diverse set of five thousand human-annotated images.

	## How to use:
	```python

	import torch
	from PIL import Image
	from transformers import AutoImageProcessor
	from transformers import AutoModelForImageClassification

	image_processor = AutoImageProcessor.from_pretrained("facebook/dinov2-with-registers-large")
	model = AutoModelForImageClassification.from_pretrained('aslakey/depth_of_field')
	model.eval()

	# Model labels: [deep, shallow]
	image = Image.open('cinematic_shot.jpg')
	inputs = image_processor(image, return_tensors="pt")
	with torch.no_grad():
	outputs = model(**inputs)

	predicted_label = outputs.logits.argmax(-1).item()
	print(model.config.id2label[predicted_label])
	```

	## Performance:


	\| Category \| Precision \| Recall \|
	\|----------\|-----------\|--------\|
	\| deep \| 85% \| 77% \|
	\| shallow \| 75% \| 84% \|