Spaces: 13hbeltran/Animal_Classification

13hbeltran committed on Feb 3

Commit 7e35296 (verified) · 1 Parent(s): adf685b

Upload 3 files

Files changed (3)

app.py +33 -0
readme.md +47 -0
requirements.txt +4 -0

app.py ADDED

	@@ -0,0 +1,33 @@

+import gradio as gr
+from transformers import pipeline
+from PIL import Image
+
+# Load image classification pipeline
+classifier = pipeline(
+    task="image-classification",
+    model="google/vit-base-patch16-224"
+)
+
+def classify_image(image):
+    if image is None:
+        return "No image provided."
+
+    # Convert to PIL Image if needed
+    if not isinstance(image, Image.Image):
+        image = Image.fromarray(image)
+
+    results = classifier(image)
+
+    return {r["label"]: r["score"] for r in results}
+
+# Gradio interface
+app = gr.Interface(
+    fn=classify_image,
+    inputs=gr.Image(type="pil", label="Upload an Animal Image"),
+    outputs=gr.Label(label="Prediction"),
+    title="Animal Image Classification",
+    description="Upload an image of an animal and the model will predict what it is."
+)
+
+if __name__ == "__main__":
+    app.launch()
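
Note on the return value of classify_image: the image-classification pipeline returns a list of dicts with "label" and "score" keys, and the final dict comprehension reshapes that list into the label-to-confidence mapping that gr.Label displays. The sketch below reproduces only that reshaping, without loading the model. The names `fake_results` and `format_predictions` are hypothetical, and the scores are invented stand-ins, not real model output.

```python
# Sketch only: stand-in data in the shape the pipeline returns
# (a list of dicts with "label" and "score"). Values are invented.
fake_results = [
    {"label": "tabby, tabby cat", "score": 0.62},
    {"label": "tiger cat", "score": 0.21},
    {"label": "Egyptian cat", "score": 0.09},
]


def format_predictions(results):
    """Map each predicted label to its score (hypothetical helper)."""
    return {r["label"]: float(r["score"]) for r in results}


predictions = format_predictions(fake_results)

# The label with the highest score is the one gr.Label would show first.
top_label = max(predictions, key=predictions.get)
print(top_label)
```

Keeping this step in a small pure function makes it possible to check the output shape without downloading the model weights.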

readme.md ADDED

	@@ -0,0 +1,47 @@

+Model Card: Vision Transformer (ViT) for Animal Image Classification
+Model Description
+This application uses a pretrained Vision Transformer (ViT) model from Hugging Face for animal image classification. Vision Transformers adapt the transformer architecture, originally developed for NLP models such as BERT, to image data by splitting each image into fixed-size patches and processing the patches as a sequence of tokens.
+The model, google/vit-base-patch16-224, was pretrained on ImageNet-21k and fine-tuned on ImageNet-1k, and is used as-is for inference. Images are resized to 224×224 pixels, which matches the model’s expected input size. No additional fine-tuning was performed for this assignment.
+The goal of this project is to demonstrate how a pretrained computer vision model can be deployed as a simple interactive application that accepts an animal image and returns a predicted class.
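
As a rough illustration of the patch-based processing described above, the arithmetic can be sketched in a few lines. The image size (224) and patch size (16) are read off the model name google/vit-base-patch16-224 used in app.py; `patch_grid` is a hypothetical helper written for this note, not part of any library.

```python
def patch_grid(image_size: int, patch_size: int) -> tuple[int, int]:
    """Return (patches per side, total patches) for a square image."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    per_side = image_size // patch_size
    return per_side, per_side * per_side


# Sizes taken from the model name: 16x16 patches on a 224x224 input.
per_side, total = patch_grid(image_size=224, patch_size=16)
print(per_side, total)  # 14 196
```

So each 224×224 image becomes a 14×14 grid of 196 patches. ViT also prepends one classification token, which gives a sequence of 197 tokens.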
+Intended Uses & Limitations
+Intended Uses
+Animal Image Classification:
+Classify images of animals using a pretrained vision model.
+Educational Demonstration:
+Showcase how Hugging Face models and Spaces can be used to build and deploy a simple ML application.
+Limitations
+The model was not fine-tuned specifically for animals: it predicts the 1,000 ImageNet classes, which also include non-animal objects, so predictions may be inaccurate or off-topic for uncommon species or low-quality images.
+Results depend heavily on image clarity, lighting, and background.
+This application is intended for demonstration and learning, not production use.
+How to Use
+Upload an image of an animal using the interface.
+The application preprocesses the image and returns the model’s top predicted labels with their confidence scores.
+Internally, the app uses the Hugging Face image-classification pipeline to handle preprocessing, inference, and output formatting.
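
The output-formatting step mentioned above can be pictured roughly as a softmax over the model's raw scores followed by keeping the highest-scoring labels (the pipeline returns the top five by default, as far as I know). The plain-Python sketch below illustrates that idea on toy data; it is not the library's code, and `softmax`, `top_k`, the labels, and the logits are all hypothetical.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(labels, logits, k=5):
    """Return the k highest-probability predictions as label/score dicts."""
    probs = softmax(logits)
    ranked = sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)
    return [{"label": label, "score": score} for label, score in ranked[:k]]


# Invented labels and logits, for illustration only.
toy_labels = ["cat", "dog", "fox"]
toy_logits = [2.0, 1.0, 0.1]
toy_results = top_k(toy_labels, toy_logits, k=2)
print(toy_results)
```

The resulting list has the same label/score shape that classify_image in app.py turns into a dict for the Gradio label component.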
+Training Data
+This project does not train a new model.
+It relies on a pretrained Vision Transformer (google/vit-base-patch16-224) that was trained on publicly available ImageNet data: pretrained on ImageNet-21k, then fine-tuned on ImageNet-1k.
+Notes
+This Space is part of a coursework assignment focused on:
+Using pretrained models responsibly
+Understanding model inputs and outputs
+Deploying simple ML applications locally and via Hugging Face Spaces

requirements.txt ADDED

	@@ -0,0 +1,4 @@

+gradio
+transformers
+torch
+pillow