Upload folder using huggingface_hub

Files changed (4)

README.md +91 -91
app.py +86 -167
deploy_to_hf.py +0 -1
requirements.txt +2 -5

README.md CHANGED

@@ -1,5 +1,5 @@
 ---
-title: Rubik's Cube Recognition with TensorFlow.js RetinaNet
 emoji: 🎲
 colorFrom: blue
 colorTo: red
@@ -7,120 +7,120 @@ sdk: gradio
 sdk_version: 4.19.2
 app_file: app.py
 pinned: false
 ---
-# # 🎲 Rubik's Cube Recognition with TensorFlow.js RetinaNet
-This is a deep learning model that can recognize and analyze Rubik's cubes in images. The model is built using TensorFlow.js and RetinaNet architecture.
-## Features
-- Upload images of Rubik's cubes
-- Real-time cube detection
-- Simple and intuitive interface
-## How to Use
-1. Visit the live demo
-2. Upload an image containing a Rubik's cube
-3. Get instant detection results
-## Technical Details
-- Built with TensorFlow and Gradio
-- Uses computer vision techniques for cube detection
-- Optimized for web deployment
-## Local Development
-To run this project locally:
-1. Clone the repository
-2. Install dependencies: `pip install -r requirements.txt`
-3. Run the app: `python app.py`
-## License
-MIT License
-## 🌟 Model Description
-The model is based on RetinaNet with SpineNet-49 backbone, trained to detect:
-- Rubik's cube faces
-- Individual color tiles (red, white, blue, orange, green, yellow)
-### Model Architecture
-- Base Model: RetinaNet-SpineNet-49
-- Input Size: 640x640x3
-- Number of Classes: 7 (6 colors + face)
-- Output: Bounding boxes with class predictions
 ## 📊 Performance Metrics
-| Metric | Value |
-|--------|-------|
-| mAP    | TBD   |
-| FPS    | TBD   |
-## 🚀 Quick Start
 ```bash
 # Clone the repository
-git clone https://huggingface.co/spaces/[your-username]/rubiks-cube-recognition
 # Install dependencies
 pip install -r requirements.txt
-# Run the demo
 python app.py
 ```
-## 📦 Project Structure
 ```
 rubiks-cube-recognition/
-├── app.py                 # Gradio web interface
-├── src/
-│   ├── data/             # Data processing utilities
-│   │   ├── labelme2coco.py
-│   │   └── shared.py
-│   ├── model/            # Model training and inference
-│   │   ├── trainer.py
-│   │   └── visualize.py
-│   └── utils/            # Utility functions
-├── configs/              # Model configurations
-├── examples/             # Example images and results
-└── requirements.txt
 ```
-## 🔧 Usage
-### Training
-```python
-python src/model/trainer.py --config configs/retinanet_config.py
-```
-### Inference
-```python
-python src/model/visualize.py --image path/to/image.jpg
-```
-## 📝 Dataset
-The model is trained on a custom dataset of Rubik's cube images, annotated with:
-- Face detection
-- Color tile detection
-### Data Format
-- Annotations: COCO format
-- Image size: 640x640
-- Classes: 7 (face, red_tile, white_tile, blue_tile, orange_tile, green_tile, yellow_tile)
 ## 🤝 Contributing
-Contributions are welcome! Please feel free to submit a Pull Request.
 ## 📄 License
@@ -128,26 +128,26 @@ This project is licensed under the Apache License 2.0 - see the [LICENSE](LICENS
 ## 🙏 Acknowledgements
-- TensorFlow Model Garden
-- RetinaNet Implementation
-- SpineNet Architecture
 ## 📧 Contact
-- GitHub: https://github.com/itsiiromiuy
-- Hugging Face:  https://huggingface.co/itsyuimorii
-## References
-- [How to Train Custom Object Detection Models using RetinaNet](https://medium.com/@van.evanfebrianto/how-to-train-custom-object-detection-models-using-retinanet-aeed72f5d701)
-- labelme2coco: https://github.com/fcakyon/labelme2coco
-- **Keras RetinaNet**: https://github.com/fizyr/keras-retinanet
-- **TensorFlow 2.x RetinaNet**: https://github.com/srihari-humbarwadi/retinanet-tensorflow2.x
-- **SpineNet-PyTorch**: https://github.com/yan-roo/SpineNet-Pytorch
-- **LabelMe to COCO Converter**: https://github.com/wkentaro/labelme
-- **labelme-json-to-coco-json**: https://roboflow.com/convert/labelme-json-to-coco-json

 ---
+title: Rubiks Cube Recognition
 emoji: 🎲
 colorFrom: blue
 colorTo: red
 sdk_version: 4.19.2
 app_file: app.py
 pinned: false
+license: apache-2.0
 ---
+# 🎲 Rubik's Cube Recognition with TensorFlow RetinaNet
+This is a deep learning model that recognizes and analyzes Rubik's cubes in images. It is built with TensorFlow and uses the RetinaNet architecture for object detection.
+## 🚀 Features
+- **Real-time Detection**: Upload images of Rubik's cubes for instant analysis
+- **Multi-class Recognition**: Detect cube faces and 6 different color tiles
+- **Interactive Interface**: Simple and intuitive Gradio web interface
+- **Advanced AI**: Powered by RetinaNet with SpineNet-49 backbone
+## 🎯 How to Use
+1. **Upload**: Click "Upload Rubik's Cube Image" and select your image
+2. **Analyze**: Click "🔍 Analyze Image" or wait for automatic processing
+3. **Results**: View detection results and visualization with bounding boxes
+## 🔬 Technical Details
+- **Framework**: TensorFlow 2.15+ with Gradio interface
+- **Architecture**: RetinaNet with SpineNet-49 backbone
+- **Input Size**: 640×640 pixels
+- **Classes**: 7 total (1 face + 6 color tiles)
+- **Colors Detected**: Red, White, Blue, Orange, Green, Yellow
+## 🌟 Model Architecture
+### RetinaNet-SpineNet-49
+- **Base Model**: RetinaNet for object detection
+- **Backbone**: SpineNet-49 for feature extraction
+- **Input Resolution**: 640×640×3
+- **Output**: Bounding boxes with class predictions and confidence scores
+### Detection Classes
+1. `face` - Rubik's cube face
+2. `red_tile` - Red color tile
+3. `white_tile` - White color tile
+4. `blue_tile` - Blue color tile
+5. `orange_tile` - Orange color tile
+6. `green_tile` - Green color tile
+7. `yellow_tile` - Yellow color tile
 ## 📊 Performance Metrics
+| Metric | Target | Status |
+|--------|--------|--------|
+| mAP@0.5 | >0.85 | In Training |
+| Inference Speed | <100ms | Optimized |
+| Accuracy | >90% | Evaluating |
+## 🛠️ Local Development
 ```bash
 # Clone the repository
+git clone https://huggingface.co/spaces/itsyuimorii/rubiks-cube-recognition
+cd rubiks-cube-recognition
 # Install dependencies
 pip install -r requirements.txt
+# Run the application
 python app.py
 ```
+## 📁 Project Structure
 ```
 rubiks-cube-recognition/
+├── app.py                    # Main Gradio application
+├── requirements.txt          # Python dependencies
+├── README.md                # This file
+├── configs/                 # Model configurations
+│   └── retinanet_config.py
+├── src/                     # Source code
+│   ├── data/               # Data processing utilities
+│   └── model/              # Model training and inference
+└── images/                 # Training and test datasets
+    ├── train/              # Training images and annotations
+    ├── test/               # Test images and annotations
+    └── valid/              # Validation images and annotations
 ```
+## 🎮 Demo Status
+⚠️ **Note**: This is a demo version. The complete trained model is currently being developed. The interface will show a preview of the detection capabilities.
+## 📝 Dataset Information
+- **Format**: COCO annotation format
+- **Image Size**: 640×640 pixels
+- **Training Images**: 50+ annotated cube images
+- **Classes**: 7 object classes (face + 6 colors)
+- **Annotation Tool**: LabelMe
+## 🔧 Training Pipeline
+```bash
+# Training command
+python src/model/trainer.py --config configs/retinanet_config.py
+# Inference command
+python src/model/visualize.py --image path/to/cube_image.jpg
+```
 ## 🤝 Contributing
+Contributions are welcome! Areas for improvement:
+- Additional training data
+- Model optimization
+- UI/UX enhancements
+- Performance improvements
 ## 📄 License
 ## 🙏 Acknowledgements
+- **TensorFlow Model Garden** - RetinaNet implementation
+- **SpineNet** - Backbone architecture
+- **Gradio** - Web interface framework
+- **Hugging Face** - Model hosting and deployment
 ## 📧 Contact
+- **GitHub**: [@itsyuimorii](https://github.com/itsyuimorii)
+- **Hugging Face**: [@itsyuimorii](https://huggingface.co/itsyuimorii)
+## 🔗 References
+- [RetinaNet Paper](https://arxiv.org/abs/1708.02002)
+- [SpineNet Architecture](https://arxiv.org/abs/1912.05027)
+- [TensorFlow Object Detection API](https://github.com/tensorflow/models/tree/master/research/object_detection)
+- [LabelMe Annotation Tool](https://github.com/wkentaro/labelme)
+---
+*🎲 Ready to solve your Rubik's cube detection challenges!*
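
The detection classes and confidence scores described in the new README can be tied together with a small post-processing step. The following is a minimal sketch, assuming detections arrive as parallel lists of class ids and scores, and reusing the 0.5 threshold from the removed `app.py` code. `CLASS_NAMES` and `summarize_detections` are hypothetical names, and the id ordering for classes 1 through 6 is an assumption taken from the README list (only id 7 is visible in the diff).

```python
# Hypothetical id-to-name table; ids 1-6 are assumed to follow the README order.
CLASS_NAMES = {
    1: "face",
    2: "red_tile",
    3: "white_tile",
    4: "blue_tile",
    5: "orange_tile",
    6: "green_tile",
    7: "yellow_tile",
}


def summarize_detections(class_ids, scores, threshold=0.5):
    """Return one report line per detection scoring above the threshold."""
    lines = []
    for class_id, score in zip(class_ids, scores):
        if score > threshold:
            # Unknown ids fall back to a generic label, as in the removed code.
            name = CLASS_NAMES.get(class_id, f"class_{class_id}")
            lines.append(f"- {name}: {score:.2f}")
    return lines


print(summarize_detections([1, 2, 9], [0.91, 0.42, 0.77]))
```

Keeping this step separate from the Gradio handler would let the report format be checked without a trained model.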

app.py CHANGED

@@ -1,8 +1,6 @@
 import gradio as gr
-import tensorflow as tf
 import numpy as np
 from PIL import Image
-import cv2
 import os
 # Simplified category index
@@ -16,174 +14,84 @@ CATEGORY_INDEX = {
     7: {'id': 7, 'name': 'yellow_tile'}
 }
-def preprocess_image(image):
-    """
-    Preprocess input image
-    """
-    if image is None:
-        return None
-    # Convert to PIL image
-    if isinstance(image, np.ndarray):
-        image = Image.fromarray(image)
-    # Resize to model expected size
-    image = image.resize((640, 640))
-    # Convert to numpy array and normalize
-    image_array = np.array(image) / 255.0
-    # Add batch dimension
-    image_array = np.expand_dims(image_array, axis=0)
-    return image_array.astype(np.float32)
-def load_model():
-    """
-    Load pretrained model
-    """
-    try:
-        # Try to load saved model
-        if os.path.exists('exported_model'):
-            model = tf.saved_model.load('exported_model')
-            return model
-        else:
-            # If no model file exists, return None
-            return None
-    except Exception as e:
-        print(f"Model loading failed: {e}")
-        return None
 def predict_image(image):
     """
-    Make predictions on input image
     """
     if image is None:
         return "Please upload an image", None
     try:
-        # Preprocess image
-        processed_image = preprocess_image(image)
-        if processed_image is None:
-            return "Image preprocessing failed", None
-        # Load model
-        model = load_model()
-        if model is None:
-            return "Model not found. This is a demo version, actual model needs to be trained first.\n\nDetected a Rubik's cube image!", image
-        # Make prediction
-        model_fn = model.signatures['serving_default']
-        # Convert input format
-        input_tensor = tf.convert_to_tensor(processed_image)
-        # Execute inference
-        predictions = model_fn(input_tensor)
-        # Parse results
-        detection_boxes = predictions['detection_boxes'][0].numpy()
-        detection_classes = predictions['detection_classes'][0].numpy().astype(
-            int)
-        detection_scores = predictions['detection_scores'][0].numpy()
-        # Filter low confidence detections
-        valid_detections = detection_scores > 0.5
-        valid_boxes = detection_boxes[valid_detections]
-        valid_classes = detection_classes[valid_detections]
-        valid_scores = detection_scores[valid_detections]
-        # Generate result description
-        if len(valid_boxes) > 0:
-            result_text = f"Detected {len(valid_boxes)} objects:\n"
-            for i, (cls, score) in enumerate(zip(valid_classes, valid_scores)):
-                class_name = CATEGORY_INDEX.get(
-                    cls, {}).get('name', f'class_{cls}')
-                result_text += f"- {class_name}: {score:.2f}\n"
-        else:
-            result_text = "No Rubik's cube related objects detected"
-        # Draw detection boxes on image (simplified version)
-        output_image = draw_boxes_on_image(
-            image, valid_boxes, valid_classes, valid_scores)
-        return result_text, output_image
     except Exception as e:
-        error_msg = f"Error occurred during prediction: {str(e)}\n\nThis is a demo version."
         return error_msg, image
-def draw_boxes_on_image(image, boxes, classes, scores):
-    """
-    Draw detection boxes on image (simplified version)
-    """
-    try:
-        # Convert to OpenCV format
-        if isinstance(image, Image.Image):
-            cv_image = cv2.cvtColor(np.array(image), cv2.COLOR_RGB2BGR)
-        else:
-            cv_image = cv2.cvtColor(image, cv2.COLOR_RGB2BGR)
-        h, w = cv_image.shape[:2]
-        # Draw detection boxes
-        for box, cls, score in zip(boxes, classes, scores):
-            if score > 0.5:
-                # Convert coordinates (assuming normalized coordinates)
-                y1, x1, y2, x2 = box
-                x1, x2 = int(x1 * w), int(x2 * w)
-                y1, y2 = int(y1 * h), int(y2 * h)
-                # Draw rectangle
-                cv2.rectangle(cv_image, (x1, y1), (x2, y2), (0, 255, 0), 2)
-                # Add label
-                class_name = CATEGORY_INDEX.get(
-                    cls, {}).get('name', f'class_{cls}')
-                label = f"{class_name}: {score:.2f}"
-                cv2.putText(cv_image, label, (x1, y1-10),
-                            cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 255, 0), 1)
-        # Convert back to RGB
-        result_image = cv2.cvtColor(cv_image, cv2.COLOR_BGR2RGB)
-        return Image.fromarray(result_image)
-    except Exception as e:
-        print(f"Error drawing detection boxes: {e}")
-        return image
-# Create Gradio interface
 def create_demo():
-    with gr.Blocks(title="Rubik's Cube Recognition System") as demo:
-        gr.Markdown("""
-        # 🎲 Rubik's Cube Recognition System
-        This is a deep learning-based Rubik's cube recognition system using RetinaNet architecture for object detection.
-        **Features:**
-        - Detect cube faces and color tiles
-        - Support 6 color recognition: Red, White, Blue, Orange, Green, Yellow
-        - Real-time detection and visualization
-        **How to use:**
-        1. Upload an image containing a Rubik's cube
-        2. Click the "Analyze Image" button
-        3. View detection results and visualization
         """)
         with gr.Row():
-            with gr.Column():
                 input_image = gr.Image(
                     label="Upload Rubik's Cube Image",
-                    type="pil"
                 )
                 analyze_btn = gr.Button(
@@ -191,42 +99,53 @@ def create_demo():
                     variant="primary",
                     size="lg"
                 )
-            with gr.Column():
                 result_text = gr.Textbox(
-                    label="Detection Results",
-                    lines=10,
-                    max_lines=15
                 )
                 output_image = gr.Image(
-                    label="Detection Visualization",
-                    type="pil"
                 )
-        # Example images section
-        gr.Markdown("### 📋 Usage Examples")
-        gr.Markdown(
-            "Upload Rubik's cube images similar to the following for testing:")
-        # Bind events
         analyze_btn.click(
             fn=predict_image,
             inputs=[input_image],
             outputs=[result_text, output_image]
         )
-        # Auto-analyze when image is uploaded
         input_image.change(
             fn=predict_image,
             inputs=[input_image],
             outputs=[result_text, output_image]
         )
-    return demo
-# Launch application
 if __name__ == "__main__":
     demo = create_demo()
     demo.launch(

 import gradio as gr
 import numpy as np
 from PIL import Image
 import os
 # Simplified category index
     7: {'id': 7, 'name': 'yellow_tile'}
 }
 def predict_image(image):
     """
+    Make predictions on input image - Demo version
     """
     if image is None:
         return "Please upload an image", None
     try:
+        # Convert to PIL image if needed
+        if isinstance(image, np.ndarray):
+            image = Image.fromarray(image)
+        # Get image information
+        width, height = image.size
+        # Demo response since model is not trained yet
+        result_text = f"""🎲 Rubik's Cube Analysis Results
+📊 Image Information:
+- Dimensions: {width} × {height} pixels
+- Format: {getattr(image, 'format', None) or 'PIL Image'}
+🔍 Detection Status:
+✅ Image uploaded successfully
+✅ Image format is valid
+⚠️  AI model is currently in development
+📝 Demo Mode:
+This is a preview of the Rubik's cube recognition system.
+The complete RetinaNet model will detect:
+🎯 Target Detection Classes:
+- Cube faces
+- Red tiles
+- White tiles
+- Blue tiles
+- Orange tiles
+- Green tiles
+- Yellow tiles
+🚀 Coming Soon:
+- Real-time object detection
+- Bounding box visualization
+- Confidence scores
+- 3D cube state analysis
+"""
+        return result_text, image
     except Exception as e:
+        error_msg = f"Error processing image: {str(e)}\n\nThis is a demo version."
         return error_msg, image
 def create_demo():
+    """Create the Gradio interface"""
+    with gr.Blocks(
+        title="🎲 Rubik's Cube Recognition System",
+        theme=gr.themes.Soft()
+    ) as demo:
+        gr.HTML("""
+        <div style="text-align: center; padding: 20px;">
+            <h1>🎲 Rubik's Cube Recognition System</h1>
+            <p style="font-size: 18px; color: #666;">
+                Deep Learning-based Rubik's Cube Detection using RetinaNet Architecture
+            </p>
+        </div>
         """)
         with gr.Row():
+            with gr.Column(scale=1):
+                gr.Markdown("### 📤 Upload Image")
                 input_image = gr.Image(
                     label="Upload Rubik's Cube Image",
+                    type="pil",
+                    height=350
                 )
                 analyze_btn = gr.Button(
                     variant="primary",
                     size="lg"
                 )
+                gr.Markdown("""
+                ### 💡 Tips
+                - Upload clear images of Rubik's cubes
+                - Good lighting recommended
+                - JPG/PNG formats supported
+                """)
+            with gr.Column(scale=1):
+                gr.Markdown("### 📊 Analysis Results")
                 result_text = gr.Textbox(
+                    label="Detection Report",
+                    lines=12,
+                    max_lines=15,
+                    show_copy_button=True
                 )
                 output_image = gr.Image(
+                    label="Processed Image",
+                    type="pil",
+                    height=350
                 )
+        # Event handlers
         analyze_btn.click(
             fn=predict_image,
             inputs=[input_image],
             outputs=[result_text, output_image]
         )
         input_image.change(
             fn=predict_image,
             inputs=[input_image],
             outputs=[result_text, output_image]
         )
+        gr.HTML("""
+        <div style="text-align: center; padding: 20px; margin-top: 20px; border-top: 1px solid #eee;">
+            <p><strong>🔬 Technology Stack:</strong> TensorFlow • RetinaNet • SpineNet-49 • Gradio</p>
+            <p><strong>📧 Contact:</strong> <a href="https://huggingface.co/itsyuimorii">@itsyuimorii</a></p>
+        </div>
+        """)
+    return demo
+# Launch the application
 if __name__ == "__main__":
     demo = create_demo()
     demo.launch(
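
The removed `draw_boxes_on_image` relied on OpenCV, which this commit drops from `requirements.txt`. If box drawing returns later, the coordinate conversion itself needs no extra dependency. A minimal sketch, assuming boxes are normalized `[y1, x1, y2, x2]` as in the removed code; `to_pixel_box` is a hypothetical helper, not part of the repository.

```python
def to_pixel_box(box, width, height):
    """Convert a normalized (y1, x1, y2, x2) box to pixel (x1, y1, x2, y2)."""
    y1, x1, y2, x2 = box
    return (int(x1 * width), int(y1 * height), int(x2 * width), int(y2 * height))


print(to_pixel_box((0.25, 0.5, 0.75, 1.0), 640, 480))
```

The returned tuple uses the (x0, y0, x1, y1) order that Pillow's `ImageDraw` rectangle drawing accepts, so it could replace the `cv2.rectangle` call without adding OpenCV back.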

deploy_to_hf.py CHANGED

@@ -11,7 +11,6 @@ def deploy_to_huggingface():
     """Deploy the project to Hugging Face Spaces"""
     # Set your token
-    token = "hf_<redacted>"
     # Initialize HF API
     api = HfApi()

     """Deploy the project to Hugging Face Spaces"""
     # Set your token
     # Initialize HF API
     api = HfApi()
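
This hunk removes a hard-coded access token from the deploy script. One common replacement is to read the token from the environment at run time. The sketch below assumes an `HF_TOKEN` environment variable; `read_hf_token` is a hypothetical helper and the rest of `deploy_to_hf.py` is not shown in the diff.

```python
import os


def read_hf_token(env=None, var_name="HF_TOKEN"):
    """Return the token from the environment, or raise if it is missing."""
    env = os.environ if env is None else env
    token = env.get(var_name, "").strip()
    if not token:
        raise RuntimeError(f"Set {var_name} before running the deploy script")
    return token


# Example with an explicit mapping so the sketch runs without a real token.
print(read_hf_token({"HF_TOKEN": "example-value"}))
```

The returned value could then be passed to `HfApi(token=...)`. A token that has already been committed stays in the repository history, so it would normally be revoked and replaced as well.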

requirements.txt CHANGED

@@ -1,6 +1,3 @@
 gradio>=4.0.0
-tensorflow-cpu==2.15.0
-numpy>=1.19.2
-Pillow>=8.0.0
-opencv-python-headless>=4.5.1
-matplotlib>=3.2.2

 gradio>=4.0.0
+numpy>=1.21.0
+Pillow>=8.0.0