Spaces:

0xZohar
/

object-assembler

Paused

App Files Files Community

0xZohar commited on Nov 13, 2025

Commit

6b6c80c

verified ·

1 Parent(s): 01fda47

Update README.md for ZeroGPU (sdk_version 5.12.0)

Browse files

Files changed (1) hide show

README.md +78 -7

README.md CHANGED Viewed

@@ -1,12 +1,83 @@
 ---
-title: Object Assembler
-emoji: 🌍
-colorFrom: gray
-colorTo: purple
 sdk: gradio
-sdk_version: 5.49.1
-app_file: app.py
 pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: ObjectAssembler - LEGO 3D Assembly Prediction
+emoji: 🧱
+colorFrom: blue
+colorTo: red
 sdk: gradio
+sdk_version: 5.12.0
+app_file: code/demo.py
 pinned: false
+license: mit
+python_version: 3.10
+preload_from_hub:
+  - 0xZohar/object-assembler-models save_shape_cars_whole_p_rot_scratch_4mask_randp.safetensors
+  - openai/clip-vit-base-patch32
 ---
+# ObjectAssembler: Assemble Your Object with Diverse Components
+A deep learning-based system for predicting and visualizing LEGO brick assembly sequences using transformer models.
+## Features
+- **LDR File Processing**: Upload `.ldr` (LDraw format) files containing LEGO brick assembly instructions
+- **Text-to-LEGO**: Describe your desired design in natural language (e.g., "red sports car")
+- **AI-Powered Prediction**: Uses GPT-based transformer models to predict optimal assembly sequences
+- **3D Rendering**: Generates high-quality 3D visualizations using Blender
+- **Part Visualization**: Displays individual brick components with rendered previews
+- **Interactive UI**: User-friendly Gradio interface for easy interaction
+## Model Architecture
+- **GPT Model**: 5-layer transformer with 12 attention heads
+- **Shape Tokenizer**: VQ-VAE autoencoder with 16,384 token vocabulary
+- **CLIP Integration**: Vision-language model for text-to-design retrieval
+- **Model Size**: ~1.6GB total weights
+## Usage
+### Option 1: Upload LDR File
+1. Upload an `.ldr` file containing LEGO part names
+2. Click "Load Input" to process and visualize individual parts
+3. Click "Generate New LDR & Render" to predict assembly sequence
+4. Download the generated LDR file or view the 3D rendering
+### Option 2: Text-to-LEGO
+1. Enter a text description (e.g., "red sports car", "blue police car")
+2. Click "Load Input" to retrieve a similar design
+3. Click "Generate New LDR & Render" to generate assembly sequence
+4. View the AI-generated LEGO model
+## Hardware Requirements
+- **GPU**: Runs on HuggingFace ZeroGPU (H200, 70GB VRAM)
+- **Memory**: At least 8GB RAM
+- **Storage**: ~2GB for model weights and dependencies
+## Technical Stack
+- PyTorch 2.0+
+- Gradio 5.12.0+
+- Blender 3.6 (for rendering)
+- Transformers (Hugging Face)
+- CLIP (OpenAI)
+- **ZeroGPU** for efficient GPU allocation
+## Citation
+If you use this work in your research, please cite:
+```bibtex
+@article{objectassembler2024,
+  title={ObjectAssembler: Assemble Your Object with Diverse Components},
+  year={2024}
+}
+```
+## License
+MIT License
+## Acknowledgments
+Built with PyTorch, Gradio, Blender, and HuggingFace ZeroGPU.