| license: apache-2.0 | |
| language: | |
| - en | |
| - zh | |
| # GarmentGPT Models | |
| This repository contains all the necessary model components for the GarmentGPT project. | |
| ## Models Included | |
| This repository hosts three key components: | |
| 1. **Vision-Language Model (LLM)**: A fine-tuned multi-modal model responsible for generating discrete garment tokens from an input image. | |
| 2. **Edge Codec**: A VQ-VAE-based model for decoding edge indices into high-fidelity geometric curves. The configuration is in `codec_config.yaml` and weights are in `codec_model.pth`. | |
| 3. **RT Codec**: A VQ-VAE-based model for decoding location indices into 3D panel rotation and translation. The configuration is in `rt_config.yaml` and weights are in `rt_model.pth`. | |
| ## Usage | |
| These models are designed to be used with the main application code available at [Your GitHub Repo URL]. The inference script will automatically download these files. | |