File size: 936 Bytes
e51dbc6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f9442a2
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
---
license: apache-2.0
language:
- en
- zh
---

# GarmentGPT Models

This repository contains all the necessary model components for the GarmentGPT project.

## Models Included

This repository hosts three key components:

1.  **Vision-Language Model (LLM)**: A fine-tuned multi-modal model responsible for generating discrete garment tokens from an input image.
2.  **Edge Codec**: A VQ-VAE-based model for decoding edge indices into high-fidelity geometric curves. The configuration is in `codec_config.yaml` and weights are in `codec_model.pth`.
3.  **RT Codec**: A VQ-VAE-based model for decoding location indices into 3D panel rotation and translation. The configuration is in `rt_config.yaml` and weights are in `rt_model.pth`.

## Usage

These models are designed to be used with the main application code available at https://github.com/ChimerAI-MMLab/Garment-GPT. The inference script will automatically download these files.