ChimerAI
/

GarmentGPT

Model card Files Files and versions

GarmentGPT / README.md

Mathewgar's picture

Update README.md

f9442a2 verified 4 months ago

|

history blame contribute delete

936 Bytes

	---
	license: apache-2.0
	language:
	- en
	- zh
	---

	# GarmentGPT Models

	This repository contains all the necessary model components for the GarmentGPT project.

	## Models Included

	This repository hosts three key components:

	1. Vision-Language Model (LLM): A fine-tuned multi-modal model responsible for generating discrete garment tokens from an input image.
	2. Edge Codec: A VQ-VAE-based model for decoding edge indices into high-fidelity geometric curves. The configuration is in `codec_config.yaml` and weights are in `codec_model.pth`.
	3. RT Codec: A VQ-VAE-based model for decoding location indices into 3D panel rotation and translation. The configuration is in `rt_config.yaml` and weights are in `rt_model.pth`.

	## Usage

	These models are designed to be used with the main application code available at https://github.com/ChimerAI-MMLab/Garment-GPT. The inference script will automatically download these files.