balgot
/

text-2-stylegan3

face-generation

Model card Files Files and versions

text-2-stylegan3 / README.md

balgot's picture

Update README.md

04e7ebf over 2 years ago

|

history blame contribute delete

1.86 kB

	---
	license: openrail
	datasets:
	- balgot/stylegan3-annotated
	language:
	- en
	metrics:
	- mse
	tags:
	- face-generation
	- stylegan3
	library_name: pytorch
	---

	# Text-to-StyleGAN3 Latent Space Translation

	This model was created as a part of the project for FI:PA228 (Masaryk University),
	inspired by this paper: [Face Generation from Textual Features using Conditionally trained Inputs to Generative Adversarial Networks](https://arxiv.org/abs/2301.09123)

	It was trained on the generated dataset from BLIP and StyleGAN3. See the [corresponding notebook](https://colab.research.google.com/drive/14rDcCc0Xr1L1Ax3aKezEhmcn81vXGVQ7?usp=sharing)
	for further details.


	## How to use:


	```python
	import torch.nn as nn


	# for now, the model class needs to be defined, so...
	class LaTran(nn.Module):
	def __init__(self, args, *kwargs):
	super().__init__(args, *kwargs)
	self.pipe = nn.Sequential(
	nn.Linear(384, 512),
	nn.ReLU(),
	nn.Linear(512, 512)
	)

	def forward(self, v):
	return self.pipe(v.unsqueeze(1))



	# Instantiate and load the model
	dev = ... # device to use
	PATH = "translation_model-sd.pt" # local path
	model = LaTran().to(dev)
	model.load_state_dict(torch.load(TRANSLATION_MODEL, map_location=dev))
	```

	## Demo

	For the demo of the whole pipeline, or how this model helps to generate a final image,
	visits [text-to-stylegan HF space](https://huggingface.co/spaces/balgot/text-to-stylegan3).

	## Examples

	* Prompt: `attractive young woman, blond hair`
	![image of attractive young women](attractive_young_woman_blonde.png)




	* Prompt initial: `cute young boy, blond hair, blue eyes, smiling`
	* Prompt second: `old man, short gray hair, glasses, wearing hat`
	<img src="https://huggingface.co/balgot/bert-2-stylegan3/resolve/main/young2old.gif" width="200" height="200" />