---
datasets:
- ffurfaro/PixelBytes-Pokemon
language: en
library_name: pytorch
license: mit
pipeline_tag: text-to-image
tags:
- image-generation
- text-generation
- audio-generation
- multimodal
---

# PixelBytes: Unified Multimodal Generation

Welcome to the **PixelBytes** repository! This project features models that generate text and images simultaneously, pixel by pixel, using a unified embedding. (These weights are for testing only.)

## Overview

### Key Concepts

- **Image Transformer**: Generates images pixel by pixel.
- **Bi-Mamba+**: A bidirectional Mamba model for time-series prediction.
- **MambaByte**: A token-free selective state-space model.

The PixelBytes model generates mixed sequences of text and images, handling transitions with line breaks and maintaining consistent image dimensions.
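To make the pixel-by-pixel idea concrete, here is a toy autoregressive sketch in PyTorch. This is **not** the actual PixelBytes architecture: the `ToyPixelByteLM` class, vocabulary size, and layer dimensions are all illustrative assumptions; it only shows the token-at-a-time generation loop over a unified byte/pixel vocabulary.

```python
import torch
import torch.nn as nn

# Illustrative assumptions only -- the real model's sizes are not given here.
VOCAB_SIZE = 256   # e.g. one token per byte or palette index
EMBED_DIM = 32
HIDDEN_DIM = 64

class ToyPixelByteLM(nn.Module):
    """Toy autoregressive LSTM over a unified byte/pixel vocabulary."""

    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB_SIZE, EMBED_DIM)
        self.lstm = nn.LSTM(EMBED_DIM, HIDDEN_DIM, batch_first=True)
        self.head = nn.Linear(HIDDEN_DIM, VOCAB_SIZE)

    @torch.no_grad()
    def generate(self, prompt, steps):
        tokens = list(prompt)
        state = None
        x = torch.tensor([tokens])
        for _ in range(steps):
            out, state = self.lstm(self.embed(x), state)
            # Greedy choice of the next token from the last hidden state.
            next_tok = self.head(out[:, -1]).argmax(-1).item()
            tokens.append(next_tok)
            x = torch.tensor([[next_tok]])  # feed back one token at a time
        return tokens

model = ToyPixelByteLM()
seq = model.generate(prompt=[10, 20, 30], steps=5)
print(len(seq))  # prompt length + generated steps
```

In the real setting, generated tokens would alternate between text bytes and pixel values, with line-break tokens marking transitions between modalities.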

## Dataset

We use the **PixelBytes-Pokemon** dataset, available on Hugging Face: [PixelBytes-Pokemon](https://huggingface.co/datasets/ffurfaro/PixelBytes-Pokemon). It contains text and image sequences of Pokémon for training our model.

## Models Trained

- **3 LSTM Models**: 2 autoregressive and 1 purely predictive.

---
|
| 37 |
+
|
| 38 |
+
Thank you for exploring **PixelBytes**! We hope this model aids your multimodal generation projects.
|