add todos

Files changed:
- README.md (+1 -0)
- model/t5_vae.py (+1 -0)
- train.py (+1 -0)
README.md

@@ -10,6 +10,7 @@ Builds on T5, using an autoencoder to convert it into a VAE.
 
 ## ToDo
 
+- [ ] Save a wikipedia sentences dataset to Huggingface (see original https://github.com/ChunyuanLI/Optimus/blob/master/data/download_datasets.md)
 - [ ] Convert `transformers/examples/flax/language-modeling/run_clm_flax.py` into a new training script for transformer-VAE's.
   - Use an "empty VAE" a.k.a just sends the encoding to the decoder with no regularisation loss, use the T5 encoder & decoder.
 - [ ] Make a `autoencoders.py` version of `autoencoders.py`.
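The "empty VAE" mentioned in the ToDo list — forwarding the T5 encoding straight to the decoder with no regularisation loss — could be sketched as a trivial pass-through module. This is a hypothetical illustration only; the class and method names below are not part of the repository.

```python
class EmptyVAE:
    """Hypothetical pass-through "VAE" sketch: returns the encoder output
    unchanged and always reports a regularisation loss of zero, so the
    training loop's loss plumbing can be exercised before a real VAE
    (with a sampled latent and a reg loss term) is wired in."""

    def __call__(self, encoding):
        # Identity mapping: the decoder sees exactly what the encoder produced.
        reg_loss = 0.0
        return encoding, reg_loss
```

A real VAE would replace the identity with an encode-to-latent/decode-from-latent step and return a nonzero regularisation term.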
model/t5_vae.py

@@ -153,4 +153,5 @@ class FlaxT5VAEForAutoencoding(FlaxPreTrainedModel):
         params: dict = None,
         dropout_rng: PRNGKey = None,
     ):
+        # TODO run `FlaxT5ForConditionalGeneration.decode` with above args
         raise NotImplementedError()
train.py

@@ -2,6 +2,7 @@
 Pre-training/Fine-tuning seq2seq models on autoencoding a dataset.
 
 TODO:
+- [ ] Don't make decoder input ids.
 - [ ] Add reg loss
 - [ ] config
 - [ ] calculate MMD loss
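For the "calculate MMD loss" item, one common choice (e.g. in MMD-VAE / InfoVAE-style models) is a biased MMD² estimate with an RBF kernel, comparing the batch of posterior latents against samples drawn from a standard-normal prior. The sketch below uses NumPy for clarity; the function names and the `sigma` default are assumptions for illustration, not the repository's API, and a JAX version would swap `np` for `jnp`.

```python
import numpy as np

def rbf_kernel(a, b, sigma=1.0):
    # Pairwise RBF (Gaussian) kernel matrix between the rows of a and b.
    sq_dists = (
        np.sum(a ** 2, axis=1)[:, None]
        + np.sum(b ** 2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    return np.exp(-sq_dists / (2.0 * sigma ** 2))

def mmd_loss(latents, prior_samples, sigma=1.0):
    # Biased MMD^2 estimate: E[k(x, x')] + E[k(y, y')] - 2 E[k(x, y)].
    # Zero when both sample sets are identical; grows as the latent
    # distribution drifts away from the prior samples.
    k_xx = rbf_kernel(latents, latents, sigma)
    k_yy = rbf_kernel(prior_samples, prior_samples, sigma)
    k_xy = rbf_kernel(latents, prior_samples, sigma)
    return k_xx.mean() + k_yy.mean() - 2.0 * k_xy.mean()
```

In training, `prior_samples` would typically be a fresh standard-normal draw of the same shape as the batch of latents, and `mmd_loss` would be added to the reconstruction loss as the regularisation term.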