Nick Doiron committed
Commit · 653ddae
Parent(s): 7bc3773

readme fix and code sample

README.md CHANGED
@@ -1,3 +1,4 @@
+---
 language:
 - en
 license: apache-2.0
@@ -5,6 +6,7 @@ tags:
 - reddit
 datasets:
 - georeactor/reddit_one_ups_seq2seq_2014
+---
 
 # t5-reddit-2014
 
@@ -21,3 +23,15 @@ Training notebook: https://github.com/Georeactor/reddit-one-ups/blob/main/traini
 - Fine-tuned on first 80% of [georeactor/reddit_one_ups_seq2seq_2014](https://huggingface.co/datasets/georeactor/reddit_one_ups_seq2seq_2014) for one epoch, batch size = 2.
 - Loss did not move much during this epoch.
 - Future experiments should use a larger model, larger batch size (could easily have done batch_size = 4 on CoLab), full dataset if we are not worried about eval.
+
+## Inference
+
+```
+from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
+model = AutoModelForSeq2SeqLM.from_pretrained('georeactor/t5-reddit-2014')
+tokenizer = AutoTokenizer.from_pretrained('georeactor/t5-reddit-2014')
+
+input = tokenizer.encode('Looks like a potato bug', return_tensors="pt")
+output = model.generate(input, max_length=256)
+tokenizer.decode(output[0])
+```
|