GeorgiaTech/bert-generative-pubmedqa

encoder-decoder

text2text-generation


hbeadles committed on Apr 28, 2024

Commit 6c0c17a · verified · 1 Parent(s): e4b51ff

Update README.md

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -10,7 +10,7 @@ tags:
 *Author - Hayden Beadles*
-This model is meant to evaluate the results of creating an Encoder / Decoder generative model using BERT. The model is then finetuned on $30000$ samples of the PubMedQA dataset. Instead of being finetuned on the columns question and final_answer, where final_answer is a set of yes / no answers, we instead fine tune on the more challenging long_answer column, which gives a short answer to the question.
 The model was fine-tuned over 3 epochs, using the Adam learning rate scheduler, with a max length of 128 tokens.

 *Author - Hayden Beadles*
+This model is meant to evaluate the results of creating an Encoder / Decoder generative model using BERT. The model is then finetuned on 30000 samples of the PubMedQA dataset. Instead of being finetuned on the columns question and final_answer, where final_answer is a set of yes / no answers, we instead fine tune on the more challenging long_answer column, which gives a short answer to the question.
 The model was fine-tuned over 3 epochs, using the Adam learning rate scheduler, with a max length of 128 tokens.
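
The data selection described in the README can be illustrated with a minimal sketch, which is not the author's training code. It pairs each `question` with its `long_answer` (ignoring the yes / no `final_answer`), caps the set at 30000 samples, and truncates to 128 tokens. The column names come from the README text. The record layout, the helper names `build_pairs` and `truncate`, and the whitespace split standing in for BERT's tokenizer are all assumptions made for illustration.

```python
def truncate(text, max_length):
    """Keep at most max_length tokens.

    Assumption: a whitespace split stands in for BERT's WordPiece
    tokenizer, which would produce different token counts.
    """
    return " ".join(text.split()[:max_length])


def build_pairs(records, source_col="question", target_col="long_answer",
                max_samples=30000, max_length=128):
    """Build (source, target) text pairs for seq2seq fine-tuning.

    records is a hypothetical iterable of dicts keyed by column name.
    The yes / no column is deliberately not used as the target.
    """
    pairs = []
    for record in records:
        if len(pairs) >= max_samples:
            break  # cap the training set size
        source = truncate(record[source_col], max_length)
        target = truncate(record[target_col], max_length)
        pairs.append((source, target))
    return pairs


# Hypothetical record, made up for illustration only.
example_records = [
    {
        "question": "Does X reduce Y?",
        "long_answer": "X was associated with lower Y.",
        "final_answer": "yes",
    }
]
example_pairs = build_pairs(example_records)
```

Each resulting pair maps a question to a free-text answer, which is what makes this a generation task rather than the yes / no classification that `final_answer` would give.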