Update README.md
## Training procedure

The model is trained on examples with any number of answer choices; within each batch, the options are padded out to the largest number of options that any example in that batch has. Training then runs a feedforward pass over the whole prompt (the question with its choices) and grabs only the last logit, and the cross-entropy loss is computed on that last logit over the answer options to choose from (so we don't do cross-entropy over the whole vocabulary, only over the tokens of the letters of the options, e.g. A, B, C and D).
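The last-logit cross-entropy restricted to the option letters can be sketched as follows. This is a minimal NumPy illustration, not the repository's actual training code; the function name and the `-1` padding convention for batches with mixed option counts are assumptions.

```python
import numpy as np

def option_cross_entropy(last_logits, option_token_ids, labels):
    """Cross-entropy over the option-letter tokens only (hypothetical sketch).

    last_logits:      (batch, vocab) logits at the final prompt position.
    option_token_ids: (batch, max_options) token ids of the letters "A", "B", ...
                      per example, padded with -1 when an example has fewer
                      options than the batch maximum.
    labels:           (batch,) index of the correct option for each example.
    """
    batch = last_logits.shape[0]
    losses = np.empty(batch)
    for i in range(batch):
        ids = option_token_ids[i]
        ids = ids[ids >= 0]                    # drop the padding slots
        logits = last_logits[i, ids]           # scores for the option letters only
        logits = logits - logits.max()         # numerical stability
        log_probs = logits - np.log(np.exp(logits).sum())
        losses[i] = -log_probs[labels[i]]      # NLL of the correct letter
    return losses.mean()

# Tiny example: vocab of 5 tokens, option letters at token ids 1 and 3,
# with one padded option slot; the correct option is the first one.
last_logits = np.array([[0.0, 2.0, 0.0, 0.0, 0.0]])
option_token_ids = np.array([[1, 3, -1]])
labels = np.array([0])
loss = option_cross_entropy(last_logits, option_token_ids, labels)
```

Restricting the softmax to the option-letter token ids means the loss only compares the answer letters against each other, never the full vocabulary.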

We also format all the training examples with 7 random templates, so as to make the model robust to the different ways one could phrase an MCQA question; using different prompts can make the results vary a lot.
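The templating step might look like the sketch below. The seven templates shown are invented placeholders, since the card does not list the actual ones.

```python
import random

# Hypothetical templates for illustration only; the real 7 are not documented.
TEMPLATES = [
    "Question: {q}\n{opts}\nAnswer:",
    "{q}\n\nChoices:\n{opts}\n\nThe correct option is",
    "Read the question and pick the best option.\n{q}\n{opts}\nYour answer:",
    "{q}\nOptions:\n{opts}\nAnswer with the letter only:",
    "Q: {q}\n{opts}\nA:",
    "Consider the following question:\n{q}\n{opts}\nThe answer is",
    "{q}\nWhich of the following is correct?\n{opts}\nAnswer:",
]

def format_example(question, options, rng):
    """Render one MCQA example with a randomly chosen template."""
    # Label each option with a letter: "A. ...", "B. ...", ...
    opts = "\n".join(f"{chr(ord('A') + i)}. {o}" for i, o in enumerate(options))
    return rng.choice(TEMPLATES).format(q=question, opts=opts)

rng = random.Random(0)
prompt = format_example("What is the capital of France?",
                        ["Lyon", "Paris", "Nice"], rng)
```

Sampling the template per example, rather than fixing one, is what spreads the training signal across prompt phrasings.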
### Training hyperparameters