andresnowak/MNLP_M2_mcqa_model


andresnowak committed on May 27, 2025

Commit 3392877 · verified · 1 Parent(s): 7dea30b

Update README.md

Files changed (1)

README.md +15 -2

@@ -27,10 +27,21 @@ More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
@@ -46,7 +57,9 @@ The following hyperparameters were used during training:
 - num_epochs: 2
 ### Training results
 ### Framework versions

 ## Training and evaluation data
+Training used the training splits of the following datasets:
+- MedMCQA
+- MMLU
+- SciQ
+- AI2 ARC
+- MathQA
+- ScienceQA
+- OpenBookQA
 ## Training procedure
+We keep only the questions that have exactly 4 answer choices. For each one, we run a forward pass on the whole prompt (question with choices)
+and take only the logits at the last position. We then compute the cross-entropy loss on these last-position logits over the 4 answer options
+(so the cross-entropy is not computed over the whole vocabulary, only over the tokens for the option letters A, B, C and D).
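
The procedure described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the code used for this model: the function names `keep_four_choice` and `restricted_cross_entropy`, the `"choices"` field, and the toy token ids are all assumptions made for the example. An actual training run would do the same thing with framework tensors and the tokenizer's real ids for the letters.

```python
import math


def keep_four_choice(examples):
    """Keep only questions with exactly four answer choices.

    Assumes each example is a dict with a "choices" list (hypothetical schema).
    """
    return [ex for ex in examples if len(ex["choices"]) == 4]


def restricted_cross_entropy(last_logits, letter_token_ids, gold_index):
    """Cross-entropy over the four option-letter tokens only.

    last_logits: vocabulary-sized list of logits at the final prompt position.
    letter_token_ids: token ids of "A", "B", "C", "D" (tokenizer-specific).
    gold_index: position (0-3) of the correct letter in letter_token_ids.
    """
    # Select the four logits of interest and ignore the rest of the vocabulary.
    option_logits = [last_logits[t] for t in letter_token_ids]
    # Numerically stable log-sum-exp over the four options.
    m = max(option_logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in option_logits))
    # Negative log-probability of the correct option.
    return log_z - option_logits[gold_index]


# Toy example: a 6-token vocabulary where ids 2..5 stand in for A, B, C, D.
toy_logits = [9.0, -1.0, 0.0, 0.0, 0.0, 0.0]
toy_letter_ids = [2, 3, 4, 5]
loss = restricted_cross_entropy(toy_logits, toy_letter_ids, gold_index=0)
```

In the toy example the four option logits are equal, so the loss equals log(4) regardless of the large logit at token id 0. That is the point of restricting the softmax: tokens outside the four letters cannot absorb probability mass.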
 ### Training hyperparameters
 The following hyperparameters were used during training:
 - num_epochs: 2
 ### Training results
+| Model                | MMLU | MMLU-pro | arc-easy | arc-challenge | nlp4education | GPQA | Musr |
+|----------------------|------|----------|----------|---------------|---------------|------|------|
+| Qwen3-0.6B-base-MCQA | 52%  | 17%      | 86%      | 72%           | 51%           | 29%  | 38%  |
 ### Framework versions