andresnowak committed on
Commit 3392877 · verified · 1 Parent(s): 7dea30b

Update README.md

Files changed (1)
  1. README.md +15 -2
README.md CHANGED
@@ -27,10 +27,21 @@ More information needed
27
 
28
  ## Training and evaluation data
29
 
30
- More information needed
 
 
 
 
 
 
 
31
 
32
  ## Training procedure
33
 
 
 
 
 
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
@@ -46,7 +57,9 @@ The following hyperparameters were used during training:
46
  - num_epochs: 2
47
 
48
  ### Training results
49
-
 
 
50
 
51
 
52
  ### Framework versions
 
27
 
28
  ## Training and evaluation data
29
 
30
+ Training was done on the training splits of the following datasets:
31
+ - MEDMCQA
32
+ - MMLU
33
+ - Sciq
34
+ - Ai2 Arc
35
+ - Math_qa
36
+ - ScienceQa
37
+ - Openbookqa
38
 
39
  ## Training procedure
40
 
41
+ Only questions with exactly 4 answer choices were kept. For each one, we run a single forward pass over the whole prompt (question plus choices)
42
+ and take the logits at the last token position. The cross-entropy loss is then computed over only the 4 option tokens,
43
+ not over the whole vocabulary (that is, only over the logits of the tokens for the letters A, B, C and D).
44
+
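The procedure described above can be illustrated with a minimal sketch. This is not the training code from this repository: it uses plain Python lists in place of model tensors, and the names `keep_four_choice`, `restricted_cross_entropy`, and the toy token ids are all hypothetical. Actual training would presumably take the last-position logits from the model and the letter token ids from its tokenizer.

```python
import math


def keep_four_choice(examples):
    """Keep only examples that have exactly 4 answer choices."""
    return [ex for ex in examples if len(ex["choices"]) == 4]


def restricted_cross_entropy(last_logits, option_token_ids, correct_index):
    """Cross-entropy over only the option tokens' logits at the last position.

    last_logits: one logit per vocabulary entry, taken at the final prompt token.
    option_token_ids: vocabulary ids of the tokens for A, B, C, D (assumed known).
    correct_index: 0..3, position of the correct answer among the options.
    """
    # Discard every logit except those of the 4 option tokens.
    option_logits = [last_logits[i] for i in option_token_ids]
    # Numerically stable log-sum-exp over the 4 option logits only.
    m = max(option_logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in option_logits))
    # Negative log-probability of the correct option under the 4-way softmax.
    return log_z - option_logits[correct_index]


# Toy vocabulary of 6 tokens; ids 2..5 stand in for "A", "B", "C", "D".
toy_logits = [9.0, -3.0, 1.0, 2.0, 0.5, -1.0]
toy_option_ids = [2, 3, 4, 5]
loss = restricted_cross_entropy(toy_logits, toy_option_ids, correct_index=1)
```

Because the softmax is taken over the 4 option logits only, the large logit on token 0 in the toy example has no effect on the loss. A full-vocabulary cross-entropy would instead penalize the probability mass assigned to that token.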
45
  ### Training hyperparameters
46
 
47
  The following hyperparameters were used during training:
 
57
  - num_epochs: 2
58
 
59
  ### Training results
60
+ | Model | MMLU | MMLU-pro | arc-easy | arc-challenge | nlp4education | GPQA | Musr |
61
+ |---------------------|--------------|-------------|---------------|---------------| ---- | ---- | ---- |
62
+ | Qwen3-0.6B-base-MCQA | 52% | 17% | 86% | 72% | 51% | 29% | 38% |
63
 
64
 
65
  ### Framework versions