| Tasks | Version | Filter | n-shot | Metric | Value | Stderr |
|---|---|---|---|---|---|---|
| mmlu | 2 | none | | acc | 0.2295 | ± 0.0035 |
| - humanities | 2 | none | 0 | acc | 0.2421 | ± 0.0062 |
| - formal_logic | 1 | none | 0 | acc | 0.2857 | ± 0.0404 |
| - high_school_european_history | 1 | none | 0 | acc | 0.2182 | ± 0.0323 |
| - high_school_us_history | 1 | none | 0 | acc | 0.2500 | ± 0.0304 |
| - high_school_world_history | 1 | none | 0 | acc | 0.2700 | ± 0.0289 |
| - international_law | 1 | none | 0 | acc | 0.2397 | ± 0.0390 |
| - jurisprudence | 1 | none | 0 | acc | 0.2593 | ± 0.0424 |
| - logical_fallacies | 1 | none | 0 | acc | 0.2209 | ± 0.0326 |
| - moral_disputes | 1 | none | 0 | acc | 0.2486 | ± 0.0233 |
| - moral_scenarios | 1 | none | 0 | acc | 0.2380 | ± 0.0142 |
| - philosophy | 1 | none | 0 | acc | 0.1865 | ± 0.0221 |
| - prehistory | 1 | none | 0 | acc | 0.2160 | ± 0.0229 |
| - professional_law | 1 | none | 0 | acc | 0.2458 | ± 0.0110 |
| - world_religions | 1 | none | 0 | acc | 0.3216 | ± 0.0358 |
| - other | 2 | none | 0 | acc | 0.2398 | ± 0.0076 |
| - business_ethics | 1 | none | 0 | acc | 0.3000 | ± 0.0461 |
| - clinical_knowledge | 1 | none | 0 | acc | 0.2151 | ± 0.0253 |
| - college_medicine | 1 | none | 0 | acc | 0.2081 | ± 0.0310 |
| - global_facts | 1 | none | 0 | acc | 0.1800 | ± 0.0386 |
| - human_aging | 1 | none | 0 | acc | 0.3139 | ± 0.0311 |
| - management | 1 | none | 0 | acc | 0.1748 | ± 0.0376 |
| - marketing | 1 | none | 0 | acc | 0.2906 | ± 0.0297 |
| - medical_genetics | 1 | none | 0 | acc | 0.3000 | ± 0.0461 |
| - miscellaneous | 1 | none | 0 | acc | 0.2375 | ± 0.0152 |
| - nutrition | 1 | none | 0 | acc | 0.2255 | ± 0.0239 |
| - professional_accounting | 1 | none | 0 | acc | 0.2340 | ± 0.0253 |
| - professional_medicine | 1 | none | 0 | acc | 0.1838 | ± 0.0235 |
| - virology | 1 | none | 0 | acc | 0.2831 | ± 0.0351 |
| - social sciences | 2 | none | 0 | acc | 0.2171 | ± 0.0074 |
| - econometrics | 1 | none | 0 | acc | 0.2368 | ± 0.0400 |
| - high_school_geography | 1 | none | 0 | acc | 0.1768 | ± 0.0272 |
| - high_school_government_and_politics | 1 | none | 0 | acc | 0.1969 | ± 0.0287 |
| - high_school_macroeconomics | 1 | none | 0 | acc | 0.2026 | ± 0.0204 |
| - high_school_microeconomics | 1 | none | 0 | acc | 0.2101 | ± 0.0265 |
| - high_school_psychology | 1 | none | 0 | acc | 0.1927 | ± 0.0169 |
| - human_sexuality | 1 | none | 0 | acc | 0.2595 | ± 0.0384 |
| - professional_psychology | 1 | none | 0 | acc | 0.2500 | ± 0.0175 |
| - public_relations | 1 | none | 0 | acc | 0.2182 | ± 0.0396 |
| - security_studies | 1 | none | 0 | acc | 0.1878 | ± 0.0250 |
| - sociology | 1 | none | 0 | acc | 0.2438 | ± 0.0304 |
| - us_foreign_policy | 1 | none | 0 | acc | 0.2800 | ± 0.0451 |
| - stem | 2 | none | 0 | acc | 0.2125 | ± 0.0073 |
| - abstract_algebra | 1 | none | 0 | acc | 0.2200 | ± 0.0416 |
| - anatomy | 1 | none | 0 | acc | 0.1852 | ± 0.0336 |
| - astronomy | 1 | none | 0 | acc | 0.1776 | ± 0.0311 |
| - college_biology | 1 | none | 0 | acc | 0.2569 | ± 0.0365 |
| - college_chemistry | 1 | none | 0 | acc | 0.2000 | ± 0.0402 |
| - college_computer_science | 1 | none | 0 | acc | 0.2600 | ± 0.0441 |
| - college_mathematics | 1 | none | 0 | acc | 0.2100 | ± 0.0409 |
| - college_physics | 1 | none | 0 | acc | 0.2157 | ± 0.0409 |
| - computer_security | 1 | none | 0 | acc | 0.2800 | ± 0.0451 |
| - conceptual_physics | 1 | none | 0 | acc | 0.2638 | ± 0.0288 |
| - electrical_engineering | 1 | none | 0 | acc | 0.2414 | ± 0.0357 |
| - elementary_mathematics | 1 | none | 0 | acc | 0.2090 | ± 0.0209 |
| - high_school_biology | 1 | none | 0 | acc | 0.1774 | ± 0.0217 |
| - high_school_chemistry | 1 | none | 0 | acc | 0.1527 | ± 0.0253 |
| - high_school_computer_science | 1 | none | 0 | acc | 0.2500 | ± 0.0435 |
| - high_school_mathematics | 1 | none | 0 | acc | 0.2111 | ± 0.0249 |
| - high_school_physics | 1 | none | 0 | acc | 0.1987 | ± 0.0326 |
| - high_school_statistics | 1 | none | 0 | acc | 0.1528 | ± 0.0245 |
| - machine_learning | 1 | none | 0 | acc | 0.3125 | ± 0.0440 |
| arc_challenge | 1 | none | 0 | acc | 0.1792 | ± 0.0112 |
| | | none | 0 | acc_norm | 0.2312 | ± 0.0123 |
| arc_easy | 1 | none | 0 | acc | 0.2946 | ± 0.0094 |
| | | none | 0 | acc_norm | 0.2782 | ± 0.0092 |
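
| Groups | Version | Filter | n-shot | Metric | Value | Stderr |
|---|---|---|---|---|---|---|
| mmlu | 2 | none | | acc | 0.2295 | ± 0.0035 |
| - humanities | 2 | none | 0 | acc | 0.2421 | ± 0.0062 |
| - other | 2 | none | 0 | acc | 0.2398 | ± 0.0076 |
| - social sciences | 2 | none | 0 | acc | 0.2171 | ± 0.0074 |
| - stem | 2 | none | 0 | acc | 0.2125 | ± 0.0073 |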
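
To put the group-level numbers in context, the following is a minimal sketch that expresses each MMLU group accuracy as a distance from a guessing baseline, measured in reported standard errors. The baseline of 0.25 is an assumption based on MMLU questions having four answer options; the names `mmlu_groups`, `CHANCE`, and `z_vs_chance` are illustrative and not part of any evaluation tool.

```python
# Group-level MMLU accuracies and standard errors, copied from the Groups table.
mmlu_groups = {
    "mmlu": (0.2295, 0.0035),
    "humanities": (0.2421, 0.0062),
    "other": (0.2398, 0.0076),
    "social sciences": (0.2171, 0.0074),
    "stem": (0.2125, 0.0073),
}

# Assumption: four answer options per question, so uniform guessing scores 0.25.
CHANCE = 0.25


def z_vs_chance(acc, stderr, chance=CHANCE):
    """Return how many standard errors `acc` lies above (+) or below (-) `chance`."""
    return (acc - chance) / stderr


z_scores = {name: z_vs_chance(acc, se) for name, (acc, se) in mmlu_groups.items()}

for name, z in z_scores.items():
    print(f"{name:16s} z = {z:+.2f}")
```

Under that assumption, every group mean sits below 0.25. The ARC rows are left out because the sketch does not assume a fixed number of answer options for those tasks.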

Model size: 18.9M params · Tensor type: F32 · Format: Safetensors

Datasets used to train 56m/Dumb-1.1-Preview-05-10