AI & ML interests
NLP, RL
Organizations
Dahoas/aimo-validation-aime
• Updated • 90 • 59
Dahoas/qwen-1.5-4B-default-positives-epoch-1-100
• Updated • 290k • 70
Dahoas/qwen-1.5-4B-tree-positives-epoch-2-100
• Updated • 491k • 88
Dahoas/qwen-1.5-4B-tree-positives-epoch-1-100
• Updated • 477k • 17
Dahoas/qwen-1.5-4B-epoch-1-test-100
• Updated • 498k • 32
Dahoas/qwen-1.5-4B-K-100-test
• Updated • 500k • 257
Dahoas/MATH_train_K_100_qwen_1.5_4B_outputs
• Updated • 750k • 14
Dahoas/MATH_full_chat_format
• Updated • 12.5k • 7
• 1
Dahoas/prompted_hf_cot_gsm8k
• Updated • 8.79k • 17
• 7
Dahoas/cot_gsm8k_three_step
• Updated • 741 • 8
Dahoas/no_nl_cot_gsm8k_three_step
• Updated • 2.09k • 10
Dahoas/no_nl_cot_gsm8k_toy
• Updated • 2.42k • 12
Dahoas/split_no_nl_cot_gsm8k
• Updated • 28k • 168
• 1
Dahoas/gsm_socratic_conditional
• Updated • 52.4k • 14
• 1
Dahoas/cot_gsm8k_socratic
• Updated • 8.79k • 180
• 4