Auto-formalized versions of GSM8K and MATH500, formalized and filtered with Goedel models
Ujan (PRO)
AI & ML interests
NLP, Speech
Recent Activity
updated a dataset about 1 hour ago: Ujan/gsm8k_formal_eval_Qwen3.5-9B_prover_judge
published a dataset about 1 hour ago: Ujan/gsm8k_formal_eval_Qwen3.5-9B_prover_judge
updated a dataset about 1 hour ago: Ujan/gsm8k_formal_eval_Falcon-H1R-7B_prover_judge
Organizations
Formal v1
Versions of GSM8K auto-formalized with the state-of-the-art Goedel-Prover-V2 and filtered using DeepSeek-Prover-V2
- Ujan/gsm8k_formal_goedel_few_shot_filtered_Goedel-Prover-V2-32B
  Viewer • Updated • 1.18k • 22
- Ujan/gsm8k_formal_goedel_zero_shot_filtered_DeepSeek-Prover-V2-7B
  Viewer • Updated • 1.15k • 10
- Ujan/gsm8k_formal_goedel_zero_shot_filtered_Goedel-Prover-V2-32B
  Viewer • Updated • 1.23k • 9
- Ujan/gsm8k_formal_goedel_few_shot
  Viewer • Updated • 1.3k • 7
Formal v2
Auto-formalized versions of GSM8K and MATH500, formalized and filtered with Goedel models
models 8
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_2048_epoch_1
Text Generation • 4B • Updated • 2
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_16384_epoch_1
Text Generation • 4B • Updated • 1
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_8192_epoch_1
Text Generation • 4B • Updated • 3
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_4096_epoch_1
Text Generation • 4B • Updated • 4
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_16384_epoch_1
Text Generation • 4B • Updated • 2
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_4096_epoch_1
Text Generation • 4B • Updated • 2
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_8192_epoch_1
Text Generation • 4B • Updated • 1
Ujan/whisper-small_moe_k_means
Automatic Speech Recognition • Updated • 4
datasets 66
Ujan/gsm8k_formal_eval_Qwen3.5-9B_prover_judge
Viewer • Updated • 719
Ujan/gsm8k_formal_eval_Falcon-H1R-7B_prover_judge
Viewer • Updated • 520
Ujan/gsm8k_formal_eval_Olmo-3-7B-Think_prover_judge
Viewer • Updated • 384
Ujan/gsm8k_formal_eval_NVIDIA-Nemotron-Nano-12B-v2_prover_judge
Viewer • Updated • 498
Ujan/gsm8k_formal_eval_Falcon-H1R-7B_prover
Viewer • Updated • 691
Ujan/gsm8k_formal_eval_Qwen3-4B-Thinking-2507_prover_judge
Viewer • Updated • 636
Ujan/gsm8k_formal_eval_Qwen3-8B_prover_judge
Viewer • Updated • 668
Ujan/gsm8k_formal_eval_Ministral-3-8B-Reasoning-2512_prover_judge
Viewer • Updated • 20
Ujan/gsm8k_formal_eval_Falcon-H1R-7B
Viewer • Updated • 759
Ujan/gsm8k_formal_eval_NVIDIA-Nemotron-Nano-12B-v2_prover
Viewer • Updated • 676 • 8