·
AI & ML interests
None yet
Organizations
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_declare-rwd3-v14.2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_struct-rwd3-v4.2
Text Generation
•
3B
•
Updated
•
9
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_base-rwd3-v1.2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_reflect-rwd3-v8.2-seed42
Text Generation
•
3B
•
Updated
•
3
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_reflect-rwd3-v8.2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_reflect-rwd2-v8.2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_declare-rwd2-v14.2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_struct-rwd2-v4.2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_base-rwd2-v1.2
Text Generation
•
3B
•
Updated
•
4
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_reflect-rwd1-v8.2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_declare-rwd1-v14.2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_struct-rwd1-v4.2
Text Generation
•
3B
•
Updated
•
89
niklasm222/qwen2.5-3b-grpo-1.75k-gsm8k-prolog-v2.2-rwd1
Text Generation
•
3B
•
Updated
niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_base-rwd1-v1.2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v13-rwd4
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v14-rwd4
Text Generation
•
3B
•
Updated
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v11-rwd3
Text Generation
•
3B
•
Updated
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v10-rwd3
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v12-rwd3
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v7
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v5
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v4
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v3
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v2
Text Generation
•
3B
•
Updated
•
1
niklasm222/qwen2.5-3b-grpo-1.7k-gsm8k-prolog-v1
Text Generation
•
3B
•
Updated
•
1
niklasm222/Qwen2.5-3B-Instruct-1K_subset-GRPO-gsm8k-prolog-prover-v1
Text Generation
•
3B
•
Updated
•
1
niklasm222/Qwen2.5-3B-Instruct-GRPO-2K-gsm8k-prolog
Text Generation
•
3B
•
Updated
•
1
niklasm222/llama-3.2-1b-it-GRPO-gsm8k-prolog
Updated
niklasm222/gemma-2-2b-it-gsm8k-prolog-sft-lora-v1
Updated
niklasm222/gemma-2-2B-it-thinking-function_calling-V0
Updated