meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval
meng-lab/MATH-OLMo-3-1025-7B-GRPO-Serval
meng-lab/MATH-OLMo-3-1025-7B-GRPO-Serval-15K
meng-lab/llama_3.1_8b_instruct_paradec_humaneval_medusa
8B • Updated • 1
meng-lab/AdaDecode-CodeLlama-34B-Instruct-XSum
34B • Updated • 11
meng-lab/AdaDecode-CodeLlama-13B-Instruct-HumanEval
13B • Updated • 6
meng-lab/AdaDecode-CodeLlama-34B-Instruct-HumanEval
34B • Updated • 9
meng-lab/AdaDecode-CodeLlama-13B-Instruct-XSum
13B • Updated • 8
meng-lab/AdaDecode-Llama-3.1-8B-Instruct-HumanEval
8B • Updated • 63
meng-lab/AdaDecode-CodeLlama-34B-Instruct-GSM8K
34B • Updated • 7
meng-lab/AdaDecode-Llama-3.1-8B-Instruct-GSM8K
8B • Updated • 4
meng-lab/AdaDecode-CodeLlama-13B-Instruct-GSM8K
13B • Updated • 6
meng-lab/AdaDecode-Llama-3.1-8B-Instruct-XSum
8B • Updated • 4
meng-lab/PopQA-InstructRAG-FT
Text Generation • 8B • Updated • 8
meng-lab/TriviaQA-InstructRAG-FT
Text Generation • 8B • Updated • 7
meng-lab/NaturalQuestions-InstructRAG-FT
Text Generation • 8B • Updated • 12
meng-lab/ASQA-InstructRAG-FT
Text Generation • 8B • Updated • 7
meng-lab/2WikiMultiHopQA-InstructRAG-FT
Text Generation • 8B • Updated • 11