CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation Paper • 2406.07054 • Published Jun 11, 2024
One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning Paper • 2510.26167 • Published Oct 30, 2025
CoEvol Collection Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation • 5 items • Updated Oct 26, 2024
CAS-SIAT-ConsistencyAI/CoEvol-Mixtral_Mistral-7B-v0.1_SFT Text Generation • 7B • Updated Jun 6, 2024 • 12
CAS-SIAT-ConsistencyAI/CoEvol-ChatGPT_Mistral-7B-v0.1_SFT Text Generation • 7B • Updated Jun 6, 2024 • 13