This model is a fine-tuned version of Qwen/Qwen2.5-1.5B, aimed at improving code generation and the quality of its reasoning traces. It was trained on the Naholav/CodeGen-Diverse-5K dataset, with a focus on structured reasoning wrapped in <think> tags.
The model generates a code solution from a problem statement. It is trained to write out a "thinking" process before the final solution, which is intended to help on tasks that need multi-step logic.
The model expects the following format to trigger the reasoning capability:
Input: [Your Problem Description]
<think>
[Model generates reasoning here]
</think>
Solution: [Model generates code here]
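The format above can be assembled and parsed with a couple of small helpers. This is a minimal sketch, not code shipped with the model: the function names `build_prompt` and `split_completion` are hypothetical, and it assumes the prompt should end with the opening <think> tag so that the model continues with its reasoning. If the model turns out to emit the opening tag itself, drop it from the prompt.

```python
from typing import Tuple


def build_prompt(problem: str) -> str:
    """Format a problem statement in the layout shown above.

    Assumption: the prompt ends with the opening <think> tag so the
    model continues with its reasoning.
    """
    return f"Input: {problem.strip()}\n<think>\n"


def split_completion(completion: str) -> Tuple[str, str]:
    """Split generated text into (reasoning, solution).

    `completion` is the text produced after the prompt, i.e. everything
    following the opening <think> tag.
    """
    reasoning, closing_tag, rest = completion.partition("</think>")
    if not closing_tag:
        # The model never closed the block: treat it all as reasoning.
        return reasoning.strip(), ""
    rest = rest.strip()
    if rest.startswith("Solution:"):
        rest = rest[len("Solution:"):]
    return reasoning.strip(), rest.strip()
```

The string returned by `build_prompt` would be passed to whatever generation call you use, and the newly generated text (without the prompt) to `split_completion`. Returning an empty solution when the closing tag is missing makes a truncated generation easy to detect, for example when the token limit is too low for the reasoning to finish.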