SynGen
Collection
State-of-the-art models & datasets for synthetic reasoning trace generation. Credit for the original dataset goes to https://huggingface.co/Pinkstack
•
5 items
•
Updated
This is a 1.7B parameter LLM designed for synthetic grounded reasoning generation between the user prompt and the final model output, specifically for dataset modifications, but can be used for multiple use cases which require reasoning.
For example, this model allows you to turn any chat dataset into a reasoning dataset as if it was generated by DeepSeek R1 or OpenAI's GPT OSS!
Sampler Settings: Pretty standard, temp = 0.7, top_p = 0.95.
<reasoning_style>deepseek_r1</reasoning_style> # Can replace deepseek_r1 with gpt_oss
<system_prompt>Original System Prompt</system_prompt>
<user>User Message Here</user>
<assistant>Assistant Final Response Here (without reasoning)</assistant>
<think>Generated Reasoning</think>