# Annoy: This should be a paper Title

📑 Paper | 🌐 Project Page | 💾 Released Resources | 📦 Repo This is the resource page of the our resources collection on Huggingface, we highlight your currect position with a blue block. **Dataset**

Dataset Link

Annoy-PythonEdu-Rs 🤗

Please also check the raw data after our processing if you are interested: [mm-tool/Annoy-PyEdu-Rs-Raw](https://huggingface.co/datasets/mm-tool/Annoy-PyEdu-Rs-Raw). **Models**

Base Model / Training Annoy Annoy++

Stage 1 Stage 2 Stage 1 Stage 2

Qwen 2.5 7B Coder 🤗 🤗 🤗 🤗

LLaMA 3.1 8B 🤗 🤗 🤗 🤗

DeepSeek v2 Lite Coder 🤗 🤗 🤗 🤗

**Introduction** While having full executable code theoretically allows us to generate reliable execution trajectories as responses, two challenges arise: 1) Obtaining a deterministic reverse function for input prediction is impractical; 2) Automatically constructed trajectories are constrained by pre-designed templates and lack the expressiveness and generalizability of free-form natural language reasoning. Thus, we adopt a fully LLM-based approach for synthesizing all the desired responses using DeepSeek-V2.5, as it has top-tier performance but extremely low cost compared to other advanced LLMs. *Due to our collaborators' compliance requirements, we only release the PythonEdu-Rs subset (this page) of full dataset. **License** The license for this dataset is CC-BY-4.0. **License** The license for this dataset is The Stack v2 License. **License** The license for this dataset is Apache 2.0.

Base Model / Training	Annoy		Annoy++
Base Model / Training	Stage 1	Stage 2	Stage 1	Stage 2
Qwen 2.5 7B Coder	🤗	🤗	🤗	🤗
LLaMA 3.1 8B	🤗	🤗	🤗	🤗
DeepSeek v2 Lite Coder	🤗	🤗	🤗	🤗