LiteCoder-4b-Terminal-preview / README.md

Upload folder using huggingface_hub

8adf01b verified 10 days ago

3.04 kB

	---
	license: mit
	base_model:
	- Qwen/Qwen3-4B-Instruct-2507
	---

	## LiteCoder-4b-Terminal-preview

	LiteCoder-4b-Terminal-preview is part of our series of models specialized in terminal-based interactions and stems from our recent efforts to develop capable small and medium-sized code agent models. The model is fine-tuned from `
	Qwen3-4B-Instruct-2507` on the [LiteCoder-SFT-Terminal-preview](https://huggingface.co/datasets/Lite-Coder/LiteCoder-SFT-Terminal-preview) dataset.

	Notably, this model achieves competitive results using fewer than 1,000 training samples. By relying entirely on a fully synthetic pipeline—without converting any existing datasets—we were able to secure significant gains on the challenging Terminal Bench, matching the performance of leading open-source models with extreme data efficiency.

	## Released Artifacts

	\| 2025/12/17 \| \| \|
	\| --- \| --- \| --- \|
	\| LiteCoder-4b-Terminal-preview \| Model \| https://huggingface.co/Lite-Coder/LiteCoder-4b-Terminal-preview \|
	\| LiteCoder-SFT-Terminal-preview \| Dataset \| https://huggingface.co/datasets/Lite-Coder/LiteCoder-SFT-Terminal-preview \|

	## Results

	Our models achieve competitive results on Terminal Bench, significantly outperforming general-purpose models of similar (and even larger) sizes.

	Terminal Bench 1.0 Performance

	\| Model \| Agent \| Results \|
	\| --- \| --- \| --- \|
	\| LiteCoder-30a3b-Terminal-preview \| Terminus 2 \| 18.75% \|
	\| Qwen3-30B-A3B-Nex-N1 \| Terminus 2 \| 18.75% \|
	\| LiteCoder-4b-Terminal-preview \| Terminus 2 \| 13.75% \|
	\| Qwen3-30B-A3B-Instruct \| Terminus 2 \| 12.5% \|
	\| Qwen3-4B-Instruct \| Terminus 2 \| 5.0% \|

	Terminal Bench 2.0 Performance

	\| Model \| Agent \| Results \|
	\| --- \| --- \| --- \|
	\| LiteCoder-30a3b-Terminal-preview \| Terminus 2 \| 5.6% \|
	\| LiteCoder-4b-Terminal-preview \| Terminus 2 \| 3.3% \|
	\| Qwen3-32B \| Terminus 2 \| 1.9% \|
	\| InternLM3-8B-Nex-N1 \| Terminus 2 \| 0% \|
	\| Qwen3-8B \| Terminus 2 \| 0% \|

	## Citation

	```latex
	@misc{LiteCoder Team,
	title={LiteCoder: Advancing Small and Medium-sized Code Agents},
	author={Xiaoxuan Peng and Xinyu Lu and Kaiqi Zhang and Taosong Fang and Boxi Cao and Yaojie Lu},
	year={2025},
	}
	```

	## Future Directions

	- Scaling Environments: Expanding the diversity of Docker environments and teacher models to improve generalization.
	- Agentic RL: Implementing Reinforcement Learning specifically for multi-turn agentic workflows.

	## Team & Contributions

	- Xiaoxuan Peng: Main Contributor
	- [Xinyu Lu](https://scholar.google.com/citations?user=_OsLG8EAAAAJ&hl=zh-CN): Project Lead
	- Kaiqi Zhang: Contributor
	- Taosong Fang: Contributor
	- Boxi Cao: Contributor
	- Yaojie Lu: Contributor

	## Acknowledgements

	LiteCoder builds upon multiple open-source projects, including [Harbor](https://github.com/laude-institute/harbor). The models are trained using [AutoAlign](https://github.com/icip-cas/AutoAlign).

	## Join Us

	Join the discussion on our [Discord](https://discord.gg/EX9qZe8B).

	---
	license: mit
	base_model:
	- Qwen/Qwen3-4B-Instruct-2507
	---

	## LiteCoder-4b-Terminal-preview

	LiteCoder-4b-Terminal-preview is part of our series of models specialized in terminal-based interactions and stems from our recent efforts to develop capable small and medium-sized code agent models. The model is fine-tuned from `
	Qwen3-4B-Instruct-2507` on the [LiteCoder-SFT-Terminal-preview](https://huggingface.co/datasets/Lite-Coder/LiteCoder-SFT-Terminal-preview) dataset.

	Notably, this model achieves competitive results using fewer than 1,000 training samples. By relying entirely on a fully synthetic pipeline—without converting any existing datasets—we were able to secure significant gains on the challenging Terminal Bench, matching the performance of leading open-source models with extreme data efficiency.

	## Released Artifacts

	\| 2025/12/17 \| \| \|
	\| --- \| --- \| --- \|
	\| LiteCoder-4b-Terminal-preview \| Model \| https://huggingface.co/Lite-Coder/LiteCoder-4b-Terminal-preview \|
	\| LiteCoder-SFT-Terminal-preview \| Dataset \| https://huggingface.co/datasets/Lite-Coder/LiteCoder-SFT-Terminal-preview \|

	## Results

	Our models achieve competitive results on Terminal Bench, significantly outperforming general-purpose models of similar (and even larger) sizes.

	Terminal Bench 1.0 Performance

	\| Model \| Agent \| Results \|
	\| --- \| --- \| --- \|
	\| LiteCoder-30a3b-Terminal-preview \| Terminus 2 \| 18.75% \|
	\| Qwen3-30B-A3B-Nex-N1 \| Terminus 2 \| 18.75% \|
	\| LiteCoder-4b-Terminal-preview \| Terminus 2 \| 13.75% \|
	\| Qwen3-30B-A3B-Instruct \| Terminus 2 \| 12.5% \|
	\| Qwen3-4B-Instruct \| Terminus 2 \| 5.0% \|

	Terminal Bench 2.0 Performance

	\| Model \| Agent \| Results \|
	\| --- \| --- \| --- \|
	\| LiteCoder-30a3b-Terminal-preview \| Terminus 2 \| 5.6% \|
	\| LiteCoder-4b-Terminal-preview \| Terminus 2 \| 3.3% \|
	\| Qwen3-32B \| Terminus 2 \| 1.9% \|
	\| InternLM3-8B-Nex-N1 \| Terminus 2 \| 0% \|
	\| Qwen3-8B \| Terminus 2 \| 0% \|

	## Citation

	```latex
	@misc{LiteCoder Team,
	title={LiteCoder: Advancing Small and Medium-sized Code Agents},
	author={Xiaoxuan Peng and Xinyu Lu and Kaiqi Zhang and Taosong Fang and Boxi Cao and Yaojie Lu},
	year={2025},
	}
	```

	## Future Directions

	- Scaling Environments: Expanding the diversity of Docker environments and teacher models to improve generalization.
	- Agentic RL: Implementing Reinforcement Learning specifically for multi-turn agentic workflows.

	## Team & Contributions

	- Xiaoxuan Peng: Main Contributor
	- [Xinyu Lu](https://scholar.google.com/citations?user=_OsLG8EAAAAJ&hl=zh-CN): Project Lead
	- Kaiqi Zhang: Contributor
	- Taosong Fang: Contributor
	- Boxi Cao: Contributor
	- Yaojie Lu: Contributor

	## Acknowledgements

	LiteCoder builds upon multiple open-source projects, including [Harbor](https://github.com/laude-institute/harbor). The models are trained using [AutoAlign](https://github.com/icip-cas/AutoAlign).

	## Join Us

	Join the discussion on our [Discord](https://discord.gg/EX9qZe8B).