---
license: apache-2.0
language:
- en
tags:
- code
- programming
- mathematics
- reasoning
- text-generation
- conversational
pipeline_tag: text-generation
library_name: transformers
datasets:
- uaytug/UCDS
model-index:
- name: ucoder-mini
  results: []
---

# uCoder Mini
**Parameters:** 1.5B | **Context Length:** 4096 tokens | **License:** Apache 2.0
## Overview

**uCoder Mini** is a compact 1.5B-parameter language model fine-tuned for code generation and mathematical reasoning. Despite its small size, it delivers strong performance on programming tasks across multiple languages and on competitive programming challenges.

It was trained on the [UCDS (uCoder Dataset)](https://huggingface.co/datasets/uaytug/UCDS), a curated collection of 420K+ high-quality coding and mathematics samples.

## Intended Use

- **Code generation** across Python, JavaScript, C++, Java, and more
- **Competitive programming** problem solving
- **Mathematical reasoning** and problem breakdown
- **Code explanation** and debugging assistance
- **Learning companion** for programming concepts

## Quick Start

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "uaytug/ucoder-mini"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",
    device_map="auto",
)

messages = [
    {"role": "user", "content": "Write a Python function to find the longest palindromic substring."}
]

# Build the ChatML prompt and tokenize it
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt").to(model.device)

outputs = model.generate(
    **inputs,
    max_new_tokens=512,
    temperature=0.7,
    top_p=0.9,
    do_sample=True,
)

# Decode only the newly generated tokens, skipping the prompt
response = tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
print(response)
```

## Chat Template

uCoder Mini uses the ChatML format:

```
<|im_start|>user
Your question here<|im_end|>
<|im_start|>assistant
```

## Training Details

| Attribute | Value |
|-----------|-------|
| **Training Dataset** | [uaytug/UCDS](https://huggingface.co/datasets/uaytug/UCDS) |
| **Dataset Size** | 420,686 samples |
| **Training Method** | Supervised Fine-Tuning (SFT) |
| **Precision** | bfloat16 |
| **Context Length** | 4096 tokens |

### Dataset Composition

The UCDS dataset combines high-quality sources:

| Source | Description |
|--------|-------------|
| CodeForces | Competitive programming problems |
| Code-Contests-Plus | Algorithm challenges with solutions |
| CodeAlpaca | Instruction-following code samples |
| OpenMathInstruct | Mathematical reasoning chains |
| Additional curated sources | Tool use, chain-of-thought examples |

## Limitations

- **Model size**: As a 1.5B model, it may struggle with highly complex multi-file projects
- **Knowledge cutoff**: Libraries and APIs released after the training data was collected may not be covered
- **Verification needed**: Always review and test generated code before production use (a minimal smoke-test sketch follows this list)
- **Language coverage**: Strongest in Python; other languages may have reduced quality
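The verification point above can be mechanized with a minimal smoke test: execute the generated snippet in an isolated namespace and assert expected behavior before trusting it. This is an illustrative sketch, not part of the model's API; the `generated_code` snippet and test cases are stand-ins for real model output, and `exec` should only ever run untrusted code inside a proper sandbox.

```python
# Minimal smoke test for model-generated code (illustrative sketch).
# `generated_code` is a stand-in for text captured from `response` above.
generated_code = """
def add(a, b):
    return a + b
"""

namespace = {}
exec(generated_code, namespace)  # only run untrusted code in a sandboxed environment

# Assert expected behavior before using the function anywhere else
assert namespace["add"](2, 3) == 5
assert namespace["add"](-1, 1) == 0
print("all checks passed")
```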
## Example Outputs

### Competitive Programming

**Prompt:** Solve the two-sum problem efficiently.

**Response:**

```python
def two_sum(nums, target):
    seen = {}
    for i, num in enumerate(nums):
        complement = target - num
        if complement in seen:
            return [seen[complement], i]
        seen[num] = i
    return []
```
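A quick check of the response above (the call and expected output are illustrative additions, not part of the model's answer):

```python
print(two_sum([2, 7, 11, 15], 9))  # [0, 1], since nums[0] + nums[1] == 9
```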
### Mathematical Reasoning

**Prompt:** Find the time complexity of merge sort and explain why.

**Response:**

Merge sort has O(n log n) time complexity. Here's why:

1. **Divide step**: The array is split in half recursively, creating log(n) levels
2. **Merge step**: At each level, we process all n elements exactly once
3. **Total**: n elements × log(n) levels = O(n log n)

This holds for best, average, and worst cases, making merge sort consistently efficient.
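To make the divide and merge steps concrete, here is a minimal merge sort implementation (an illustrative sketch added for this card, not model output):

```python
def merge_sort(arr):
    """Sort a list in O(n log n): log(n) levels of splitting, O(n) merge work per level."""
    if len(arr) <= 1:  # base case: 0 or 1 elements are already sorted
        return arr
    mid = len(arr) // 2
    left = merge_sort(arr[:mid])    # divide step: recurse on each half
    right = merge_sort(arr[mid:])
    # merge step: every element is touched exactly once at this level
    merged, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged

print(merge_sort([5, 2, 9, 1, 5, 6]))  # [1, 2, 5, 5, 6, 9]
```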
## Citation

```bibtex
@misc{ucoder-mini,
  author = {uaytug},
  title = {uCoder Mini: A Compact Code and Math Language Model},
  year = {2025},
  publisher = {Hugging Face},
  url = {https://huggingface.co/uaytug/ucoder-mini}
}
```