---
library_name: peft
license: apache-2.0
base_model: Qwen/Qwen2.5-1.5B-Instruct
tags:
- base_model:adapter:Qwen/Qwen2.5-1.5B-Instruct
- llama-factory
- lora
- transformers
- question-generation
- education
- secondary-school
pipeline_tag: text-generation
model-index:
- name: question_generation_1.5B_model_v2
  results: []
---

# Question Generation 1.5B Model v2

A fine-tuned language model specifically designed to generate high-quality English comprehension and assessment questions for secondary school students. This model is optimized to create questions aligned with standard educational curricula and learning objectives.

## Model Description

This model is a LoRA (Low-Rank Adaptation) fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct). It has been trained specifically on educational question generation tasks to produce contextually relevant, pedagogically sound questions suitable for secondary school assessment.

### Key Features

- **Lightweight and Efficient**: 1.5B parameters with LoRA adaptation for fast inference
- **Education-Focused**: Trained on curated educational content
- **Curriculum-Aligned**: Questions follow standard secondary school curricula and learning outcomes
- **Question Variety**: Capable of generating multiple question types (multiple choice, short answer, essay prompts, etc.)
- **Context-Aware**: Generates questions based on provided text passages or topics

## Intended Use

This model is intended for:
- **Educational Content Creation**: Generating practice questions and assessments for secondary school students
- **Curriculum Support**: Creating supplementary learning materials aligned with educational standards
- **Assessment Design**: Assisting educators in developing comprehension questions and quiz content
- **Language Learning**: Generating English language proficiency assessment questions

### Limitations

- Designed for English language question generation
- Best performance on secondary school level content (ages 14-18)
- May require post-processing or human review for use in high-stakes assessments
- Performance may vary with non-English text inputs

## Training Data

The model was fine-tuned on a curated dataset of secondary school English curriculum materials and assessment question templates. Training data includes various question types aligned with standard educational frameworks.

## Training Procedure

### Hyperparameters

| Parameter | Value |
|-----------|-------|
| Learning Rate | 0.0005 |
| Training Batch Size | 8 (gradient accumulation) |
| Epochs | 10 |
| Optimizer | AdamW (fused) |
| LR Scheduler | Cosine with 0.1 warmup ratio |
| Seed | 42 |
| Training Precision | Native AMP (Mixed Precision) |

### Training Performance

The model achieved strong convergence with decreasing training loss across epochs:

| Epoch | Step | Training Loss |
|-------|------|---------------|
| 1.1   | 100  | 0.6345        |
| 2.3   | 200  | 0.4720        |
| 3.4   | 300  | 0.3499        |
| 4.5   | 400  | 0.2457        |
| 5.7   | 500  | 0.1229        |
| 6.8   | 600  | 0.0728        |
| 8.0   | 700  | 0.0398        |
| 9.1   | 800  | 0.0213        |

The model demonstrates consistent improvement in question generation quality as training progresses, with training loss decreasing from 0.63 to 0.02.

## Framework Versions

- PEFT: 0.17.1
- Transformers: 4.57.1
- PyTorch: 2.9.0+cu126
- Datasets: 4.0.0
- Tokenizers: 0.22.1

## Usage

```python
from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

model_id = "tokhey/question_generation_1.5B_model_v2"
model = AutoPeftModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Generate questions from a passage
prompt = "Generate 3 comprehension questions about: [your text passage]"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_length=512)
print(tokenizer.decode(outputs[0]))
```

## Recommendations for Use

- Test the model on sample content before deploying in production
- Review generated questions for accuracy and appropriateness
- Use as an assistive tool to reduce educator workload, not as a sole assessment creation method
- Provide context and learning materials with generated questions for optimal student engagement

## License

Apache License 2.0

---

*This model card was automatically generated and updated. For questions or contributions, please reach out to the model developers.*