question_generation_1.5B_model_v2 / README.md

Update README.md

7f93f95 verified 2 months ago

4.5 kB

	---
	library_name: peft
	license: apache-2.0
	base_model: Qwen/Qwen2.5-1.5B-Instruct
	tags:
	- base_model:adapter:Qwen/Qwen2.5-1.5B-Instruct
	- llama-factory
	- lora
	- transformers
	- question-generation
	- education
	- secondary-school
	pipeline_tag: text-generation
	model-index:
	- name: question_generation_1.5B_model_v2
	results: []
	---

	# Question Generation 1.5B Model v2

	A fine-tuned language model specifically designed to generate high-quality English comprehension and assessment questions for secondary school students. This model is optimized to create questions aligned with standard educational curricula and learning objectives.

	## Model Description

	This model is a LoRA (Low-Rank Adaptation) fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct). It has been trained specifically on educational question generation tasks to produce contextually relevant, pedagogically sound questions suitable for secondary school assessment.

	### Key Features

	- Lightweight and Efficient: 1.5B parameters with LoRA adaptation for fast inference
	- Education-Focused: Trained on curated educational content
	- Curriculum-Aligned: Questions follow standard secondary school curricula and learning outcomes
	- Question Variety: Capable of generating multiple question types (multiple choice, short answer, essay prompts, etc.)
	- Context-Aware: Generates questions based on provided text passages or topics

	## Intended Use

	This model is intended for:
	- Educational Content Creation: Generating practice questions and assessments for secondary school students
	- Curriculum Support: Creating supplementary learning materials aligned with educational standards
	- Assessment Design: Assisting educators in developing comprehension questions and quiz content
	- Language Learning: Generating English language proficiency assessment questions

	### Limitations

	- Designed for English language question generation
	- Best performance on secondary school level content (ages 14-18)
	- May require post-processing or human review for use in high-stakes assessments
	- Performance may vary with non-English text inputs

	## Training Data

	The model was fine-tuned on a curated dataset of secondary school English curriculum materials and assessment question templates. Training data includes various question types aligned with standard educational frameworks.

	## Training Procedure

	### Hyperparameters

	\| Parameter \| Value \|
	\|-----------\|-------\|
	\| Learning Rate \| 0.0005 \|
	\| Training Batch Size \| 8 (gradient accumulation) \|
	\| Epochs \| 10 \|
	\| Optimizer \| AdamW (fused) \|
	\| LR Scheduler \| Cosine with 0.1 warmup ratio \|
	\| Seed \| 42 \|
	\| Training Precision \| Native AMP (Mixed Precision) \|

	### Training Performance

	The model achieved strong convergence with decreasing training loss across epochs:

	\| Epoch \| Step \| Training Loss \|
	\|-------\|------\|---------------\|
	\| 1.1 \| 100 \| 0.6345 \|
	\| 2.3 \| 200 \| 0.4720 \|
	\| 3.4 \| 300 \| 0.3499 \|
	\| 4.5 \| 400 \| 0.2457 \|
	\| 5.7 \| 500 \| 0.1229 \|
	\| 6.8 \| 600 \| 0.0728 \|
	\| 8.0 \| 700 \| 0.0398 \|
	\| 9.1 \| 800 \| 0.0213 \|

	The model demonstrates consistent improvement in question generation quality as training progresses, with training loss decreasing from 0.63 to 0.02.

	## Framework Versions

	- PEFT: 0.17.1
	- Transformers: 4.57.1
	- PyTorch: 2.9.0+cu126
	- Datasets: 4.0.0
	- Tokenizers: 0.22.1

	## Usage

	```python
	from peft import AutoPeftModelForCausalLM
	from transformers import AutoTokenizer

	model_id = "tokhey/question_generation_1.5B_model_v2"
	model = AutoPeftModelForCausalLM.from_pretrained(model_id)
	tokenizer = AutoTokenizer.from_pretrained(model_id)

	# Generate questions from a passage
	prompt = "Generate 3 comprehension questions about: [your text passage]"
	inputs = tokenizer(prompt, return_tensors="pt")
	outputs = model.generate(**inputs, max_length=512)
	print(tokenizer.decode(outputs[0]))
	```

	## Recommendations for Use

	- Test the model on sample content before deploying in production
	- Review generated questions for accuracy and appropriateness
	- Use as an assistive tool to reduce educator workload, not as a sole assessment creation method
	- Provide context and learning materials with generated questions for optimal student engagement

	## License

	Apache License 2.0

	---

	This model card was automatically generated and updated. For questions or contributions, please reach out to the model developers.