harsh762011
/

numinao14

Text Generation

text-generation-inference

Model card Files Files and versions

numinao14 / README.md

harsh762011's picture

Update README.md

e17ef42 verified about 6 hours ago

|

history blame contribute delete

1.76 kB

	---
	base_model: unsloth/phi-4-mini-reasoning
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- phi3
	license: cc-by-nc-3.0
	language:
	- en
	---

	# Uploaded finetuned model
	- Developed by: Harsh Srivastava
	- License: cc-by-nc-3.0
	- Finetuned from model : unsloth/phi-4-mini-reasoning
	This phi3 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

	# Phi-4 Mini Reasoning – JEE Mathematics Finetuned Model

	## Developer
	Harsh Srivastava

	## Base Model
	unsloth/phi-4-mini-reasoning

	## Description
	This model is a finetuned version of Phi-4 Mini Reasoning designed for solving
	JEE-level mathematics problems.

	The model is optimized for step-by-step mathematical reasoning and symbolic problem solving.

	## Training Dataset

	Total samples used: 356,532 not that much but above 200k samples trained we are still training it better on various datasets for jee by the help of the keyword filters


	Sources:
	- AI-MO/NuminaMath-TIR — 68,850
	- MetaMathQA — 230,808
	- TIGER-Lab MathInstruct — 125,220
	- PhysicsWallahAI JEE Main 2025 (Jan) — 182
	- PhysicsWallahAI JEE Main 2025 (Apr) — 169
	- MMLU High School Mathematics — 78
	- MMLU College Mathematics — 50
	- MMLU Abstract Algebra — 25

	## Training Details
	Base model: Phi-4 Mini Reasoning
	Framework: Unsloth + HuggingFace TRL
	Training method: LoRA finetuning
	Sequence length: 2048
	Optimizer: AdamW 8bit

	## Purpose
	The model is designed for:
	- JEE mathematics reasoning
	- Step-by-step mathematical explanations
	- Competitive exam problem solving