You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Uploaded finetuned model

  • Developed by: Harsh Srivastava
  • License: cc-by-nc-3.0
  • Finetuned from model : unsloth/phi-4-mini-reasoning This phi3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Phi-4 Mini Reasoning – JEE Mathematics Finetuned Model

Developer

Harsh Srivastava

Base Model

unsloth/phi-4-mini-reasoning

Description

This model is a finetuned version of Phi-4 Mini Reasoning designed for solving JEE-level mathematics problems.

The model is optimized for step-by-step mathematical reasoning and symbolic problem solving.

Training Dataset

Total samples used: 356,532 not that much but above 200k samples trained we are still training it better on various datasets for jee by the help of the keyword filters

Sources:

  • AI-MO/NuminaMath-TIR — 68,850
  • MetaMathQA — 230,808
  • TIGER-Lab MathInstruct — 125,220
  • PhysicsWallahAI JEE Main 2025 (Jan) — 182
  • PhysicsWallahAI JEE Main 2025 (Apr) — 169
  • MMLU High School Mathematics — 78
  • MMLU College Mathematics — 50
  • MMLU Abstract Algebra — 25

Training Details

Base model: Phi-4 Mini Reasoning
Framework: Unsloth + HuggingFace TRL
Training method: LoRA finetuning
Sequence length: 2048
Optimizer: AdamW 8bit

Purpose

The model is designed for:

  • JEE mathematics reasoning
  • Step-by-step mathematical explanations
  • Competitive exam problem solving
Downloads last month
2
Safetensors
Model size
4B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 1 Ask for provider support