shivs28's picture
Update README.md
5ca12b8 verified
# JEE NUJAN Math Expert 🎯📚
**The Ultimate JEE Mathematics AI Tutor - Fine-tuned Specialist**
This is a fine-tuned version of [JEE NUJAN Mix v2 Base](https://huggingface.co/shivs28/jee_nujan_mix_v2_base) specifically trained on JEE-style mathematics problems to excel at Indian competitive exam mathematics.
## 🏆 Model Details
- **Base Model**: `shivs28/jee_nujan_mix_v2_base`
- **Fine-tuning Dataset**: 500+ JEE-relevant mathematics problems from MATH dataset
- **Training Steps**: 150 (optimized for mathematical reasoning)
- **LoRA Configuration**: Rank 32, Alpha 64 (high-performance setup)
- **Specialization**: JEE Main & Advanced mathematics problems
## 🎯 Mathematical Capabilities
This model excels at:
### Core JEE Topics
- **Algebra**: Quadratic equations, inequalities, sequences & series
- **Calculus**: Limits, derivatives, integrals, applications
- **Coordinate Geometry**: Lines, circles, parabolas, ellipses, hyperbolas
- **Trigonometry**: Identities, equations, inverse functions
- **Probability**: Conditional probability, distributions, combinatorics
- **Number Theory**: Divisibility, modular arithmetic, prime numbers
- **Vector Algebra**: Dot product, cross product, scalar triple product
### Problem-Solving Approach
- **Step-by-step Solutions**: Clear mathematical progression
- **Multiple Methods**: Shows different approaches when applicable
- **Error Prevention**: Highlights common JEE mistakes
- **Time-Efficient**: Optimized for exam conditions
## 🚀 Usage Examples
### Basic Usage
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
model_name = "shivs28/jee_nujan_math_expert"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)
# JEE problem format
jee_prompt = '''<|problem|>
Find the number of real solutions of the equation x³ - 3x² + 2x - 1 = 0 in the interval [0, 3].
<|solution|>'''
inputs = tokenizer(jee_prompt, return_tensors="pt")
outputs = model.generate(
**inputs,
max_length=800,
temperature=0.1, # Low temperature for mathematical accuracy
do_sample=True,
pad_token_id=tokenizer.pad_token_id,
repetition_penalty=1.05
)
solution = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(solution)
```
### Advanced JEE Problem
```python
complex_problem = '''<|problem|>
In triangle ABC, if a = 7, b = 8, c = 9, find:
1. The area of triangle ABC
2. The radius of the circumscribed circle
3. The radius of the inscribed circle
<|solution|>'''
# Generate comprehensive solution
inputs = tokenizer(complex_problem, return_tensors="pt")
outputs = model.generate(
**inputs,
max_length=1200,
temperature=0.05, # Very low for multi-step problems
top_p=0.95,
do_sample=True,
pad_token_id=tokenizer.pad_token_id
)
```
## ⚙️ Recommended Generation Settings
### For JEE Main Problems
```python
generation_config = {
"max_length": 800,
"temperature": 0.1,
"top_p": 0.95,
"do_sample": True,
"repetition_penalty": 1.05,
"pad_token_id": tokenizer.pad_token_id
}
```
### For JEE Advanced Problems
```python
advanced_config = {
"max_length": 1200, # Longer for complex solutions
"temperature": 0.05, # Very low for accuracy
"top_p": 0.9,
"do_sample": True,
"repetition_penalty": 1.1,
"pad_token_id": tokenizer.pad_token_id
}
```
## 🎯 Training Details
- **Architecture**: LoRA fine-tuning on base model
- **Training Data**: Carefully curated JEE-relevant problems
- **Optimization**: Focused on mathematical reasoning patterns
- **Validation**: Tested on held-out JEE problems
### LoRA Configuration
- **Rank (r)**: 32
- **Alpha**: 64
- **Dropout**: 0.1
- **Target Modules**: All attention and MLP layers
- **Trainable Parameters**: ~2.1% of total parameters
## 🏅 Best Practices for JEE Preparation
1. **Use specific problem format**: Always use `<|problem|>` and `<|solution|>` tags
2. **Low temperature**: Use 0.05-0.1 for mathematical accuracy
3. **Adequate length**: Set max_length based on problem complexity
4. **Multiple attempts**: Try different seeds for various solution approaches
5. **Verify results**: Always cross-check mathematical calculations
## 📈 Use Cases
### For Students
- **Practice Problems**: Generate solutions with explanations
- **Concept Clarification**: Understand mathematical reasoning
- **Exam Preparation**: Practice with JEE-style problems
- **Error Analysis**: Learn from common mistakes
### For Educators
- **Solution Generation**: Create detailed problem solutions
- **Teaching Aid**: Step-by-step mathematical explanations
- **Problem Variation**: Generate similar problems for practice
- **Assessment**: Evaluate student understanding
## 🔧 Technical Specifications
- **Base Architecture**: Transformer-based language model
- **Fine-tuning Method**: LoRA (Low-Rank Adaptation)
- **Precision**: 16-bit floating point
- **Context Length**: 768 tokens (optimized for detailed solutions)
- **Vocabulary**: Extended with mathematical notation
## 📝 Citation
If you use this model in your research or educational content, please cite:
```bibtex
@model{jee_nujan_math_expert,
title={JEE NUJAN Math Expert: Fine-tuned Mathematics Specialist},
author={shivs28},
year={2025},
url={https://huggingface.co/shivs28/jee_nujan_math_expert}
}
```
## 🤝 Contributing
Found an issue or have suggestions? Open an issue on the model repository!