Elle-72B-Geo-v1
Model Description
Elle-72B-Geo-v1 is a LoRA adapter fine-tuned on Qwen/Qwen2.5-72B-Instruct for geometry problem solving.
Model Details
- Base Model: Qwen/Qwen2.5-72B-Instruct
- Adapter Type: LoRA
- LoRA Rank: 64
- LoRA Alpha: 128
- Target Modules: All linear layers
Training Data
Fine-tuned on geometry-focused mathematical problems including:
- Coordinate geometry
- Triangle and circle problems
- Area and perimeter calculations
Usage
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
base_model = AutoModelForCausalLM.from_pretrained(
"Qwen/Qwen2.5-72B-Instruct",
torch_dtype="auto",
device_map="auto",
trust_remote_code=True
)
model = PeftModel.from_pretrained(base_model, "aphoticshaman/elle-72b-geo-v1")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-72B-Instruct")
License
Apache 2.0
- Downloads last month
- 24