Elle-72B-Math-v2

Model Description

Elle-72B-Math-v2 is a LoRA adapter for Qwen/Qwen2.5-72B-Instruct, fine-tuned for mathematical reasoning on the NuminaMath-CoT dataset.

Model Details

  • Base Model: Qwen/Qwen2.5-72B-Instruct
  • Adapter Type: LoRA
  • LoRA Rank: 64
  • LoRA Alpha: 128
  • Target Modules: All linear layers
  • Training Data: NuminaMath-CoT
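
To get a sense of the adapter's footprint: a LoRA module of rank r on a d_out × d_in linear weight adds r·(d_in + d_out) trainable parameters, and its update B·A is scaled by alpha/r (here 128/64 = 2). A minimal sketch, assuming Qwen2.5-72B's published hidden size of 8192 (the helper name is ours, not part of the model):

```python
def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    # LoRA parameterizes the weight update dW as B @ A,
    # where A has shape (r, d_in) and B has shape (d_out, r).
    return r * d_in + d_out * r

rank, alpha = 64, 128
scaling = alpha / rank  # the B @ A update is scaled by alpha / r

# One square 8192x8192 projection at rank 64:
print(lora_param_count(8192, 8192, rank))  # 1048576 extra params
print(scaling)                             # 2.0
```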

Training

Fine-tuned using:

  • Chain-of-thought mathematical reasoning examples
  • Step-by-step problem decomposition
  • Multiple solution strategies (algebraic, numerical, symbolic)
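
The training script is not published; a hypothetical `peft` configuration matching the hyperparameters listed above would look like:

```python
from peft import LoraConfig

# Illustrative config only -- matches the card's stated hyperparameters,
# not the author's actual training script.
lora_config = LoraConfig(
    r=64,                         # LoRA rank
    lora_alpha=128,               # scaling numerator (alpha / r = 2)
    target_modules="all-linear",  # adapt all linear layers
    task_type="CAUSAL_LM",
)
```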

Usage

from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the frozen base model, then attach the LoRA adapter on top of it.
base_model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-72B-Instruct",
    torch_dtype="auto",
    device_map="auto",
    trust_remote_code=True
)
model = PeftModel.from_pretrained(base_model, "aphoticshaman/elle-72b-math-v2")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-72B-Instruct")
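
With `model` and `tokenizer` loaded as above, generation follows the standard Qwen2.5 chat flow; a sketch (the prompt and `max_new_tokens` value are illustrative):

```python
messages = [
    {"role": "user", "content": "Solve for x: 2x + 3 = 11. Show your steps."}
]
# Format the conversation with the model's chat template and tokenize it.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
# Decode only the newly generated tokens, not the echoed prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```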

License

Apache 2.0

