---
library_name: transformers
datasets:
- HuggingFaceH4/MATH-500
language:
- en
base_model:
- LiquidAI/LFM2-350M-Math
---
# LFM2-350M-Math - Fine-tuned
## Model Description
This is a **LoRA fine-tuned version** of `LiquidAI/LFM2-350M-Math`, trained on the **HuggingFaceH4/MATH-500** dataset.
The model answers math questions and generates step-by-step solutions in natural language.
---
## Intended Uses
- Solve math problems directly in natural language.
- Serve as a base for further fine-tuning on other math datasets.
- Educational tools, tutoring systems, or research in automated math reasoning.
---
## Out-of-Scope Uses
- Not intended for general reasoning beyond mathematics.
- May fail or hallucinate on complex, unseen problem types.
- Should not be used for critical calculations without verification.
---
## Limitations & Biases
- The training dataset is small (500 problems), so the model may **overfit**.
- Step-by-step reasoning is learned from dataset patterns, not true reasoning.
- Accuracy can vary; always verify outputs for correctness.
---
## Training Details
- **Base Model:** LiquidAI/LFM2-350M-Math
- **Dataset:** HuggingFaceH4/MATH-500
- **Fine-tuning libraries:** transformers, datasets, peft, accelerate, bitsandbytes
- **Training arguments:**
  - `num_train_epochs`: 3
  - `per_device_train_batch_size`: 4
  - `gradient_accumulation_steps`: 4
  - `learning_rate`: 2e-4
  - `fp16`: True
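
The hyperparameters above can be wired together roughly as follows. This is a minimal sketch, not the exact training script: the LoRA rank, alpha, target modules, and output directory are assumptions that are not stated in this card.

```python
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          Trainer, TrainingArguments)

base_model = "LiquidAI/LFM2-350M-Math"
tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(base_model)

# LoRA adapter config -- rank, alpha, and target modules are
# assumptions; the card does not specify them.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# Training arguments as listed above.
training_args = TrainingArguments(
    output_dir="lfm2-math-finetuned",
    num_train_epochs=3,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=4,
    learning_rate=2e-4,
    fp16=True,
)
```

With `gradient_accumulation_steps=4` and a per-device batch size of 4, the effective batch size is 16 per device.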
---
## Usage
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_model = "LiquidAI/LFM2-350M-Math"
adapter_repo = "Nrex/lfm2-math-finetuned"

# Load the tokenizer from the adapter repo and attach the LoRA
# adapter to the base model.
tokenizer = AutoTokenizer.from_pretrained(adapter_repo)
model = AutoModelForCausalLM.from_pretrained(base_model)
model = PeftModel.from_pretrained(model, adapter_repo)
model.eval()

inputs = tokenizer("What is 12*13?", return_tensors="pt")
# Without max_new_tokens, generate() stops after ~20 tokens,
# truncating step-by-step solutions.
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
---
## Evaluation
- Tested on a held-out 10% split of the dataset.
- With only 500 problems in total, accuracy estimates are noisy; always verify outputs for correctness.