Instructions to use Dapinsky/PIT-4B-FT-202212-math-dapo-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use Dapinsky/PIT-4B-FT-202212-math-dapo-lora with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Diamegs/PIT-4B-FT-202212") model = PeftModel.from_pretrained(base_model, "Dapinsky/PIT-4B-FT-202212-math-dapo-lora") - Notebooks
- Google Colab
- Kaggle
Dapinsky/PIT-4B-FT-202212-math-dapo-lora
This repo contains a PEFT LoRA adapter trained from Diamegs/PIT-4B-FT-202212.
It intentionally does not include merged base-model weights.
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
base_model = "Diamegs/PIT-4B-FT-202212"
adapter_id = "Dapinsky/PIT-4B-FT-202212-math-dapo-lora"
tokenizer = AutoTokenizer.from_pretrained(adapter_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(base_model, trust_remote_code=True)
model = PeftModel.from_pretrained(model, adapter_id)
Training Summary
- Base model:
Diamegs/PIT-4B-FT-202212 - Fine-tuning method: LoRA post-training, adapter-only upload.
- Tokenizer files copied from:
base model
LoRA Configuration
{
"peft_type": "LORA",
"task_type": "CAUSAL_LM",
"r": 64,
"lora_alpha": 128,
"lora_dropout": 0.05,
"target_modules": [
"c_fc",
"c_proj",
"c_k",
"c_q",
"c_v"
],
"base_model_name_or_path": "Diamegs/PIT-4B-FT-202212"
}
- Downloads last month
- 25
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for Dapinsky/PIT-4B-FT-202212-math-dapo-lora
Base model
Diamegs/PIT-4B-FT-202212