# teach_lora1 – ML Tutor LoRA
A LoRA adapter fine-tuned on top of Qwen2.5-7B-Instruct to teach machine learning concepts clearly and accessibly, the way great teachers do.
The model explains ML topics using:
- Intuitive analogies first, before the math
- Gradual concept build-up, one step at a time
- An encouraging, patient tone that makes learners feel capable
- A practice question at the end of every answer to reinforce understanding
## Dataset Quality (DeepSeek judge, 5-point scale)
| Dimension | Score / 5 |
|---|---|
| Analogy quality | 4.67 |
| Clarity | 5.00 |
| Encouraging tone | 5.00 |
| Practice question | 5.00 |
| Conciseness | 4.33 |
| Total | 24.0 / 25 |
No entries were flagged below the quality threshold.
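The reported total is simply the sum of the five per-dimension scores (each out of 5); a quick sanity check in Python:

```python
# Per-dimension judge scores from the table above (each out of 5).
scores = {
    "Analogy quality": 4.67,
    "Clarity": 5.00,
    "Encouraging tone": 5.00,
    "Practice question": 5.00,
    "Conciseness": 4.33,
}

# The "Total" row is the plain sum, out of 25.
total = sum(scores.values())
print(f"{total:.1f} / 25")  # 24.0 / 25
```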
## Usage
```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel
import torch

BASE_MODEL = "Qwen/Qwen2.5-7B-Instruct"
LORA_PATH = "lifatsastain/teach_lora1"

# Load the base model in 4-bit NF4, matching the training quantization.
quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL, quantization_config=quant_config, device_map="auto"
)
model = PeftModel.from_pretrained(model, LORA_PATH)
model.eval()
tokenizer = AutoTokenizer.from_pretrained(LORA_PATH)

SYSTEM = (
    "You are an ML tutor teaching CS students who know coding but not ML. "
    "Always start with an intuitive analogy, build up to the concept, "
    "and end with a practice question. Be encouraging and patient."
)

messages = [
    {"role": "system", "content": SYSTEM},
    {"role": "user", "content": "What is gradient descent?"},
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt").to(model.device)

with torch.no_grad():
    output_ids = model.generate(
        **inputs,
        max_new_tokens=512,
        temperature=0.7,
        top_p=0.9,
        do_sample=True,
        pad_token_id=tokenizer.eos_token_id,
    )

# Decode only the newly generated tokens, not the prompt.
new_tokens = output_ids[0][inputs["input_ids"].shape[-1]:]
print(tokenizer.decode(new_tokens, skip_special_tokens=True))
```
### Multi-turn chat

For longer conversations, keep a running history and re-send it each turn:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel
import torch

BASE_MODEL = "Qwen/Qwen2.5-7B-Instruct"
LORA_PATH = "lifatsastain/teach_lora1"

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

tokenizer = AutoTokenizer.from_pretrained(LORA_PATH)
base_model = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL,
    quantization_config=bnb_config,
    device_map="auto",
)
model = PeftModel.from_pretrained(base_model, LORA_PATH)
model.eval()

SYSTEM = (
    "You are an ML tutor teaching CS students who know coding but not ML. "
    "Always start with an intuitive analogy, build up to the concept, "
    "and end with a practice question. Be encouraging and patient."
)

conversation_history = []

def chat(user_message):
    """Append the user turn, generate a reply, and keep it in the history."""
    conversation_history.append({"role": "user", "content": user_message})
    messages = [{"role": "system", "content": SYSTEM}] + conversation_history
    text = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )
    inputs = tokenizer(text, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=1024,
            temperature=0.7,
            top_p=0.9,
            do_sample=True,
            pad_token_id=tokenizer.eos_token_id,
        )
    # Decode only the newly generated tokens, then remember the reply.
    new_tokens = output_ids[0][inputs["input_ids"].shape[-1]:]
    response = tokenizer.decode(new_tokens, skip_special_tokens=True)
    conversation_history.append({"role": "assistant", "content": response})
    return response

print("ML Tutor ready! Type 'quit' to exit, 'reset' to clear history.\n")
while True:
    user_input = input("You: ").strip()
    if not user_input:
        continue
    if user_input.lower() == "quit":
        print("Bye!")
        break
    if user_input.lower() == "reset":
        conversation_history.clear()
        print("Conversation history cleared.\n")
        continue
    response = chat(user_input)
    print(f"\nTutor: {response}\n")
```
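The loop above resends the full history every turn; since the adapter was trained with a 512-token context, very long chats can drift past what it saw in training. A minimal trimming sketch (the `MAX_TURNS` value is an assumption, not something this model card specifies):

```python
MAX_TURNS = 6  # assumed cap: keep only the most recent 6 messages

def trim_history(history, max_turns=MAX_TURNS):
    """Drop the oldest messages so only the latest turns are resent."""
    return history[-max_turns:]

# Example: a 10-message history is cut down to the newest 6 entries.
history = [{"role": "user", "content": f"question {i}"} for i in range(10)]
print(len(trim_history(history)))  # 6
```

Inside `chat`, this would replace the history expansion: `messages = [{"role": "system", "content": SYSTEM}] + trim_history(conversation_history)`.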
## Training Details
| Parameter | Value |
|------------------------|-------------------------------|
| Base model | Qwen2.5-7B-Instruct |
| LoRA rank (r) | 16 |
| LoRA alpha | 32 |
| LoRA dropout | 0.05 |
| Target modules | q_proj, k_proj, v_proj, o_proj |
| Training epochs | 1 |
| Learning rate | 2e-4 |
| Batch size | 1 (grad accum 16) |
| Max sequence length | 512 |
| Quantization | 4-bit NF4 |
| Optimizer | paged_adamw_8bit |
### Framework Versions
- transformers: 5.3.0
- bitsandbytes: 0.49.2
- peft: 0.18.1
- torch: 2.10.0+cu126
- trl: 0.29.0
- datasets: 4.7.0