omercankoksal
/

springboot-assistant

Text Generation

Model card Files Files and versions

springboot-assistant / README.md

Ömercan Köksal

Update README.md

f7f5a5a verified about 2 months ago

|

History Blame Contribute Delete

2.55 kB

	---
	language:
	- en
	license: apache-2.0
	base_model: Qwen/Qwen2.5-1.5B-Instruct
	tags:
	- spring-boot
	- java
	- code
	- fine-tuned
	- qlora
	- peft
	pipeline_tag: text-generation
	---

	# Spring Boot Assistant

	A fine-tuned version of [Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct)
	specialized for Spring Boot and Java development questions.

	## Model Description

	This model was fine-tuned using QLoRA (Quantized Low-Rank Adaptation) on a
	combination of Java code examples from CodeAlpaca-20k and custom Spring Boot
	question-answer pairs. It is designed to assist developers with Spring Boot
	concepts, patterns, and best practices.

	## Training Details

	\| Parameter \| Value \|
	\|---\|---\|
	\| Base Model \| Qwen/Qwen2.5-1.5B-Instruct \|
	\| Fine-tuning Method \| QLoRA (LoRA + float16) \|
	\| LoRA Rank (r) \| 16 \|
	\| LoRA Alpha \| 32 \|
	\| Training Epochs \| 3 \|
	\| Batch Size \| 2 \|
	\| Learning Rate \| 2e-4 \|
	\| LR Scheduler \| Cosine \|
	\| Hardware \| Apple Silicon (MPS) \|

	## Training Data

	- [CodeAlpaca-20k](https://huggingface.co/datasets/sahil2801/CodeAlpaca-20k)
	— filtered for Java examples (~200 samples)
	- Custom Spring Boot Q&A pairs covering:
	- Dependency Injection
	- REST API development
	- Spring annotations
	- Exception handling
	- Database configuration
	- Security with JWT
	- Testing with @SpringBootTest

	## How to Use

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer
	from peft import PeftModel
	import torch

	# Load base model
	base_model = AutoModelForCausalLM.from_pretrained(
	"Qwen/Qwen2.5-1.5B-Instruct",
	torch_dtype=torch.float16,
	device_map="auto",
	trust_remote_code=True,
	)

	# Load fine-tuned weights
	model = PeftModel.from_pretrained(
	base_model,
	"Break0635/springboot-assistant"
	)

	tokenizer = AutoTokenizer.from_pretrained(
	"Qwen/Qwen2.5-1.5B-Instruct",
	trust_remote_code=True,
	)

	# Generate answer
	def ask(question: str) -> str:
	prompt = f"### Instruction:\n{question}\n\n### Response:\n"
	inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

	with torch.no_grad():
	outputs = model.generate(
	**inputs,
	max_new_tokens=256,
	temperature=0.7,
	do_sample=True,
	pad_token_id=tokenizer.eos_token_id,
	)

	generated = outputs[0][inputs["input_ids"].shape[1]:]
	return tokenizer.decode(generated, skip_special_tokens=True).strip()

	# Example usage
	answer = ask("What is dependency injection in Spring Boot?")
	print(answer)
	```

	## Example Questions