akacaptain
/

dragonclaw_model

Text Generation

Model card Files Files and versions

dragonclaw_model / README.md

akacaptain's picture

Update README.md

86989bb verified 26 days ago

|

history blame contribute delete

2.26 kB

	---
	license: llama3.2
	language:
	- en
	base_model: meta-llama/Llama-3.2-3B-Instruct
	tags:
	- llama
	- lora
	- peft
	- text-generation
	- openclaw
	---

	# akacaptain/dragonclaw_model

	## Model summary

	`akacaptain/dragonclaw_model` is a LoRA (PEFT) adapter fine-tuned on top of Meta’s `meta-llama/Llama-3.2-3B-Instruct`.

	It is not a standalone full model checkpoint: you must load the base `Llama-3.2-3B-Instruct` model and then apply this adapter.

	## How to use (Transformers)

	Prereqs:

	- You must be granted access to the gated base model on Hugging Face and be logged in (`huggingface-cli login`).

	Load:

	```python
	import torch
	from transformers import AutoTokenizer, AutoModelForCausalLM
	from peft import PeftModel

	base_model = "meta-llama/Llama-3.2-3B-Instruct"
	adapter_repo = "akacaptain/dragonclaw_model"

	tok = AutoTokenizer.from_pretrained(adapter_repo)
	base = AutoModelForCausalLM.from_pretrained(base_model)
	model = PeftModel.from_pretrained(base, adapter_repo)
	model.eval()

	messages = [
	{"role": "system", "content": "You are a helpful assistant."},
	{"role": "user", "content": "Hello!"},
	]
	prompt = tok.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
	inputs = tok(prompt, return_tensors="pt")

	with torch.inference_mode():
	out = model.generate(
	**inputs,
	max_new_tokens=128,
	do_sample=False,
	pad_token_id=tok.eos_token_id,
	)

	print(tok.decode(out[0], skip_special_tokens=True))
	```

	## Training details

	- Base model: `meta-llama/Llama-3.2-3B-Instruct`
	- Method: LoRA / PEFT (see `adapter_config.json`)
	- Training hardware: NVIDIA RTX 4090
	- Approximate training duration: ~10 minutes
	- Data: Fine-tuned on synthetically generated training data derived from the OpenClaw source code.

	## Evaluation

	- Automated evaluation: TODO (or: not yet published)

	## Limitations

	- Inherits the limitations and usage constraints of the base `Llama-3.2-3B-Instruct` model.
	- Synthetic training data can produce confident-sounding but incorrect configuration advice; always verify in real environments.

	## License / attribution

	- Ensure your Hub license/settings match the base model requirements and your organization’s policy.