Instructions to use akacaptain/dragonclaw_model with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use akacaptain/dragonclaw_model with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-3B-Instruct") model = PeftModel.from_pretrained(base_model, "akacaptain/dragonclaw_model") - Notebooks
- Google Colab
- Kaggle
| license: llama3.2 | |
| language: | |
| - en | |
| base_model: meta-llama/Llama-3.2-3B-Instruct | |
| tags: | |
| - llama | |
| - lora | |
| - peft | |
| - text-generation | |
| - openclaw | |
| # akacaptain/dragonclaw_model | |
| ## Model summary | |
| `akacaptain/dragonclaw_model` is a **LoRA (PEFT) adapter** fine-tuned on top of Meta’s `meta-llama/Llama-3.2-3B-Instruct`. | |
| It is **not** a standalone full model checkpoint: you must load the **base** `Llama-3.2-3B-Instruct` model and then apply this adapter. | |
| ## How to use (Transformers) | |
| Prereqs: | |
| - You must be granted access to the gated base model on Hugging Face and be logged in (`huggingface-cli login`). | |
| Load: | |
| ```python | |
| import torch | |
| from transformers import AutoTokenizer, AutoModelForCausalLM | |
| from peft import PeftModel | |
| base_model = "meta-llama/Llama-3.2-3B-Instruct" | |
| adapter_repo = "akacaptain/dragonclaw_model" | |
| tok = AutoTokenizer.from_pretrained(adapter_repo) | |
| base = AutoModelForCausalLM.from_pretrained(base_model) | |
| model = PeftModel.from_pretrained(base, adapter_repo) | |
| model.eval() | |
| messages = [ | |
| {"role": "system", "content": "You are a helpful assistant."}, | |
| {"role": "user", "content": "Hello!"}, | |
| ] | |
| prompt = tok.apply_chat_template(messages, tokenize=False, add_generation_prompt=True) | |
| inputs = tok(prompt, return_tensors="pt") | |
| with torch.inference_mode(): | |
| out = model.generate( | |
| **inputs, | |
| max_new_tokens=128, | |
| do_sample=False, | |
| pad_token_id=tok.eos_token_id, | |
| ) | |
| print(tok.decode(out[0], skip_special_tokens=True)) | |
| ``` | |
| ## Training details | |
| - **Base model**: `meta-llama/Llama-3.2-3B-Instruct` | |
| - **Method**: LoRA / PEFT (see `adapter_config.json`) | |
| - **Training hardware**: NVIDIA RTX 4090 | |
| - **Approximate training duration**: ~10 minutes | |
| - **Data**: Fine-tuned on synthetically generated training data derived from the OpenClaw source code. | |
| ## Evaluation | |
| - **Automated evaluation**: TODO (or: not yet published) | |
| ## Limitations | |
| - Inherits the limitations and usage constraints of the base `Llama-3.2-3B-Instruct` model. | |
| - Synthetic training data can produce confident-sounding but incorrect configuration advice; always verify in real environments. | |
| ## License / attribution | |
| - Ensure your Hub license/settings match the **base model** requirements and your organization’s policy. | |