tags:
  - lora
  - structured-output
---
# qwen3-4b-structured-output-lora

## Model Outline
This repository provides a **LoRA adapter** fine-tuned from
**Qwen3-4B-Instruct-2507** using **QLoRA (4-bit, Unsloth)**.

It contains **LoRA adapter weights only**; the base model must be loaded separately.
## Training Objective

This adapter is trained to improve **structured output accuracy**
(JSON / YAML / XML / TOML / CSV).

Loss is applied only to the final assistant output,
while intermediate reasoning (Chain-of-Thought) is masked.
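The masking above can be sketched in a few lines: tokens before the final answer get the label `-100`, which PyTorch's cross-entropy loss ignores. This is a minimal illustration, not the repository's actual preprocessing code; `answer_start` is a hypothetical index marking where the final assistant output begins.

```python
IGNORE_INDEX = -100  # ignored by PyTorch cross-entropy loss

def mask_labels(input_ids, answer_start, ignore_index=IGNORE_INDEX):
    """Copy input_ids to labels, masking everything before the final answer."""
    labels = list(input_ids)
    for i in range(min(answer_start, len(labels))):
        labels[i] = ignore_index
    return labels

# Only the last two tokens (the final answer) contribute to the loss:
labels = mask_labels([101, 7, 8, 9, 102], answer_start=3)
# labels == [-100, -100, -100, 9, 102]
```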
## Training Configuration

- Base model: Qwen3-4B-Instruct-2507
- Method: QLoRA (4-bit)
- Max sequence length: 512
- Epochs: 1
- Learning rate: 5e-6
- LoRA: r=64, alpha=128
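With PEFT's standard scaling, the listed hyperparameters imply each LoRA update is scaled by `alpha / r`; nothing here is repo-specific beyond the two values above.

```python
# LoRA adds a low-rank update to each target weight: W + (alpha / r) * B @ A.
# With r=64 and alpha=128, the update is scaled by a factor of 2.0.
r, lora_alpha = 64, 128
scaling = lora_alpha / r
print(scaling)  # 2.0
```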
## Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch

base = "Qwen/Qwen3-4B-Instruct-2507"
adapter = "your_id/your-repo"

tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(
    base,
    torch_dtype=torch.float16,
    device_map="auto",
)
model = PeftModel.from_pretrained(model, adapter)

# Example prompt (illustrative): request structured output via the chat template
messages = [{"role": "user", "content": "Return {\"status\": \"ok\"} as JSON."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```
## Sources & Terms (IMPORTANT)

- Training data: u-10bei/structured_data_with_cot_dataset_512_v2
- Dataset license: MIT License. This dataset is used and distributed under the terms of the MIT License.
- Compliance: users must comply with the MIT License (including its copyright-notice requirement) and with the base model's original terms of use.