---
license: apache-2.0
datasets:
- u-10bei/structured_data_with_cot_dataset_512_v2
language:
- en
base_model:
- Qwen/Qwen3-4B-Instruct-2507
pipeline_tag: text-generation
library_name: peft
tags:
- qlora
- lora
- structured-output
---
# qwen3-4b-structured-output-lora

This repository provides a **LoRA adapter** fine-tuned from
**Qwen3-4B-Instruct-2507** using **QLoRA (4-bit, Unsloth)**.

This repository contains **LoRA adapter weights only**.
The base model must be loaded separately.

## Training Objective

This adapter is trained to improve **structured output accuracy**
(JSON / YAML / XML / TOML / CSV).

Loss is applied only to the final assistant output,
while intermediate reasoning (Chain-of-Thought) is masked.
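
The exact training code is not included here, but the masking scheme above is commonly implemented by setting the labels of all non-answer tokens to `-100`, which PyTorch's cross-entropy loss ignores. A minimal sketch (the token values and `answer_start` handling are illustrative, not the actual pipeline):

```python
# Hypothetical sketch of completion-only loss masking.
# Tokens before the final assistant answer (prompt + chain-of-thought)
# get label -100, so cross-entropy ignores them and loss is computed
# only on the final structured output.

def mask_labels(input_ids, answer_start):
    """input_ids: list[int]; answer_start: index where the final answer begins."""
    labels = list(input_ids)
    for i in range(answer_start):
        labels[i] = -100  # ignored by CrossEntropyLoss(ignore_index=-100)
    return labels

prompt = [101, 102, 103]  # prompt + reasoning tokens (masked)
answer = [201, 202]       # final structured-answer tokens (kept)
labels = mask_labels(prompt + answer, answer_start=len(prompt))
# labels == [-100, -100, -100, 201, 202]
```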

## Training Configuration

- Base model: Qwen3-4B-Instruct-2507
- Method: QLoRA (4-bit)
- Max sequence length: 512
- Epochs: 1
- Learning rate: 1e-6
- LoRA: r=64, alpha=128
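
A PEFT config mirroring the hyperparameters above might look like the following. This is a reconstruction, not the actual training script: `target_modules`, `lora_dropout`, and `bias` are assumptions (typical choices for Qwen-style attention layers).

```python
from peft import LoraConfig

# Hypothetical LoRA config matching the listed hyperparameters (r=64, alpha=128).
# target_modules is an assumption; the actual run may have used a different set.
lora_config = LoraConfig(
    r=64,
    lora_alpha=128,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_dropout=0.0,
    bias="none",
    task_type="CAUSAL_LM",
)
```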

## Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch

base = "Qwen/Qwen3-4B-Instruct-2507"
adapter = "your_id/your-repo"

tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(
    base,
    torch_dtype=torch.float16,
    device_map="auto",
)
model = PeftModel.from_pretrained(model, adapter)
```
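
Since the adapter targets structured output, downstream code typically validates the generated text before using it. A minimal sketch of such a check for the JSON case (`parse_json_output` is a hypothetical helper, not part of this repository):

```python
import json

# Hypothetical post-processing check for model output claimed to be JSON.
def parse_json_output(text):
    """Return the parsed object, or None if the text is not valid JSON."""
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        return None

assert parse_json_output('{"name": "Qwen", "size_b": 4}') == {"name": "Qwen", "size_b": 4}
assert parse_json_output("not json") is None
```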
## Sources & Terms (IMPORTANT)

Training data: u-10bei/structured_data_with_cot_dataset_512_v2

Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.

Compliance: Users must comply with the MIT License (including preservation of the copyright notice) and with the base model's original terms of use.