hcup-llm-humanizer / README.md
Sharpener9290's picture
Updated LoRA adapter from HCUP_LLM training pipeline
30c2b7d verified
# HCUP LLM Humanizer Adapter
## Model Overview
- **Base Model:** mistralai/Mistral-7B-v0.3
- **Adapter Type:** LoRA (QLoRA 4-bit)
- **Purpose:** Humanized, non-AI-sounding clinical manuscript generation from HCUP administrative data
## Training Data
- **400 instruction pairs** from HCUP corpus (trinetx, nis, neds, hcup_general)
- **50 DPO preference pairs** for humanization style transfer
## Humanization Rules
- AVOID: Furthermore, It is important to note, In conclusion, Delve into, Notably
- USE: short declarative sentences, clinical connectors (Then, But, So)
- TONE: confident, direct, patient-centered clinical framing
## LoRA Config
- rank=16, alpha=32
- target_modules: q_proj, v_proj
- dropout: 0.05
## Usage
```python
from peft import PeftModel, AutoModelForCausalLM, AutoTokenizer
import torch
base = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.3", torch_dtype=torch.float16, device_map="auto")
model = PeftModel.from_pretrained(base, "Sharpener9290/hcup-llm-humanizer")
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.3")
```