--- license: bigscience-openrail-m base_model: - meta-llama/Llama-3.2-3B - Qwen/Qwen2.5-3B-Instruct --- The following example illustrates how to load the FineEdit-XL adapter for inference. For FineEdit-Pro, we use Qwen2.5-3B-Instruct as the base model. ```python from transformers import AutoModelForCausalLM, AutoTokenizer from peft import PeftModel import torch ## Base Model base_model = "meta-llama/Llama-3.2-3B" ## Lora adapter_model = "YimingZeng/FineEdit_Model" subfolder = "FineEdit-XL" ## Load tokenizer and base model tokenizer = AutoTokenizer.from_pretrained(base_model) base = AutoModelForCausalLM.from_pretrained( base_model, torch_dtype=torch.bfloat16, device_map="auto" ) ## Load LoRA adapter model = PeftModel.from_pretrained(base, adapter_model, subfolder=subfolder) ## Test prompt = """Edit Request: Please change 'Captain American' to 'Iron Man'. Original Content: 'Captain American' is a superhero. Edited Content: """ inputs = tokenizer(prompt, return_tensors="pt").to(model.device) with torch.no_grad(): outputs = model.generate(**inputs, max_new_tokens=100) print(tokenizer.decode(outputs[0], skip_special_tokens=True)) ```