Upload README.md with huggingface_hub

970473c verified about 1 month ago

4.13 kB

	---
	language:
	- en
	license: apache-2.0
	library_name: transformers
	tags:
	- legal
	- immigration
	- assistant
	- qwen2
	- qwen2.5
	- fine-tuned
	- lora
	base_model: Qwen/Qwen2.5-3B-Instruct
	model_type: qwen2
	pipeline_tag: text-generation
	widget:
	- text: "What is an H-1B visa?"
	example_title: "H-1B Visa Question"
	- text: "How do I apply for a green card?"
	example_title: "Green Card Process"
	- text: "What documents do I need for an O-1 visa?"
	example_title: "O-1 Visa Requirements"
	datasets:
	- busybisi/dolores-training-data
	---

	# Dolores AI - Immigration Case Manager (Qwen 2.5 LoRA)

	Dolores is a specialized AI Immigration Case Manager fine-tuned using LoRA (Low-Rank Adaptation) on Qwen 2.5-3B-Instruct. Her mission is to de-mystify the complex immigration journey, breaking it down into manageable, actionable steps with high empathy and precision.

	## Model Details

	- Base Model: [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct)
	- Fine-tuning Method: LoRA (Low-Rank Adaptation)
	- LoRA Rank: 16
	- LoRA Alpha: 32
	- Training Data: Immigration law documents, case examples, and expert guidance
	- Use Case: Immigration consultation, visa guidance, document preparation
	- Model Size: ~3B parameters (LoRA adapter only)

	## Training Details

	### Training Configuration
	- Epochs: 3
	- Batch Size: 4 (effective: 16 with gradient accumulation)
	- Learning Rate: 2e-4
	- Quantization: 4-bit (QLoRA) during training
	- Max Sequence Length: 2048 tokens
	- Target Modules: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj

	### Training Data
	Fine-tuned on curated immigration law datasets including:
	- U.S. immigration policies and procedures
	- Visa types and requirements (H-1B, O-1, EB-1, etc.)
	- Green card processes
	- Case examples and expert guidance
	- Document preparation instructions

	## Usage

	This is a LoRA adapter that needs to be loaded with the base model. For production use, see the merged version: [JustiGuide/DoloresAI-Qwen25-Merged](https://huggingface.co/JustiGuide/DoloresAI-Qwen25-Merged)

	### Loading the LoRA Adapter

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer
	from peft import PeftModel
	import torch

	base_model_id = "Qwen/Qwen2.5-3B-Instruct"
	lora_adapter_id = "JustiGuide/DoloresAI-Qwen25"

	# Load base model
	model = AutoModelForCausalLM.from_pretrained(
	base_model_id,
	torch_dtype=torch.bfloat16,
	device_map="auto"
	)

	# Load LoRA adapter
	model = PeftModel.from_pretrained(model, lora_adapter_id)

	tokenizer = AutoTokenizer.from_pretrained(base_model_id)
	```

	### Inference Example

	```python
	system_prompt = "You are Dolores, a specialized AI Immigration Case Manager."

	question = "What is an H-1B visa and who qualifies for it?"

	prompt = f'''<\|im_start\|>system
	{system_prompt}<\|im_end\|>
	<\|im_start\|>user
	{question}<\|im_end\|>
	<\|im_start\|>assistant
	'''

	inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

	outputs = model.generate(
	**inputs,
	max_new_tokens=512,
	temperature=0.7,
	top_p=0.9,
	do_sample=True,
	repetition_penalty=1.1,
	)

	response = tokenizer.decode(outputs[0], skip_special_tokens=True)
	print(response)
	```

	## Deployment

	For production deployment, use the merged model: [JustiGuide/DoloresAI-Qwen25-Merged](https://huggingface.co/JustiGuide/DoloresAI-Qwen25-Merged)

	### HuggingFace Inference Endpoint
	- GPU: Nvidia L4 (24GB VRAM)
	- Scale to Zero: Enabled
	- Region: us-east-1

	## Performance

	- Inference Speed: ~10-20 tokens/second (on L4 GPU)
	- Context Length: Up to 2048 tokens
	- Quality: High accuracy on immigration-specific questions

	## Limitations

	- Provides general immigration guidance, not legal advice
	- Always consult with a licensed immigration attorney for specific cases
	- Trained primarily on U.S. immigration law
	- May not have information on very recent policy changes

	## License

	Apache 2.0 License (following base model's license)

	## Contact

	- Organization: JustiGuide
	- Website: https://justi.guide

	---

	Built with ❤️ by JustiGuide to make immigration more accessible