---
license: mit
base_model: microsoft/Phi-3-mini-4k-instruct
tags:
- phi-3
- lora
- payments
- finance
- natural-language-generation
- finetuned
datasets:
- custom
language:
- en
pipeline_tag: text-generation
library_name: transformers
---
# Phi-3 Mini Fine-tuned for Payments Domain

This is a fine-tuned version of [Microsoft's Phi-3-Mini-4k-Instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct), adapted with LoRA (Low-Rank Adaptation) to generate natural language descriptions of payment transactions.
## Model Description

This model converts structured payment transaction data into clear, customer-friendly language. It was fine-tuned with LoRA on a synthetic payments dataset covering a variety of transaction types.
### Training Data

The model was trained on a dataset of 500+ synthetic payment transactions, including:

- Standard payments (ACH, wire transfer, credit/debit card)
- Refunds (full and partial)
- Chargebacks
- Failed/declined transactions
- International transfers with currency conversion
- Transaction fees
- Recurring payments/subscriptions
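Each training example pairs a structured meaning representation with a target description. The exact dataset schema is not published in this card; the pair below is purely illustrative, written in the same `inform(...)` format used in the usage example that follows (the field names in it are hypothetical):

```
Input:  inform(transaction_type[refund], amount[49.99], currency[USD], receiver[Jane Doe], status[completed])
Output: Your refund of $49.99 to Jane Doe has been successfully completed.
```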
### Example Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load base model
base_model = "microsoft/Phi-3-mini-4k-instruct"
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    torch_dtype="auto",
    device_map="auto"
)

# Load LoRA adapters
model = PeftModel.from_pretrained(model, "aamanlamba/phi3-payments-finetune")
tokenizer = AutoTokenizer.from_pretrained(base_model)

# Build a prompt using Phi-3's chat-format markers
prompt = """<|system|>
You are a financial services assistant that explains payment transactions in clear, customer-friendly language.<|end|>
<|user|>
Convert the following structured payment information into a natural explanation:

inform(transaction_type[payment], amount[1500.00], currency[USD], sender[Acme Corp], receiver[Global Supplies Inc], status[completed], method[ACH], date[2024-10-27])<|end|>
<|assistant|>
"""

# Generate description (do_sample=True so temperature takes effect)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=150, do_sample=True, temperature=0.7)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
```
Expected output:

```
Your ACH payment of $1,500.00 to Global Supplies Inc was successfully completed on October 27, 2024.
```
## Training Details

### Training Configuration

- **Base Model**: microsoft/Phi-3-mini-4k-instruct
- **Fine-tuning Method**: LoRA (Low-Rank Adaptation)
- **LoRA Rank**: 16
- **LoRA Alpha**: 32
- **Target Modules**: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
- **Quantization**: 8-bit (training), float16 (inference)
- **Training Epochs**: 3
- **Learning Rate**: 2e-4
- **Batch Size**: 1 (with 8 gradient accumulation steps)
- **Hardware**: NVIDIA RTX 3060 (12GB VRAM)
- **Training Time**: ~35-45 minutes
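The hyperparameters above map onto the PEFT/Transformers APIs roughly as follows. This is an illustrative sketch, not the exact training script (see the GitHub repository linked below); values not stated in this card, such as `lora_dropout`, are assumptions.

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig, TrainingArguments
from peft import LoraConfig, get_peft_model

# Load the base model with 8-bit quantization for training
bnb_config = BitsAndBytesConfig(load_in_8bit=True)
model = AutoModelForCausalLM.from_pretrained(
    "microsoft/Phi-3-mini-4k-instruct",
    quantization_config=bnb_config,
    device_map="auto",
)

# LoRA configuration matching the values listed above
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    lora_dropout=0.05,  # assumption: not stated in this card
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# Optimizer/schedule settings from the configuration list
training_args = TrainingArguments(
    output_dir="phi3-payments-finetune",
    num_train_epochs=3,
    learning_rate=2e-4,
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
)
```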
### Training Loss

- Initial Loss: ~3.5-4.0
- Final Loss: ~0.9-1.2
- Validation Loss: ~1.0-1.3
## Model Size

- **LoRA Adapter Size**: ~15MB (adapter weights only, not the full model)
- **Full Model Size**: ~7GB (when combined with the base model)
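If you want a standalone checkpoint of roughly that full size, the adapter can be merged into the base weights so inference no longer requires `peft`. A minimal sketch (the output directory name is arbitrary):

```python
from transformers import AutoModelForCausalLM
from peft import PeftModel

# Merge the LoRA adapter into the base model weights
base = AutoModelForCausalLM.from_pretrained(
    "microsoft/Phi-3-mini-4k-instruct", torch_dtype="auto"
)
merged = PeftModel.from_pretrained(
    base, "aamanlamba/phi3-payments-finetune"
).merge_and_unload()
merged.save_pretrained("phi3-payments-merged")
```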
## Supported Transaction Types

1. **Payments**: Standard payment transactions
2. **Refunds**: Full and partial refunds
3. **Chargebacks**: Dispute and chargeback processing
4. **Failed Payments**: Declined or failed transactions with reasons
5. **International Transfers**: Cross-border payments with currency conversion
6. **Fees**: Transaction and processing fees
7. **Recurring Payments**: Subscriptions and scheduled payments
8. **Reversals**: Payment reversals and adjustments
## Limitations

- Trained on synthetic data; may require additional fine-tuning for production use
- Optimized for English only
- Performs best on transaction patterns similar to the training data
- Not suitable for handling real financial transactions without human oversight
- Should not be used as the sole system for financial communication
## Ethical Considerations

- This model was trained on synthetic, anonymized data only
- It does not contain any real customer PII or transaction data
- Validate outputs for accuracy before production deployment
- Implement human review for customer-facing financial communications
- Consider regulatory compliance (PCI-DSS, GDPR, etc.) in your jurisdiction
## Intended Use

**Primary Use Cases:**
- Generating transaction descriptions for internal systems
- Creating customer-friendly payment notifications
- Automating payment communication drafts (with human review)
- Training and demonstration purposes
- Research in financial NLP

**Out of Scope:**
- Direct customer communication without review
- Real-time transaction processing without validation
- Compliance-critical communications
- Medical or legal payment descriptions
## How to Cite

If you use this model in your research or application, please cite:

```bibtex
@misc{phi3-payments-finetuned,
  author = {aamanlamba},
  title = {Phi-3 Mini Fine-tuned for Payments Domain},
  year = {2024},
  publisher = {HuggingFace},
  howpublished = {\url{https://huggingface.co/aamanlamba/phi3-payments-finetune}}
}
```
## Training Code

The complete training code and dataset generation scripts are available on GitHub:

- **Repository**: [github.com/aamanlamba/phi3-tune-payments](https://github.com/aamanlamba/phi3-tune-payments)
- **Includes**: Dataset generator, training scripts, testing utilities, and deployment guides
## Acknowledgements

- Base model: [Microsoft Phi-3-Mini-4k-Instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct)
- Fine-tuning method: [LoRA: Low-Rank Adaptation of Large Language Models](https://arxiv.org/abs/2106.09685)
- Training framework: HuggingFace Transformers + PEFT
- Inspired by: [NVIDIA AI Workbench Phi-3 Fine-tuning Example](https://github.com/NVIDIA/workbench-example-phi3-finetune)
## License

This model is released under the MIT license, compatible with the base Phi-3 model license.
## Contact

For questions or issues, please open an issue on the model repository or contact the author.
---

**Note**: This is a demonstration model. Always validate outputs for accuracy before use in production financial systems.