---
license: mit
base_model: microsoft/Phi-3-mini-4k-instruct
tags:
- phi-3
- lora
- payments
- finance
- information-extraction
- structured-data-extraction
- text-to-data
- finetuned
datasets:
- custom
language:
- en
pipeline_tag: text-generation
library_name: transformers
---

# Phi-3 Mini Reverse Fine-tuned for Payments Domain

This is a **reverse** fine-tuned version of [Microsoft's Phi-3-Mini-4k-Instruct](microsoft/Phi-3-mini-4k-instruct) model, adapted for extracting structured payment metadata from natural language descriptions using LoRA (Low-Rank Adaptation).

## Model Description

This model converts natural language payment descriptions into structured, machine-readable metadata. It performs the **opposite** task of the forward model - instead of generating human-friendly text, it extracts structured data that can be processed by payment APIs and applications.

### Related Models

**Forward Model (Companion):** [aamanlamba/phi3-payments-finetune](https://huggingface.co/aamanlamba/phi3-payments-finetune)
- Converts structured metadata → natural language
- Use together for round-trip validation

### Training Data

The model was trained on a dataset of 500+ synthetic payment transactions where:
- **Input**: Natural language payment descriptions
- **Output**: Structured metadata in `action(field[value], ...)` format

Transaction types covered:
- Standard payments (ACH, wire transfer, credit/debit card)
- Refunds (full and partial)
- Chargebacks and disputes
- Failed/declined transactions
- International transfers with currency conversion
- Transaction fees
- Recurring payments/subscriptions

### Example Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch

# Load base model
base_model = "microsoft/Phi-3-mini-4k-instruct"
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    torch_dtype=torch.float16,
    device_map="auto"
)

# Load LoRA adapters (reverse model)
model = PeftModel.from_pretrained(model, "aamanlamba/phi3-payments-reverse-finetune")
tokenizer = AutoTokenizer.from_pretrained(base_model, trust_remote_code=True)

# Extract structured data
prompt = """<|system|>
You are a financial data extraction assistant that converts natural language payment descriptions into structured metadata that can be processed by payment applications.<|end|>
<|user|>
Extract structured payment information from the following description:

Your payment of USD 1,500.00 to Global Supplies Inc via wire transfer was successfully completed on 2024-10-27.<|end|>
<|assistant|>
"""

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    outputs = model.generate(
        **inputs,
        max_new_tokens=200,
        temperature=0.3,  # Lower temperature for more deterministic extraction
        top_p=0.9,
        do_sample=True
    )

response = tokenizer.decode(outputs[0], skip_special_tokens=True)
structured_data = response.split("<|assistant|>")[-1].strip()
print(structured_data)
```

**Expected output:**
```
inform(transaction_type[payment], amount[1500.00], currency[USD], receiver[Global Supplies Inc], status[completed], method[wire_transfer], date[2024-10-27])
```

### Parsing the Output

```python
import re

def parse_structured_data(structured_str: str) -> dict:
    """Parse structured payment data into a dictionary"""
    action_match = re.match(r'(\w+)\((.*)\)', structured_str)
    if not action_match:
        return None

    action_type = action_match.group(1)
    fields_str = action_match.group(2)

    fields = {}
    field_pattern = r'(\w+)\[(.*?)\]'
    for match in re.finditer(field_pattern, fields_str):
        field_name = match.group(1)
        field_value = match.group(2)

        # Convert numeric values
        if field_name in ['amount', 'refund_amount', 'fee_amount', 'exchange_rate']:
            try:
                field_value = float(field_value)
            except ValueError:
                pass

        fields[field_name] = field_value

    return {
        'action_type': action_type,
        'fields': fields
    }

# Use it
parsed = parse_structured_data(structured_data)
print(parsed)
# Output: {'action_type': 'inform', 'fields': {'transaction_type': 'payment', 'amount': 1500.0, ...}}
```

## Training Details

### Training Configuration

- **Base Model**: microsoft/Phi-3-mini-4k-instruct
- **Fine-tuning Method**: LoRA (Low-Rank Adaptation)
- **Task Direction**: Natural Language → Structured Data (Reverse)
- **LoRA Rank**: 16
- **LoRA Alpha**: 32
- **Target Modules**: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
- **Quantization**: 8-bit (training), float16 (inference)
- **Training Epochs**: 3
- **Learning Rate**: 2e-4
- **Batch Size**: 1 (with 8 gradient accumulation steps)
- **Hardware**: NVIDIA RTX 3060 (12GB VRAM)
- **Training Time**: ~35-45 minutes

### Training Loss

- Initial Loss: ~3.5-4.0
- Final Loss: ~0.8-1.2
- Validation Loss: ~1.0-1.3
- Extraction Accuracy: ~90-95% on validation set

## Model Size

- **LoRA Adapter Size**: ~15MB (only the adapter weights, not the full model)
- **Full Model Size**: ~7GB (when combined with base model)

## Supported Transaction Types

1. **Payments**: Standard payment transactions with various methods
2. **Refunds**: Full and partial refunds
3. **Chargebacks**: Dispute and chargeback processing
4. **Failed Payments**: Declined or failed transactions with reasons
5. **International Transfers**: Cross-border payments with currency conversion
6. **Fees**: Transaction and processing fees
7. **Recurring Payments**: Subscriptions and scheduled payments
8. **Reversals**: Payment reversals and adjustments

## Output Format

The model extracts data in this structured format:
```
action_type(field1[value1], field2[value2], ...)
```

**Action Types:**
- `inform`: Informational transactions (payments, refunds, transfers)
- `alert`: Alerts and notifications (failures, chargebacks)

**Common Fields:**
- `transaction_type`: Type of transaction
- `amount`: Transaction amount (numeric)
- `currency`: Currency code (USD, EUR, GBP, etc.)
- `sender`/`receiver`/`merchant`: Party names
- `status`: Transaction status (completed, pending, failed, etc.)
- `method`: Payment method (credit_card, ACH, wire_transfer, etc.)
- `date`: Transaction date (YYYY-MM-DD)
- `reason`: Failure/chargeback reason (for alerts)

## Use Cases

### 1. Conversational Payment Interfaces
Extract payment details from user messages:
```
User: "I want to send $500 to John via PayPal"
Extracted: inform(transaction_type[payment], amount[500], currency[USD], receiver[John], method[PayPal])
```

### 2. Email Parsing
Extract transaction data from payment notification emails automatically.

### 3. Voice Payment Systems
Convert spoken payment descriptions into structured API calls.

### 4. Payment API Integration
Transform natural language payment requests into API-ready parameters.

## Limitations

- Trained on synthetic data - may require additional fine-tuning for production use
- Optimized for English language only
- Best performance on transaction patterns similar to training data
- Output format is custom - requires parsing (see example above)
- Not suitable for handling real financial transactions without validation
- Lower temperature (0.3) recommended for consistent extraction

## Ethical Considerations

- This model was trained on synthetic, anonymized data only
- Does not contain any real customer PII or transaction data
- Should be validated for accuracy before production deployment
- Implement validation and error handling for extracted data
- Consider regulatory compliance (PCI-DSS, GDPR, etc.) in your jurisdiction
- Always verify extracted financial data before processing

## Intended Use

**Primary Use Cases:**
- Extracting transaction data from natural language descriptions
- Building conversational payment bots
- Parsing payment notifications and emails
- Converting user requests to API parameters
- Training and demonstration purposes
- Research in financial NLP and information extraction

**Out of Scope:**
- Direct transaction processing without validation
- Real-time financial systems without error handling
- Compliance-critical data extraction
- Medical or legal payment processing

## Performance Notes

- **Inference Speed**: ~2-3 seconds per extraction on RTX 3060
- **Temperature**: Use 0.1-0.3 for deterministic extraction
- **Validation**: Always validate output format and field values
- **Error Handling**: Implement fallbacks for malformed outputs

## How to Cite

If you use this model in your research or application, please cite:

```bibtex
@misc{phi3-payments-reverse-finetuned,
  author = {aamanlamba},
  title = {Phi-3 Mini Reverse Fine-tuned for Payments Domain},
  year = {2024},
  publisher = {HuggingFace},
  howpublished = {\url{https://huggingface.co/aamanlamba/phi3-payments-reverse-finetune}}
}
```

## Training Code

The complete training code and dataset generation scripts are available on GitHub:
- **Repository**: [github.com/aamanlamba/phi3-tune-payments](https://github.com/aamanlamba/phi3-tune-payments)
- **Branch**: `reverse-structured-extraction` (this model)
- **Includes**: Reverse dataset generator, training scripts, testing utilities, parsing examples

## Acknowledgements

- Base model: [Microsoft Phi-3-Mini-4k-Instruct](microsoft/Phi-3-mini-4k-instruct)
- Fine-tuning method: [LoRA: Low-Rank Adaptation of Large Language Models](https://arxiv.org/abs/2106.09685)
- Training framework: HuggingFace Transformers + PEFT
- Inspired by: [NVIDIA AI Workbench Phi-3 Fine-tuning Example](https://github.com/NVIDIA/workbench-example-phi3-finetune)

## License

This model is released under the MIT license, compatible with the base Phi-3 model license.

## Contact

For questions or issues, please open an issue on the GitHub repository or contact the author.

---

**Note**: This is a **reverse** model for structured data extraction. For generating natural language from structured data, see the companion forward model.